End to End Invoice Processing Application Based on Key Fields Extraction

dc.authorid0000-0003-3286-5159tr
dc.contributor.authorArslan, Halil
dc.date.accessioned2023-06-22T07:18:27Z
dc.date.available2023-06-22T07:18:27Z
dc.date.issued2022tr
dc.departmentMühendislik Fakültesitr
dc.description.abstractIn this paper, an automatic invoice processing system, which is in great demand among private and public companies, was proposed. The proposed system supports all invoice file types that can be submitted by companies. Companies can easily submit invoices to the system via the web interface or email, and all invoices submitted to the system are queued and processed sequentially. If the invoice is a text file, the invoice information is extracted from the text by using template matching. If the invoice is an image, the text and table areas are detected and extracted. For table detection, we used both image processing based and YOLOv5-based deep learning method. Cell extraction was then performed from the extracted table images. As a result of these processes, all text and table cells were obtained as images and these images were converted into machine-readable text using the open-source software Tesseract OCR. Tesseract already provides trained models for English and Turkish. However, these models do not provide successful results for invoices submitted by companies in Turkish. Therefore, the new fine-tuned model trained with invoices in Turkish was used for OCR. The experimental results showed that the trained Turkish model was more accurate than the Turkish and English models provided by Tesseract. In addition, the YOLOv5-based table detection model was more accurate than the image-processing-based table detection method.tr
dc.identifier.citationArslan, H. (2022). End to End Invoice Processing Application Based on Key Fields Extraction. IEEE Access, 10, 78398-78413.tr
dc.identifier.doi10.1109/access.2022.3192828en_US
dc.identifier.endpage78413tr
dc.identifier.scopus2-s2.0-85135216628en_US
dc.identifier.scopusqualityN/A
dc.identifier.startpage78398tr
dc.identifier.urihttps://ieeexplore.ieee.org/document/9834945
dc.identifier.urihttps://hdl.handle.net/20.500.12418/13916
dc.identifier.volume10tr
dc.identifier.wosWOS:000832950300001en_US
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.language.isoenen_US
dc.publisherIEEEtr
dc.relation.ispartofIEEE ACCESSen_US
dc.relation.publicationcategoryUluslararası Hakemli Dergide Makale - Kurum Öğretim Elemanıtr
dc.rightsinfo:eu-repo/semantics/openAccesstr
dc.subjectInvoice processingtr
dc.subjectkey fields extractiontr
dc.subjecttext detectiontr
dc.subjectdeep learningtr
dc.subjecttable extractiontr
dc.subjectoptical character recognitiontr
dc.titleEnd to End Invoice Processing Application Based on Key Fields Extractionen_US
dc.typeArticleen_US

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
End_to_End_Invoice_Processing_Application_Based_on_Key_Fields_Extraction.pdf
Boyut:
1.62 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Lisans paketi
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: