IAES International Journal of Artificial Intelligence (IJ-AI)
Vol 12, No 2: June 2023

Automated invoice data extraction using image processing

Akanksh Aparna Manjunath (R V College of Engineering)
Manjunath Sudhakar Nayak (R V College of Engineering)
Santhanam Nishith (R V College of Engineering)
Satish Nitin Pandit (R V College of Engineering)
Shreyas Sunkad (R V College of Engineering)
Pratiba Deenadhayalan (R V College of Engineering)
Shobha Gangadhara (R V College of Engineering)



Article Info

Publish Date
01 Jun 2023

Abstract

Manually processing invoices which are in the form of scanned photocopies is a time-consuming process. There is a need to automate the task of extraction of data from the invoices with a similar format. In this paper we investigate and analyse various techniques of image processing and text extraction to improve the results of the optical character recognition (OCR) engine, which is applied to extract the text from the invoice. This paper also proposes the design and implementation of a web enabled invoice processing system (IPS). The IPS consists of an annotation tool and an extraction tool. The annotation tool is used to mark the fields of interest in the invoice which are to be extracted. The extraction tool makes use of opensource computer vision library (OpenCV) algorithms to detect text. The proposed system was tested on more than 25 types of invoices with the average accuracy score lying between 85% and 95%. Finally, to provide ease of use, a web application is developed which also presents the results in a structured format. The entire system is designed so as to provide flexibility and automate the process of extracting details of interest from the invoices.

Copyrights © 2023






Journal Info

Abbrev

IJAI

Publisher

Subject

Computer Science & IT Engineering

Description

IAES International Journal of Artificial Intelligence (IJ-AI) publishes articles in the field of artificial intelligence (AI). The scope covers all artificial intelligence area and its application in the following topics: neural networks; fuzzy logic; simulated biological evolution algorithms (like ...