Bulletin of Electrical Engineering and Informatics
Vol 15, No 2: April 2026

Fine-tuned LayoutLMv3 for Indonesian receipts extraction

Sudana, Oka
Wirdiani, Ayu
Winama Putra, Andre Dwi



Article Info

Publish Date
01 Apr 2026

Abstract

Every purchase generates a transaction record in the form of a payment receipt. A receipt is typically a small slip of paper that is easily lost, so storing its transaction information digitally is essential: digital records are easy to access and do not depend on the paper copy. At present, receipt information is still transferred into digital form manually, and a system that extracts it automatically would speed up digitization considerably. This research proposes a method that fine-tunes the LayoutLMv3 model and, with the help of optical character recognition (OCR) from Google Vision, extracts the transaction information contained in a receipt. The system uses Google Vision to parse and segment every word on the receipt together with its bounding box. The LayoutLMv3 model then assigns a label to each word, and the words labeled as important are extracted. The fine-tuned LayoutLMv3 model achieved an accuracy of 97.98% on training data and 90% accuracy in real-time test scenarios for extracting information from receipts written in Indonesian.
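The pipeline described in the abstract (OCR words with bounding boxes, per-word labels, extraction of the labeled words) can be illustrated with a minimal sketch. It is not the authors' implementation: the OCR output, the label names (`STORE_NAME`, `TOTAL`, `O`), and the helpers `normalize_box` and `group_fields` are hypothetical, and the model's predictions are replaced by a hard-coded list. The only convention taken from the LayoutLM model family is that bounding boxes are scaled to a 0-1000 grid before being passed to the model.

```python
from collections import defaultdict


def normalize_box(box, width, height):
    """Scale a pixel box (x0, y0, x1, y1) to the 0-1000 grid used by LayoutLM-family models."""
    x0, y0, x1, y1 = box
    return [
        int(1000 * x0 / width),
        int(1000 * y0 / height),
        int(1000 * x1 / width),
        int(1000 * y1 / height),
    ]


def group_fields(words, labels, ignore_label="O"):
    """Join words that share a label into one string per field, keeping reading order."""
    fields = defaultdict(list)
    for word, label in zip(words, labels):
        if label != ignore_label:
            fields[label].append(word)
    return {label: " ".join(parts) for label, parts in fields.items()}


# Hypothetical OCR output for a 500 x 1000 pixel receipt image (assumed data).
ocr_words = [
    {"text": "TOKO", "box": (50, 20, 150, 60)},
    {"text": "MAJU", "box": (160, 20, 260, 60)},
    {"text": "TOTAL", "box": (50, 900, 150, 940)},
    {"text": "25.000", "box": (300, 900, 450, 940)},
]
words = [w["text"] for w in ocr_words]
boxes = [normalize_box(w["box"], 500, 1000) for w in ocr_words]

# Stand-in for the fine-tuned model's per-word predictions (label names are assumed).
predicted_labels = ["STORE_NAME", "STORE_NAME", "O", "TOTAL"]

receipt = group_fields(words, predicted_labels)
print(boxes[0])
print(receipt)
```

In a real system, `words` and `boxes` would come from the OCR service and `predicted_labels` from the fine-tuned token-classification model; only the grouping step would stay as shown.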

Copyrights © 2026






Journal Info

Abbrev

EEI

Subject

Electrical & Electronics Engineering

Description

Bulletin of Electrical Engineering and Informatics (Buletin Teknik Elektro dan Informatika) ISSN: 2089-3191, e-ISSN: 2302-9285 is open to submission from scholars and experts in the wide areas of electrical, electronics, instrumentation, control, telecommunication and computer engineering from the ...