Muhammad Noko Darpito
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Comparative Evaluation of Thresholding Methods for Optimized Digital Document Parsing Accuracy Muhammad Noko Darpito; Kartika Firdausy; Abdul Fadlil
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Vol 10 No 2 (2026): April 2026
Publisher : Ikatan Ahli Informatika Indonesia (IAII)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29207/resti.v10i2.6982

Abstract

Automated parsing of semi-structured documents has become increasingly important, particularly in standardized formats like SATS-LN, which contain fixed-layout fields such as permit number, addresses, validity period, and item types. This study investigates the impact of two thresholding methods Otsu and Sauvola on object detection accuracy using Faster R-CNN with Detectron2. A dataset of 200 SATS-LN documents, captured via scanner and camera, was augmented into 3,600 images and labeled for seven key fields. Image quality was evaluated using PSNR, SSIM, and MSE, while detection performance was measured through mAP, AP50, AP75, AR@100, precision, recall, and F1-score. Results showed that Sauvola preserved structural layout more effectively (SSIM: 0.76 for scanner, 0.47 for camera), although Otsu achieved higher PSNR on scanned images. Sauvola attained the highest macro and weighted F1-score (0.998), with near-perfect label detection and consistent performance across augmentations. Overall, Sauvola is more reliable for enhancing segmentation and detection in layout-based document processing.