Jurnal Teknik Informatika (JUTIF)
Vol. 6 No. 5 (2025): JUTIF Volume 6, Number 5, October 2025

An Integrated Pipeline with Hierarchical Segmentation and CNN for Automated KTP-el Data Extraction on the e-Magang Platform

Syafrie Rahardian, Nuansa
Maryanto, Eddy
Nawangnugraeni, Devi Astri



Article Info

Publish Date
16 Oct 2025

Abstract

In alignment with Indonesia's digital transformation agenda, this research addresses the inefficient and error-prone manual data entry on the e-Magang platform of the Foreign Policy Strategy Agency (BSKLN). The study introduces an end-to-end Optical Character Recognition (OCR) pipeline designed for structured identity documents and for integration with a real-world government platform. The proposed workflow comprises image preprocessing with histogram matching, hierarchical segmentation using vertical projection, and postprocessing that structures the recognized output. To mitigate the limitations of a small dataset, three specialized Convolutional Neural Network (CNN) models were trained and validated using stratified 5-fold cross-validation. The final system was integrated by connecting a Flask-based model engine to the existing Laravel and React platform. End-to-end testing achieved an average character-reading accuracy of 93.31% with a mean processing time of 14.48 seconds per image. The primary contribution of this research to the field of informatics is a complete, deployable system architecture that supports data interoperability and reliability, providing a practical blueprint for integrating intelligent automation into digital public services.
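
To make the segmentation step concrete, the following is a minimal sketch of character splitting by vertical projection, written in plain Python. It is not the authors' implementation: the abstract does not describe how the hierarchy of regions, lines, and characters is built, so only the column-splitting idea is shown. The function names `vertical_projection` and `split_columns`, the `min_gap` parameter, and the tiny binary image are all assumptions made for illustration.

```python
from typing import List, Tuple


def vertical_projection(binary: List[List[int]]) -> List[int]:
    """Count foreground pixels (value 1) in each column of a binary image."""
    if not binary:
        return []
    width = len(binary[0])
    return [sum(row[x] for row in binary) for x in range(width)]


def split_columns(profile: List[int], min_gap: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) column spans, end exclusive.

    A span closes once at least `min_gap` consecutive empty columns follow it.
    `min_gap` is a hypothetical tuning parameter, not a value from the paper.
    """
    spans: List[Tuple[int, int]] = []
    start = None  # first column of the span currently being built
    gap = 0       # number of consecutive empty columns seen inside the span
    for x, count in enumerate(profile):
        if count > 0:
            if start is None:
                start = x
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                # The span ends just before the run of empty columns.
                spans.append((start, x - gap + 1))
                start = None
                gap = 0
    if start is not None:
        # Close a span that reaches the right edge, trimming trailing blanks.
        spans.append((start, len(profile) - gap))
    return spans


# Hypothetical 3x7 binary text line containing two glyphs.
line_image = [
    [0, 1, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 1, 0, 0, 1, 0],
]

profile = vertical_projection(line_image)
spans = split_columns(profile)
print(profile)  # [0, 3, 2, 0, 0, 3, 0]
print(spans)    # [(1, 3), (5, 6)]
```

Each returned span could then be cropped and passed to a character classifier. Applying the same idea to row sums (a horizontal projection) is one common way to separate text lines before splitting characters, although the abstract does not say whether the paper does this.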

Copyrights © 2025






Journal Info

Abbrev

jurnal

Subject

Computer Science & IT

Description

Jurnal Teknik Informatika (JUTIF) is an Indonesian national journal that publishes high-quality research papers in the broad fields of Informatics, Information Systems, and Computer Science, which encompass software engineering, information system development, computer systems, computer networks, ...