Garuda - Garba Rujukan Digital

Article Per Year (5 Year)

p-Index From 2021 - 2026

0.23

P-Index

This Author published in this journals

All Journal Jurnal Teknologi Informatika dan Komputer

Samosir, Masniari

Unknown Affiliation

Author-ID : 9721337

Computer Science & IT Electrical & Electronics Engineering

Published : 1 Documents Claim Missing Document

Claim Missing Document

Articles

Comparison of Faster R-CNN and YOLO v12 on Passport Text Extraction Based on Optical Character Recognition Samosir, Masniari; Anggai, Sajarwo; Taryo, Taswanda
Jurnal Teknologi Informatika dan Komputer Vol. 12 No. 1 (2026): Jurnal Teknologi Informatika dan Komputer
Publisher : Universitas Mohammad Husni Thamrin

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.37012/jtik.v12i1.3307

Current developments in information technology are driving the need for digitalization of official identity documents, including passports, to improve service efficiency and reduce reliance on manual processes. The digitalization of official identity documents such as passports still faces efficiency and accuracy challenges due to manual data entry processes. This study aims to compare the performance of Faster R-CNN and YOLO v12 in an automatic text extraction system based on Optical Character Recognition (OCR). The research employed an experimental method with a comparative approach using 31 preprocessed passport images. YOLO v12 was integrated with EasyOCR, while Faster R-CNN was combined with a PyTorch-based OCR module. The evaluation metrics included mAP, Character Accuracy Rate (CAR), Word Error Rate (WER), F1-score, and inference time. The results indicate that YOLO v12 outperforms Faster R-CNN in object detection, achieving an mAP@50 of 95.0% and mAP@50–95 of 90.0%, compared to 93.0% and 89.0%, respectively. In terms of text extraction accuracy, Faster R-CNN achieved a CAR of 50.01% and an F1-score of 55.75%, slightly higher than YOLO v12 with a CAR of 47.72% and an F1-score of 53.84%. However, YOLO v12 produced a lower WER and faster inference time of 2.4202 seconds (0.45 FPS). The findings suggest that YOLO v12 excels in efficiency and detection performance, while Faster R-CNN performs better in specific text extraction accuracy.

Co-Authors Anggai, Sajarwo Taswanda Taryo

Title

Found 1 Documents
Search

Abstract

Title Search

Found 1 Documents Search

Abstract

Title

Found 1 Documents
Search