Siti Nikmat
Unknown Affiliation

Published: 1 document

Studi Komparatif YOLOv8 dan Vision Transformer dalam Deteksi Kendaraan Ekstrem (A Comparative Study of YOLOv8 and Vision Transformer in Extreme Vehicle Detection)
Nurul Jamila; Angga Saputra; Siti Nikmat; Hifni Khakim
Prosiding SISFOTEK Vol 9 No 1 (2025): SISFOTEK IX 2025
Publisher: Ikatan Ahli Informatika Indonesia

Abstract

A core challenge in object detection for autonomous systems is maintaining accuracy across extreme object scales, particularly for small, distant targets. This study quantitatively compares two deep learning architectures: the CNN-based YOLOv8-m and the Vision Transformer (ViT)-based YOLOS. Both models were implemented and evaluated on a custom vehicle detection dataset. YOLOv8-m was trained from scratch, while YOLOS was evaluated with a proxy precision method on a pre-trained model to gauge its inherent capability for contextual reasoning. The results, analyzed using Mean Average Precision (mAP) broken down by object scale (mAP_S, mAP_M, and mAP_L for small, medium, and large objects), reveal a significant architectural trade-off. YOLOv8 achieved superior overall performance and excelled in mAP_L, affirming the strength of CNNs in local feature extraction. Conversely, YOLOS showed higher precision on small objects (mAP_S), suggesting that the global attention mechanism of ViT is more effective for long-range surveillance, where objects occupy only a few pixels. This research provides evidence-based guidance for selecting a detection architecture according to the target object scale and application scenario.
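
The scale-wise breakdown described in the abstract (small, medium, large) can be illustrated with a minimal sketch. It is not the authors' evaluation code. It assumes the COCO convention for size buckets (area below 32² pixels is small, below 96² is medium, otherwise large), which the abstract does not confirm, and it computes plain precision at a single IoU threshold rather than full mAP. The function names `scale_bucket`, `iou`, and `precision_by_scale` are hypothetical.

```python
from collections import defaultdict

# Assumed COCO-style area cut-offs in square pixels; the abstract does not
# state which thresholds the study actually used.
SMALL_MAX = 32 ** 2
MEDIUM_MAX = 96 ** 2


def scale_bucket(box):
    """Return 'S', 'M', or 'L' for a box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    area = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    if area < SMALL_MAX:
        return "S"
    if area < MEDIUM_MAX:
        return "M"
    return "L"


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def precision_by_scale(predictions, ground_truth, iou_threshold=0.5):
    """Precision per size bucket for one image and one class.

    predictions: list of (box, score); ground_truth: list of boxes.
    Predictions are visited in descending score order and greedily matched
    to the best still-unmatched ground-truth box. Each prediction is
    bucketed by its own box area.
    """
    matched = set()
    true_pos = defaultdict(int)
    total = defaultdict(int)
    for box, _score in sorted(predictions, key=lambda p: p[1], reverse=True):
        bucket = scale_bucket(box)
        total[bucket] += 1
        best_iou, best_idx = 0.0, None
        for idx, gt_box in enumerate(ground_truth):
            if idx in matched:
                continue
            overlap = iou(box, gt_box)
            if overlap > best_iou:
                best_iou, best_idx = overlap, idx
        if best_idx is not None and best_iou >= iou_threshold:
            matched.add(best_idx)
            true_pos[bucket] += 1
    return {bucket: true_pos[bucket] / total[bucket] for bucket in total}
```

Calling `precision_by_scale` with a list of scored boxes and the matching ground-truth boxes returns a dictionary keyed by `"S"`, `"M"`, and `"L"`. A full mAP figure would additionally average precision over recall levels (and, in the COCO protocol, over several IoU thresholds), which this sketch omits for brevity.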