Transformer models have significantly advanced deep learning by introducing parallel processing and enabling the modeling of long-range dependencies. Despite their performance gains, their high computational and memory demands hinder deployment in resource-constrained environments such as edge devices and real-time systems. This review analyzes and compares Transformer architectures by categorizing them into encoder-only, decoder-only, and encoder-decoder variants and by examining their applications in natural language processing (NLP), computer vision (CV), and multimodal tasks. Representative models, including BERT, GPT, T5, ViT, and MobileViT, are selected for their architectural diversity and relevance across domains. Core components, including self-attention mechanisms, positional encoding schemes, and feed-forward networks, are dissected using a systematic review methodology, supported by a visual framework that improves clarity and reproducibility. Performance comparisons are discussed using standard evaluation metrics such as accuracy, F1-score, and Intersection over Union (IoU), with particular attention to trade-offs between computational cost and model effectiveness. Lightweight models such as DistilBERT and MobileViT are analyzed for their deployment feasibility. Major challenges, including quadratic attention complexity, hardware constraints, and limited generalization, are examined alongside solutions such as sparse attention mechanisms, model distillation, and hardware accelerators. In addition, ethical aspects, including fairness, interpretability, and sustainability, are critically reviewed in relation to Transformer adoption in sensitive domains. This study offers a domain-spanning overview and proposes practical directions for future research aimed at building scalable, efficient, and ethically aligned Transformer-based systems suited to mobile, embedded, and healthcare applications.