Garuda - Garba Rujukan Digital

Journal of Information Technology (JINTECH)

Vol. 7 No. 1 (2026): February 2026

Ahmadian, Hendri (Unknown)

Publish Date
26 Feb 2026

This systematic literature review (SLR) investigates the evolution of Natural Language Processing (NLP) for Indonesian regional languages from 2020 to 2025. Analyzing 13 pivotal studies, the research identifies a significant transition from fragmented studies of high-population languages, such as Sundanese and Madurese, toward inclusive, archipelago-wide frameworks covering low-resource dialects like Acehnese and Nias. Architecturally, the field has progressed from classical machine learning to Transformer-based Large Language Models (LLMs), including IndoBART and GPT. Furthermore, data provenance has evolved from unstructured social media corpora to standardized multilingual benchmarks like NusaX and NusaCrowd. Despite these advancements, persistent gaps in data standardization and large-scale pretraining resources remain. Future research should prioritize cross-lingual transfer learning and specialized benchmarks to ensure the technological sustainability of Indonesia’s diverse linguistic heritage.

Journal Info

Journal of Information Technology (JINTECH)

Abbrev

jintech

Publisher

Universitas Islam Negeri Ar-Raniry Banda Aceh

Subject

Computer Science & IT Other

Description

This journal provides opportunities for students, lecturers, and information technology practitioners to contribute new understanding and concepts related to the fundamentals of computer science, with the aim of developing information technology. Article scope includes: Information Technology ...

A SYSTEMATIC LITERATURE REVIEW OF NATURAL LANGUAGE PROCESSING FOR INDONESIAN REGIONAL LANGUAGES