Kana, A. F. Donfack
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

A Systematic Review on Machine Translation for Low Resource Nigerian Languages M. Abdulmusawir, Tijani; Kana, A. F. Donfack; Abubakar, Amina H.
IT Journal Research and Development Vol. 10 No. 2 (2025)
Publisher : UIR PRESS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.25299/itjrd.2025.21277

Abstract

Nigeria ranks among Africa's most linguistically diverse countries with over 500 indigenous languages, yet machine translation (MT) research remains severely limited for these low-resource languages. This systematic review examines the current state of MT research for Nigerian languages, identifies persistent challenges, and analyzes methodological trends. A systematic literature search was conducted across eleven databases including PubMed, Web of Science, and Scopus from January 2010 to August 2025. Search terms combined machine translation approaches with Nigerian language terms. Studies were screened using PRISMA guidelines requiring original research with evaluation metrics. From 51 papers, 25 duplicates were removed, 7 excluded for selection criteria, and 3 for lack of contribution, resulting in 16 studies. Only 11 Nigerian languages (2.2% of over 500 languages) were covered, creating a 97.8% research gap. Yoruba led with 4 studies, followed by Igala (3), Igbo and Nigerian Pidgin (2 each). Methods evolved from rule-based (4 studies, 2014 to 2021) through SMT (2 studies, 2016 to 2019) to NMT dominance (10 studies, 2018 to 2025). Idiomatic expression handling was the most persistent challenge (16.7%), followed by complex sentences, data scarcity, and domain specificity (each 9.5%). Nigerian MT research shows severe underrepresentation with persistent challenges in idiomatic expressions and data scarcity across all approaches. Neural method adoption reflects global trends but doesn't address resource constraints. Coordinated national approaches prioritizing parallel corpora creation and institutional partnerships are needed to prevent digital divides and support language preservation.