Garuda - Garba Rujukan Digital

Indonesian Journal on Computing (Indo-JC)

Vol. 10 No. 2 (2026): February, 2026

Fikri Rahmanda Noor (Telkom University)
Rifki Wijaya (Telkom University)
Ade Romadhony (Telkom University)

Publish Date
10 Feb 2026

This study presents the implementation of IndoRoBERTa, a pre-trained Indonesian language model, to improve the contextual clarity of homograph words in Text-to-Speech (TTS) systems, particularly for virtual chatbot applications addressing early marriage education in Lombok. The proposed system integrates IndoRoBERTa into the TTS pipeline to classify the context of homographs prior to grapheme-to-phoneme (G2P) conversion, ensuring accurate pronunciation based on meaning. The research was conducted in two fine-tuning phases: the first utilized 500 manually labeled conversational samples, achieving 96% test accuracy, while the second expanded the dataset with 2,000 auto-labeled samples and yielded 88% accuracy. Evaluation metrics including precision, recall, and F1-score demonstrated the model’s effectiveness across 20 homograph categories. Despite strong results, the study acknowledges limitations in data authenticity and challenges in underrepresented classes. Future work is recommended to incorporate real-world dialogue data and enhance the system’s generalization in more complex linguistic settings. This research contributes to the advancement of Indonesian NLP in TTS systems, particularly in socially impactful educational contexts.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Indonesian Journal on Computing (Indo-JC)

Website

Abbrev

indojc

Publisher

Universitas Telkom

Subject

Computer Science & IT Control & Systems Engineering Education Engineering

Description

Indonesian Journal on Computing (Indo-JC) is an open access scientific journal intended to bring together researchers and practitioners dealing with the general field of computing. Indo-JC is published by School of Computing, Telkom University (Indonesia). The journal coverage includes, but is not ...

Article Info

Abstract

Implementation of IndoRoBERTa to Improve the Clarity of the Context of Homograph Words in the Text-to-Speech System for Education Chatbot Early Marriage in Lombok

Article Info

Abstract