Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Indonesian Journal on Computing (Indo-JC)

Speech to Text Correction for Indonesian Early Marriage Counseling Chatbots Using IndoRoBERTa and Mistral-7B Firdhaus Dwi Sukma; Rifki Wijaya; Ade Romadhony
Indonesian Journal on Computing (Indo-JC) Vol. 10 No. 1 (2025): August, 2025
Publisher : Telkom University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21108/indojc.v10i1.9708

Abstract

Early marriage among individuals of immature age continues to draw significant attention in Lombok. As of 2021, the prevalence rate stands at 16.59%, indicating that this social issue remains unresolved within the region's community dynamics. Limited access to counseling services particularly in rural areas poses a significant barrier to prevention efforts. This study introduces a virtual counseling chatbot designed to detect and correct Indonesian language text errors during user interactions. The system integrates IndoRoBERTa for error detection and Mistral-7B-Instruct to refine speech to text transcriptions. IndoRoBERTa was trained on synthetic datasets to classify user input as accurate or incorrect, while Mistral-7B-Instruct generates context aware corrections. Achieving an accuracy rate of 98.90%, IndoRoBERTa outperformed benchmark models such as BERT and RNN. The proposed chatbot offers an adaptive and accessible digital solution, especially for communities with limited access to conventional counseling services. This approach highlights the potential of AI-driven tools to support early intervention strategies and reduce the incidence of child marriage in underserved regions.
Implementation of IndoRoBERTa to Improve the Clarity of the Context of Homograph Words in the Text-to-Speech System for Education Chatbot Early Marriage in Lombok Fikri Rahmanda Noor; Rifki Wijaya; Ade Romadhony
Indonesian Journal on Computing (Indo-JC) Vol. 10 No. 2 (2026): February, 2026
Publisher : Telkom University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21108/indojc.v10i2.9709

Abstract

This study presents the implementation of IndoRoBERTa, a pre-trained Indonesian language model, to improve the contextual clarity of homograph words in Text-to-Speech (TTS) systems, particularly for virtual chatbot applications addressing early marriage education in Lombok. The proposed system integrates IndoRoBERTa into the TTS pipeline to classify the context of homographs prior to grapheme-to-phoneme (G2P) conversion, ensuring accurate pronunciation based on meaning. The research was conducted in two fine-tuning phases: the first utilized 500 manually labeled conversational samples, achieving 96% test accuracy, while the second expanded the dataset with 2,000 auto-labeled samples and yielded 88% accuracy. Evaluation metrics including precision, recall, and F1-score demonstrated the model’s effectiveness across 20 homograph categories. Despite strong results, the study acknowledges limitations in data authenticity and challenges in underrepresented classes. Future work is recommended to incorporate real-world dialogue data and enhance the system’s generalization in more complex linguistic settings. This research contributes to the advancement of Indonesian NLP in TTS systems, particularly in socially impactful educational contexts.
Co-Authors A, Subaveerapandiyan Aditia Rafif Khoerulloh Adiwijaya Affan Fattahila, Ananda Agung Toto Wibowo Al Aufar, Arya Prima Al Faraby, Said Alfian Akbar Gozali Ali Ridho Fauzi Rahman Ananda Wulandari Anditya Arifianto Anisa Herdiani Anisah Firli Ardiansyah, Yusfi Arya Prima Al Aufar Bambang Pudjoatmodjo Bambang Pudjotatmodjo Barawi, Mohamad Hardyman Bedy Purnama Bhudi Jati Prio Utomo Bimmo Satryo Wicaksono Brady Rikumahu Dadan Rahadian Dade Nurjanah Dana Kusumo Dana S Kusumo Dana S Kusumo Dodi Wisaksono Sudiharto Donni Richasdy Ema Rachmawati Ema Rachmawati Fat'hah Noor Prawira Fat’hah Noor Prawira Fat’hah Noor Prawira Fazainsyah Azka Wicaksono Fazmah Arif Yulianto Fikri Rahmanda Noor Firdhaus Dwi Sukma Frima, Mariana Gheartha, I Gusti Bagus Yogiswara H Hasmawati Hamdy Nur Saidy Haryo Adi Nugroho Haryo Adi Nugroho Haryo Nugroho Hasmawat, Hasmawat Hasmawati Hasmawati Hasmawati Hasmawati Hasmawati Herman, Fizio Ramadhan Imelda Atastina Januarahman, Faishal Kemas Rahmat S.W Kemas Rahmat Saleh Wiharja Lintani Afina Hajar Raudhoti Luh Putri Ayu Ningsih Mahmud Dwi Sulistiyo Moch Arif Bijaksana Muhammad Arzaki Muhammad Aziz Pratama Muhammad Farrel Muhammad Iqbal Muhammad Iqbal Muhammad Taufik Wahdiat Muhammad Zaky Aonillah Nadine Azhalia Purbani Ningsih, Shabrina Retno Nugraha, Azhar Baihaqi Nur, Farhan Ahmadi Javier othman, mohd kamal Pramana, Rifki Adi Prawita, Fat’hah Noor Putu Harry Gunawan Ramanti Dharayani Rhesa Hermawan Ridea Valentini Peristiwari Siwabessy Rifki Wijaya Rimba Whidiana Ciptasari Riska Junia Wulandari Rita Rismala Said Faraby Selly Meliana Setiawan, Muhammad Rizki Ramadhan Siti Saadah Tresna Ariesta, Bayu Untari Novia Wisesty Wijaya, Kurniadi Ahmad