Rahmadi, Novianto
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

LexIndoLLM: Large Language Model untuk Konsultasi Regulasi Daerah di Indonesia: LexIndoLLM: A Large Language Model for Consulting Local Regulations in Indonesia Rahmadi, Novianto; Setyanto, Arief
Jurnal Buana Informatika Vol. 17 No. 1 (2026): Jurnal Buana Informatika, Volume 17, Nomor 1, April 2026
Publisher : Universitas Atma Jaya Yogyakarta

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24002/jbi.v17i1.14326

Abstract

Large Language Model (LLM) berpotensi meningkatkan akses terhadap layanan konsultasi regulasi daerah, tetapi model generik masih sering menghasilkan jawaban yang kurang akurat pada dokumen hukum Indonesia yang panjang, formal, dan kontekstual. Penelitian ini mengembangkan LexIndoLLM, model ringan berbasis Llama 3.2-1B, melalui fine-tuning bertahap pada 393 dokumen regulasi Kabupaten Kutai Kartanegara dan integrasi Retrieval-Augmented Generation (RAG) berbasis FAISS. Evaluasi dilakukan menggunakan RAGAS, perplexity, ROUGE-L, dan metrik efisiensi inferensi. Hasil menunjukkan bahwa pendekatan yang diusulkan meningkatkan kualitas jawaban, ditandai dengan penurunan perplexity dari 9,13 menjadi 1,74, peningkatan ROUGE-L dari 0,2058 menjadi 0,4429, serta nilai faithfulness 0,77 dan answer correctness 0,66. Waktu respons rata-rata di bawah 3,4 detik sehingga cocok untuk deployment lokal. Temuan ini menunjukkan bahwa model ringan yang dipadukan dengan retrieval layak digunakan untuk konsultasi regulasi daerah pada lingkungan komputasi terbatas.   Large Language Models (LLMs) have the potential to improve access to local regulatory consultation services, yet general-purpose models often produce inaccurate responses when handling Indonesian legal documents that are lengthy, formal, and highly contextual. This study develops LexIndoLLM, a lightweight model based on Llama 3.2-1B, through staged fine-tuning on 393 local regulatory documents from Kutai Kartanegara Regency and the integration of FAISS-based Retrieval-Augmented Generation (RAG). The model was evaluated using RAGAS, perplexity, ROUGE-L, and inference efficiency metrics. The results show that the proposed approach improves answer quality, as indicated by a reduction in perplexity from 9.13 to 1.74, an increase in ROUGE-L from 0.2058 to 0.4429, and faithfulness and answer correctness scores of 0.77 and 0.66, respectively. The system maintains an average response time under 3.4 seconds, suitable for local deployment. These findings indicate that a lightweight model combined with retrieval is feasible for local regulatory consultation in resource-constrained environments.