JSAI (Journal Scientific and Applied Informatics)
Vol 8 No 3 (2025): November

Pemodelan Topik Berdasarkan Dokumen Penelitian Bidang Ilmu Komputer Menggunakan Text Mining

Bakhtiar Bakhtiar (Universitas Sjakhyakirti, Palembang, Indonesia)
Azhar Andika Putra (Universitas Sjakhyakirti, Palembang, Indonesia)
Muhammad Al Hapiz (Universitas Sjakhyakirti, Palembang, Indonesia)
Firga Abel Astiawan (Universitas Sjakhyakirti, Palembang, Indonesia)



Article Info

Publish Date
11 Nov 2025

Abstract

This study aimed to develop a document clustering model using a combination of the IndoBERT model and the K-Means algorithm to group research abstracts in the field of computer science and technology. The data used consisted of 1000 research abstracts, divided into two parts: 80% for training data (800 abstracts) and 20% for testing data (200 abstracts). The IndoBERT model was used to represent the abstracts as embedding vectors, which were then processed with the K-Means algorithm to form 10 topic clusters, including artificial intelligence, computer systems and networks, programming, cybersecurity, and others. The training experiment used the training data to generate clusters and centroids for mapping new documents into the appropriate clusters. Evaluation was carried out using several metrics, including accuracy, cluster homogeneity, Davies-Bouldin Index, and Silhouette Score. The testing results showed that the developed model achieved an accuracy of 85%, indicating good performance in clustering the test data. The cluster homogeneity value of 0.90 indicated that documents that should belong to the same cluster were grouped together effectively. The Davies-Bouldin Index value was 0.34, while the Silhouette Score was 0.76.

Copyrights © 2025






Journal Info

Abbrev

JSAI

Publisher

Subject

Computer Science & IT

Description

Jurnal terbitan dibawah fakultas teknik universitas muhammadiyah bengkulu. Pada jurnal ini akan membahas tema tentag Mobile, Animasi, Computer Vision, dan Networking yang merupakan jurnal berbasis science pada informatika, beserta penelitian yang berkaitan dengan implementasi metode dan atau ...