Journal of applied statistics and data mining
Vol. 6 No. 1 (2025): Journal Applied Statistics and Data Mining

Implementation of Text Mining for Grouping Thesis Titles Using the K-Harmonic Means Method

Sukowati (Unknown)
Lisna Zahrotun (Unknown)



Article Info

Publish Date
30 Jun 2025

Abstract

Background: A thesis is a course required for completing a Bachelor's degree (S-1). Thesis documents are typically collected in a library file. This data will be more useful if subjected to in-depth analysis. One such analysis is identifying trends in student thesis topics using clustering techniques. Objective: To analyze and group thesis titles from the years 2012 to 2017 using text mining techniques, with the aim of identifying topic trends through clustering with the K-Harmonic Means method. Methods: The method used in this study is K-Harmonic Means clustering. The stages carried out include tokenization, filtering, stemming, TF-IDF, grouping using K-Harmonic Means, and testing using purity. Results: The result of this research is an application that can process thesis titles into trend groups of thesis titles. From the test conducted using purity obtained a value of 0.63 which means the K-Harmonic method is quite good in grouping. The results of the analysis show that the topic of Multimedia and soft computing became a trend for 3 years, namely 2012, 2013 and 2014, while the topic of mobile applications and web programming became a trend in 2013 and 2015. Conclusion: The results of grouping using the K-Harmonic Means method show a sufficient accuracy value of 0.63. This proves that the K-Harmonic Means method is quite suitable for carrying out the process of grouping text-based data.

Copyrights © 2025






Journal Info

Abbrev

jasdm

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Economics, Econometrics & Finance Social Sciences

Description

Journal of applied statistics and data mining provide open access, which in principle makes research open and freely available to the public so that it becomes a means of global knowledge exchange. Published twice a year, in June and December. This journal publishes scientific articles as research ...