Jurnal: International Journal of Engineering and Computer Science Applications (IJECSA)
Vol 3 No 2 (2024): September 2024

Thesis Topic Modeling Study: Latent Dirichlet Allocation (LDA) and Machine Learning Approach

Hairani, Hairani (Unknown)
Janhasmadja, Mengas (Unknown)
Tholib, Abu (Unknown)
Ximenes Guterres, Juvinal (Unknown)
Ariyanto, Yuri (Unknown)



Article Info

Publish Date
03 Sep 2024

Abstract

The thesis reports housed in the campus repository have yet to be analyzed to reveal valuable knowledge patterns. Analyzing trends in thesis research topics can facilitate the selection of research topics, aid in mapping research areas, and identify underexplored topics.Therefore, this research aims to model and classify thesis topics using Latent Dirichlet Allocation (LDA) and the Naïve Bayes and Support Vector Machine (SVM) methods. This study employs the LDA method for thesis topic modeling, while SVM and Naïve Bayes are used for classifying these topics. The research results show that LDA successfully modeled five of the most popular thesis topics, namely two related to computer networks, two on software engineering, and one on multimedia. For thesis topic classification, the SVM method demonstrated higher accuracy than Naïve Bayes, reaching 92.80% after the data was balanced using Synthetic Minority Oversampling Technique (SMOTE). The implication of this study is that the topic modeling approach using LDA is able to identify dominant thesis topics. In addition, the SVM classification results obtained better accuracy than Naïve Bayes in the thesis topic classification task.

Copyrights © 2024






Journal Info

Abbrev

IJECSA

Publisher

Subject

Computer Science & IT

Description

Description of Journal : The International Journal of Engineering and Computer Science Applications (IJECSA) is a scientific journal that was born as a forum to facilitate scientists, especially in the field of computer science, to publish their research papers. The 12th of the 12th month of 2021 is ...