Indonesian Journal of Electrical Engineering and Computer Science
Vol 36, No 2: November 2024

Enhancing diagonal comprehension with advanced topic modeling technique: DIAG-LDA

Sifi, Fatima-Zahrae (Unknown)
Sabbar, Wafae (Unknown)
El Mzabi, Amal (Unknown)



Article Info

Publish Date
01 Nov 2024

Abstract

With the speed increase of reviews or other forms of text, natural language has the ability to convey large and complex amounts of information in relatively small communications. This capability is being leveraged by the machine-learning algorithm known as latent dirichlet allocation (LDA), which can be utilized to discover latent topics within documents. LDA can be also used to generate summaries or abstracts from a given set of documents. However, LDA can struggle to identify topics in short documents or in data with high levels of noise. This article will introduce a new method for topic modeling with LDA based on diagonal reading for sentences (DIAG-LDA). Primarily, the features are selected using the TF-IDF algorithm, and the highest relevant features are extracted using the confidence value. Besides, the classification step is executed utilizing the LDA classifier. Ultimately, we evaluate our model using the convolutional neural network algorithm. The experiment results show that DIAG-LDA performs well in identifying features from text data, achieving a 94.4%, and 89.5% in accuracy for the datasets on international economics and the political economy.

Copyrights © 2024