Jurnal Teknik Informatika C.I.T. Medicom
Vol 16 No 3 (2024): July: Intelligent Decision Support System (IDSS)

Topic modeling using LDA and performance evaluation of classification algorithm: k-NN, SVM, NBC, and DT

Singgalen, Yerik Afrianto



Article Info

Publish Date
30 Jul 2024

Abstract

This research investigates the integration of Latent Dirichlet Allocation (LDA) for topic modeling with the performance evaluation of four classification algorithms, namely k-Nearest Neighbors (k-NN), Support Vector Machines (SVM), Naive Bayes Classifier (NBC), and Decision Trees (DT), within the Digital Content Reviews and Analysis Framework. The framework systematically processes and analyzes digital content through data cleaning, extraction, evaluation, and visualization to enhance the interpretability and predictive accuracy of machine learning models. The study demonstrates that combining LDA with these classification algorithms significantly improves data interpretation and model performance, particularly in handling large-scale textual datasets. Notably, the Decision Tree algorithm achieved 98.86% accuracy after applying the Synthetic Minority Over-sampling Technique (SMOTE), while the Support Vector Machine reached an AUC of 1.000, highlighting the efficacy of these methods in managing imbalanced datasets. The findings provide valuable insights for optimizing model selection and developing more robust and adaptive machine learning models across various applications. This research contributes to the field of artificial intelligence by proposing a comprehensive framework that addresses complex data-driven challenges, and it encourages further exploration of more flexible and scalable models to accommodate evolving data environments.
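Two terms in the abstract, SMOTE and AUC, may be easier to read with a small illustration. The sketch below is not the study's pipeline: the function names, the toy two-dimensional feature vectors (imagined here as two LDA topic weights per review), and the parameter values are all assumptions made for illustration. It shows SMOTE-style oversampling as interpolation between a minority sample and one of its nearest minority neighbours, and AUC as the fraction of positive/negative pairs that a classifier's scores rank correctly.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (SMOTE-style)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` inside the minority class, excluding itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append(tuple(b + gap * (o - b) for b, o in zip(base, other)))
    return synthetic


def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly;
    tied scores count as half a correct pair."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical feature vectors: 3 minority-class and 6 majority-class reviews.
minority = [(0.9, 0.1), (0.8, 0.2), (0.85, 0.15)]
majority = [(0.1, 0.9), (0.2, 0.8), (0.3, 0.7),
            (0.15, 0.85), (0.25, 0.75), (0.2, 0.9)]

# Add synthetic minority points until both classes have the same size.
new_points = smote_like_oversample(minority, n_new=len(majority) - len(minority))
balanced_minority = minority + new_points
```

Under this pairwise definition, an AUC of 1.000 means every positive example received a higher score than every negative example. Each synthetic point lies on a line segment between two real minority samples, so oversampling of this kind adds no information beyond what the minority class already contains.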

Copyrights © 2024






Journal Info

Abbrev

JTI

Subject

Computer Science & IT

Description

The Jurnal Teknik Informatika C.I.T is a scientific journal of decision support systems, expert systems, and artificial intelligence, which includes scholarly writings on pure research and applied research in the field of information systems and information technology as well as a general review of ...