INOVTEK Polbeng - Seri Informatika
Vol. 10 No. 1 (2025): Maret

Comparison of Naïve Bayes, Random Forest, and Logistic Regression Algorithms for Sentiment Analysis Online Gambling

Dwi Nanda Agustia (Universitas Teknokrat Indonesia)
Ryan Randy Suryono (Universitas Teknokrat Indonesia)



Article Info

Publish Date
21 Mar 2025

Abstract

This study aims to compare the performance of Naïve Bayes, Random Forest, and Logistic Regression algorithms for sentiment analysis on the topic of online gambling. The dataset consisted of 4592 entries after preprocessing and applying the SMOTE technique to address class imbalance. The evaluation results show that Random Forest achieved the best performance with an accuracy of 78%, followed by Naïve Bayes and Logistic Regression, both achieving 77%. Random Forest excelled in classifying positive and negative sentiments, while Naïve Bayes demonstrated a significant improvement in recall for neutral sentiment, increasing from 0.45 to 0.82 after the SMOTE application. Logistic Regression showed less optimal performance, particularly for neutral sentiment. This study provides essential guidance for selecting the best algorithms for sentiment analysis in specific domains such as online gambling and highlights the importance of SMOTE in handling imbalanced datasets. The findings of this study can be used by practitioners and policymakers to make more informed decisions in regulating online gambling.

Copyrights © 2025






Journal Info

Abbrev

ISI

Publisher

Subject

Computer Science & IT

Description

The Journal of Innovation and Technology (INOVTEK Polbeng—Seri Informatika) is a distinguished publication hosted by the State Polytechnic of Bengkalis. Dedicated to advancing the field of informatics, this scientific research journal serves as a vital platform for academics, researchers, and ...