INOVTEK Polbeng - Seri Informatika
Vol. 10 No. 3 (2025): November

Comparison of SVM and Naive Bayes Algorithms in Sentiment Analysis of User Reviews on Bukalapak

Alghifari, M Yasir (Unknown)
Sanjaya, M. Rudi (Unknown)
Dwi Rosa Indah (Unknown)
Ruskan, Endang Lestari (Unknown)



Article Info

Publish Date
15 Nov 2025

Abstract

Indonesia’s rapid e-commerce growth has produced a vast volume of user reviews, yet their use for insight extraction remains limited—particularly for the Bukalapak platform. This study compares the performance of Naïve Bayes and Support Vector Machine for sentiment classification on 10,000 Bukalapak reviews. The workflow includes text preprocessing (cleaning, case folding, tokenization, stopword removal, and stemming) and feature extraction using Term Frequency–Inverse Document Frequency (TF-IDF; max_features = 10,000). Evaluation employs 10-fold cross-validation with accuracy, precision, recall, and F1-score, complemented by a paired t-test for significance. Results show SVM outperforming NB (accuracy 84.48% vs. 83.96%; F1 0.8253 vs. 0.8205) with better consistency (standard deviation ±1.08% vs. ±1.24%). The t-test confirms a significant difference (p = 0.019), with SVM’s advantage most evident for the negative class (precision 0.80 vs. 0.78). Both models underperform on the neutral class due to severe class imbalance. These findings provide empirical evidence for algorithm selection in Indonesian e-commerce sentiment analysis and open avenues for future research using deep learning and class-imbalance handling techniques.

Copyrights © 2025






Journal Info

Abbrev

ISI

Publisher

Subject

Computer Science & IT

Description

The Journal of Innovation and Technology (INOVTEK Polbeng—Seri Informatika) is a distinguished publication hosted by the State Polytechnic of Bengkalis. Dedicated to advancing the field of informatics, this scientific research journal serves as a vital platform for academics, researchers, and ...