J-KOMA : Jurnal Ilmu Komputer dan Aplikasi
Vol 8 No 1 (2025): J-KOMA : Jurnal Ilmu Komputer dan Aplikasi

Evaluation of TF-IDF Extraction Techniques in Sentiment Analysis of Indonesian-Language Marketplaces Using SVM, Logistic Regression, and Naive Bayes

Budi Lestari, Verra (Unknown)
Apriansyah Hutagalung, Carli (Unknown)



Article Info

Publish Date
22 Jun 2025

Abstract

This study evaluates the application of TF-IDF feature extraction in sentiment analysis of Indonesian-language marketplace product reviews using Logistic Regression, Naïve Bayes, and Support Vector Machine (SVM) algorithms. The dataset, sourced from Kaggle, comprises 831 reviews (385 positive, 446 negative), processed through preprocessing steps including text cleaning, tokenization, stopword removal, and stemming. The data was split into 80% training and 20% testing sets. Results show that Logistic Regression with TF-IDF achieved the highest performance, with 90.4% accuracy, 91.8% precision, 90.4% recall, and 90.9% F1-measure, outperforming Naïve Bayes (87.4% accuracy) and SVM (89.8% accuracy). Logistic Regression effectively captures linear relationships in TF-IDF features, while Naïve Bayes struggles with emotional context, and SVM requires complex parameterization. TF-IDF is efficient for explicit reviews but limited in handling complex semantic contexts like sarcasm. This study confirms that Logistic Regression combined with TF-IDF is the most effective approach for sentiment analysis of Indonesian marketplace reviews, with recommendations for future exploration of methods like word embedding.

Copyrights © 2025






Journal Info

Abbrev

jkoma

Publisher

Subject

Computer Science & IT

Description

J-KOMA is an open access journal, with core focus in two aspect: computer science general and information technology. All copyrights are retained by each respective author, but we hold publishing right. Currently, this journal has E-ISSN :2620-4827 published by LIPI which made it as a national ...