Budi Lestari, Verra
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Evaluation of TF-IDF Extraction Techniques in Sentiment Analysis of Indonesian-Language Marketplaces Using SVM, Logistic Regression, and Naive Bayes Budi Lestari, Verra; Apriansyah Hutagalung, Carli
J-KOMA : Jurnal Ilmu Komputer dan Aplikasi Vol 8 No 1 (2025): J-KOMA : Jurnal Ilmu Komputer dan Aplikasi
Publisher : Universitas Negeri Jakarta

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21009/j-koma.v8i1.05

Abstract

This study evaluates the application of TF-IDF feature extraction in sentiment analysis of Indonesian-language marketplace product reviews using Logistic Regression, Naïve Bayes, and Support Vector Machine (SVM) algorithms. The dataset, sourced from Kaggle, comprises 831 reviews (385 positive, 446 negative), processed through preprocessing steps including text cleaning, tokenization, stopword removal, and stemming. The data was split into 80% training and 20% testing sets. Results show that Logistic Regression with TF-IDF achieved the highest performance, with 90.4% accuracy, 91.8% precision, 90.4% recall, and 90.9% F1-measure, outperforming Naïve Bayes (87.4% accuracy) and SVM (89.8% accuracy). Logistic Regression effectively captures linear relationships in TF-IDF features, while Naïve Bayes struggles with emotional context, and SVM requires complex parameterization. TF-IDF is efficient for explicit reviews but limited in handling complex semantic contexts like sarcasm. This study confirms that Logistic Regression combined with TF-IDF is the most effective approach for sentiment analysis of Indonesian marketplace reviews, with recommendations for future exploration of methods like word embedding.