Jurnal Manajemen Informatika
Vol 16 No 1 (2026): Jurnal Manajemen Informatika (JAMIKA)

Komparasi TF-IDF dan BoW pada Analisis Sentimen Shopee-Tokopedia

Salsabila, Jihan (Unknown)
Meida, Silvia (Unknown)
Shifani, Efelien Anindya (Unknown)
Afifah, Hana Mar’atul (Unknown)
Hidayat, Hidayat (Unknown)



Article Info

Publish Date
30 Apr 2026

Abstract

The rapid growth of e-commerce in Indonesia has led to an increase in user interactions in the form of reviews and opinions on services and products. These textual data contain valuable information that can be processed through sentiment analysis to better understand user perceptions. This study aims to compare the effectiveness of Term Frequency–Inverse Document Frequency (TF-IDF) and Bag of Words (BoW) feature extraction methods in classifying user sentiments, as well as to evaluate the performance of Support Vector Machine (SVM) and Random Forest (RF) algorithms on Shopee and Tokopedia platforms. A total of 5,000 user reviews were analyzed through text preprocessing, lexicon-based sentiment labeling, application of TF-IDF and BoW feature extraction methods, model training using SVM and RF algorithms, and performance evaluation using accuracy, precision, recall, and F1-score metrics. The experimental results show the combination of BoW and SVM achieved the highest accuracy of 90% on Shopee reviews, making it the most optimal configuration in this study. Additionally, in Tokopedia reviews, the same configuration (BoW and SVM) also produced a strong accuracy of 88%. In general, the SVM algorithm showed more stable performance than RF, while the BoW method proved to be more effective (measured at up to 90% accuracy) in representing this Indonesian-language e-commerce review data. These findings contribute to the development of more accurate sentiment analysis systems in the local e-commerce domain.

Copyrights © 2026






Journal Info

Abbrev

jamika

Publisher

Subject

Computer Science & IT

Description

Jurnal Manajemen Informatika, merupakan kumpulan dari jurnal, artikel, gagasan, ide, konsep, teori maupun hasil penelitian dari berbagai bidang yang berkaitan dengan teknologi informasi yang merupakan karya dari para staf pengajar di lingkungan Universitas Komputer Indonesia dan Perguruan Tinggi ...