Building of Informatics, Technology and Science
Vol 7 No 3 (2025): December 2025

Customer Sentiment Analysis of E-Commerce Products Using the Naïve Bayes Method and Word Embedding

Harpad, Bartolomius (Unknown)
Azahari, Azahari (Unknown)
Salmon, Salmon (Unknown)



Article Info

Publish Date
31 Dec 2025

Abstract

This study discusses customer sentiment analysis toward e-commerce products using the Naïve Bayes method combined with Word Embedding techniques to enhance the semantic understanding of Indonesian-language customer reviews. The research background is based on the rapid growth of e-commerce, which has created a strong need to understand consumer opinions through online reviews. The main challenge in sentiment analysis lies in the complexity of natural language, such as the use of informal words, abbreviations, and diverse emotional expressions. This study utilizes 40,607 Tokopedia customer reviews across five product categories with three sentiment labels (positive, neutral, and negative). The research stages include data collection, text preprocessing (case folding, tokenization, stopword removal, stemming, and slang normalization), feature representation using Word2Vec and FastText, and classification using Multinomial Naïve Bayes. Experimental results show that the combination of Word2Vec and Naïve Bayes achieved an accuracy of 87.92%, while FastText and Naïve Bayes improved accuracy to 91.52%. The FastText-based model proved superior in handling morphological variations and non-standard spellings, making it more effective for Indonesian customer review texts. The WordCloud visualization reveals the dominance of positive words such as “sesuai” (appropriate), “barang” (item), and “cepat” (fast), indicating customer satisfaction regarding product conformity and service speed. The Confusion Matrix results indicate a bias toward the positive class due to data imbalance, where the model still struggles to recognize neutral and negative classes. Overall, this study demonstrates that integrating Word Embedding with Naïve Bayes enhances classification performance and provides richer semantic representations compared to traditional Bag of Words approaches. This approach has the potential to be applied in developing data-driven recommendation systems and marketing strategies within Indonesia’s e-commerce ecosystem.

Copyrights © 2025






Journal Info

Abbrev

bits

Publisher

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open access media in publishing scientific articles that contain the results of research in information technology and computers. Paper that enters this journal will be checked for plagiarism and peer-rewiew first to maintain its quality. ...