Suryanti, Rahmatika
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Perbandingan Metode TF-IDF dan Bag of Words dalam Analisis Sentimen Diet Kopi Americano di Media Sosial Twitter Menggunakan Naïve Bayes Suryanti, Rahmatika; Prasetyaningrum, Putri
Building of Informatics, Technology and Science (BITS) Vol 7 No 1 (2025): June (2025)
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v7i1.7244

Abstract

The popularity of diet coffee, particularly the Americano variant, has risen alongside the growing trend of healthy lifestyles in society. This phenomenon has led to various public opinions circulating on social media, which need to be analyzed to better understand consumer perceptions. This study compares two commonly used text feature representation methods, Term Frequency-Inverse Document Frequency (TF-IDF) and Bag of Words (BoW), in sentiment analysis using the Naïve Bayes algorithm. Using relevant keywords, data were collected from Twitter and underwent preprocessing stages including case folding, cleansing, tokenizing, stopword removal, and stemming. Sentiment labeling was conducted manually based on keyword indicators, and the data were classified into positive, negative, and neutral categories. The evaluation results show that the TF-IDF model achieved an accuracy of 85%, outperforming BoW which obtained 64%. This performance gap indicates that the choice of feature representation method plays a crucial role in the success of sentiment classification. This research is expected to serve as a reference for optimizing text representation techniques to analyze public opinion on social media, particularly concerning diet products and low-calorie beverages.