Building of Informatics, Technology and Science
Vol 6 No 4 (2025): March 2025

Perbandingan Algoritma SVM, Random Forest, KNN untuk Analisis Sentimen Terhadap Overclaim Skincare pada Media Sosial X

Rahmawati, Ira Tri (Unknown)
Alita, Debby (Unknown)



Article Info

Publish Date
01 Mar 2025

Abstract

The cosmetic industry in Indonesia, especially skincare products, is growing rapidly along with changes in people's lifestyles and technological advances. One of the main issues that arise is overclaiming, which can harm consumers and damage the company's reputation. This study aims to compare the performance of three algorithms in sentiment analysis of skincare overclaims on X social media. The evaluated algorithms include Support Vector Machine (SVM), Random Forest, and K-Nearest Neighbors (KNN). The research dataset consists of 7,774 tweets collected between October 1 and November 30, 2024, with 5,559 tweets after the preprocessing stage, consisting of 4,281 negative sentiment tweets and 1,275 positive sentiment tweets. Data imbalance was addressed using the Synthetic Minority Over-sampling Technique (SMOTE), with 80% data split for training and 20% for testing. The results showed that before the application of SMOTE, the Random Forest algorithm had the highest accuracy of 95%, followed by Support Vector Machine at 91% and K-Nearest Neighbors at 80%. After the application of SMOTE, the accuracy increased significantly, with Random Forest reaching 98%, Support Vector Machine 97%, and K-Nearest Neighbors 84%. Random Forest proved to be the best algorithm, with the highest performance before and after SMOTE implementation, and was effective in handling both sentiment classes. This research provides insights for the skincare industry and regulators to detect and address product over-claiming issues through machine learning-based approaches.

Copyrights © 2025






Journal Info

Abbrev

bits

Publisher

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open access media in publishing scientific articles that contain the results of research in information technology and computers. Paper that enters this journal will be checked for plagiarism and peer-rewiew first to maintain its quality. ...