Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI)
Vol. 10 No. 2 (2024): June

Word Embedding Feature for Improvement Machine Learning Performance in Sentiment Analysis Disney Plus Hotstar Comments

Jasmir, Jasmir
Nurhadi, Nurhadi
Rohaini, Eni
Pahlevi B, M Riza
Pardamean Simanjuntak, Daniel Sintong



Article Info

Publish Date
28 Jun 2024

Abstract

In this research, we apply several machine learning methods and word embedding features to social media data, specifically user comments on the Disney Plus Hotstar application. The word embedding features are Word2Vec, GloVe, and FastText. Our aim is to evaluate their impact on the classification performance of three machine learning methods: Naive Bayes (NB), K-Nearest Neighbor (KNN), and Random Forest (RF). NB is simple and efficient but very sensitive to feature selection. KNN has known weaknesses: bias from the choice of k, high computational cost, memory limitations, and poor handling of irrelevant attributes. RF's weakness is that its evaluation scores can change significantly with only a slight change in the data. Feature selection in text classification is crucial for improving scalability, efficiency, and accuracy. Our test results indicate that KNN achieved the highest accuracy both before and after feature selection. KNN performed best with the FastText feature, yielding balanced accuracy, precision, recall, and F1-score values.
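As a rough illustration of the kind of pipeline the abstract describes, the sketch below averages word vectors into a comment-level feature, classifies with a small KNN, and reports accuracy, precision, recall, and F1. It is not the authors' implementation: the two-dimensional `TOY_VECTORS`, the example comments, the labels, and all function names are hypothetical, and a real experiment would load pretrained Word2Vec, GloVe, or FastText vectors instead.

```python
import math
from collections import Counter

# Hypothetical 2-d vectors standing in for real Word2Vec/GloVe/FastText embeddings.
TOY_VECTORS = {
    "great": (0.9, 0.1), "love": (0.8, 0.2), "smooth": (0.7, 0.3),
    "bad": (0.1, 0.9), "crash": (0.2, 0.8), "slow": (0.3, 0.7),
}


def embed(comment):
    """Average the vectors of known words; out-of-vocabulary words are skipped."""
    vecs = [TOY_VECTORS[w] for w in comment.lower().split() if w in TOY_VECTORS]
    if not vecs:
        return (0.0, 0.0)
    return tuple(sum(dim) / len(vecs) for dim in zip(*vecs))


def knn_predict(train, query, k=3):
    """Majority vote among the k nearest training points (Euclidean distance)."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def binary_metrics(y_true, y_pred, positive="pos"):
    """Accuracy, precision, recall, and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Made-up labelled comments (assumption: binary positive/negative sentiment).
train_comments = [
    ("great app love it", "pos"), ("smooth and great", "pos"),
    ("love the smooth playback", "pos"), ("bad app crash", "neg"),
    ("slow and bad", "neg"), ("crash crash slow", "neg"),
]
test_comments = [
    ("love it great", "pos"), ("slow crash", "neg"),
    ("bad and slow", "neg"), ("smooth", "pos"),
]

train = [(embed(text), label) for text, label in train_comments]
y_true = [label for _, label in test_comments]
y_pred = [knn_predict(train, embed(text), k=3) for text, _ in test_comments]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

Averaging word vectors is only one simple way to turn embeddings into a fixed-length feature; the abstract does not say how the comment-level features were built, so that choice is an assumption here.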

Copyrights © 2024






Journal Info

Abbrev

JITEKI

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

JITEKI (Jurnal Ilmiah Teknik Elektro Komputer dan Informatika) is a peer-reviewed scientific journal published by Universitas Ahmad Dahlan (UAD) in collaboration with the Institute of Advanced Engineering and Science (IAES). The journal's scope covers 1) Control and Automation, 2) Electrical ...