Building of Informatics, Technology and Science
Vol 6 No 2 (2024): September 2024

Enhancing Sentiment Analysis Effectiveness with LSTM Variants, and Stratified K-Fold on Imbalanced Dataset

Andriyanto, Rifki
Kusrini, Kusrini



Article Info

Publish Date
11 Sep 2024

Abstract

Sentiment analysis of hotel reviews often faces class imbalance: positive reviews significantly outnumber negative and neutral ones. This study aims to improve the effectiveness of sentiment analysis models on imbalanced hotel reviews by examining combinations of word embedding methods (FastText, Word2Vec, Doc2Vec) and model architectures (LSTM, BiLSTM, BiLSTM-Attention). Class imbalance is addressed with SMOTE, and models are evaluated with Stratified K-Fold cross-validation. The results show that Doc2Vec consistently outperforms FastText and Word2Vec as a word embedding method, especially when combined with the BiLSTM-Attention architecture. SMOTE and Stratified K-Fold also prove effective in improving model performance on the imbalanced dataset. The study concludes that selecting appropriate word embedding methods and model architectures, together with applying techniques for handling class imbalance, is crucial for developing effective and robust sentiment analysis models for hotel reviews.
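The abstract names SMOTE and Stratified K-Fold but does not say how the two were combined. The following is a minimal sketch, assuming the common practice of oversampling only the training portion of each fold so that synthetic samples never reach the held-out reviews. It is not the authors' implementation: the function names `stratified_k_fold` and `smote` are hypothetical, the class counts are invented, and toy 2-D vectors stand in for the FastText, Word2Vec, or Doc2Vec embeddings. Only the Python standard library is used.

```python
import math
import random
from collections import Counter, defaultdict


def stratified_k_fold(labels, k, seed=0):
    """Yield (train_idx, test_idx) pairs whose class ratios mirror the full set."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    folds = [[] for _ in range(k)]
    for idxs in by_class.values():
        rng.shuffle(idxs)
        # Deal each class round-robin so every fold gets a near-equal share.
        for pos, i in enumerate(idxs):
            folds[pos % k].append(i)
    for f in range(k):
        test = sorted(folds[f])
        train = sorted(i for g in range(k) if g != f for i in folds[g])
        yield train, test


def smote(X, y, k_neighbors=5, seed=0):
    """Oversample each smaller class up to the majority count by interpolating
    between a sample and one of its nearest same-class neighbours."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for cls, n in counts.items():
        pts = [x for x, lab in zip(X, y) if lab == cls]
        # Skip the majority class; a class with one sample has no neighbour.
        if n == target or len(pts) < 2:
            continue
        for _ in range(target - n):
            base = rng.choice(pts)
            neighbours = sorted(
                (p for p in pts if p is not base),
                key=lambda p: math.dist(base, p),
            )[:k_neighbors]
            other = rng.choice(neighbours)
            gap = rng.random()  # position on the segment base -> other
            X_out.append([b + gap * (o - b) for b, o in zip(base, other)])
            y_out.append(cls)
    return X_out, y_out


# Toy imbalanced data (assumed counts, not taken from the paper).
data_rng = random.Random(42)
labels = ["positive"] * 30 + ["negative"] * 10 + ["neutral"] * 5
centers = {"positive": (1.0, 1.0), "negative": (-1.0, -1.0), "neutral": (0.0, 0.0)}
X = [[data_rng.gauss(centers[l][0], 0.3), data_rng.gauss(centers[l][1], 0.3)]
     for l in labels]

fold_reports = []
for train_idx, test_idx in stratified_k_fold(labels, k=5):
    X_train = [X[i] for i in train_idx]
    y_train = [labels[i] for i in train_idx]
    # Balance the training fold only; the test fold keeps its real distribution.
    X_bal, y_bal = smote(X_train, y_train, k_neighbors=3)
    test_counts = Counter(labels[i] for i in test_idx)
    fold_reports.append((Counter(y_bal), test_counts))
    print(dict(Counter(y_bal)), dict(test_counts))
```

In this sketch every test fold keeps the original class proportions while each training fold is balanced before a model would be fitted. In practice a library implementation of SMOTE and of stratified splitting would normally replace these hand-written helpers.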

Copyrights © 2024






Journal Info

Abbrev

bits

Publisher

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open-access medium for publishing scientific articles that report research results in information technology and computing. Papers submitted to this journal are first checked for plagiarism and peer-reviewed to maintain quality. ...