Garuda - Garba Rujukan Digital

Building of Informatics, Technology and Science

Vol 6 No 2 (2024): September 2024

Andriyanto, Rifki (Unknown)
Kusrini, Kusrini (Unknown)

Publish Date
11 Sep 2024

Sentiment analysis on hotel reviews often faces the challenge of class imbalance, where positive reviews significantly outnumber negative or neutral ones. This study aims to improve the effectiveness of sentiment analysis models on imbalanced hotel reviews by examining combinations of word embedding methods (FastText, Word2Vec, Doc2Vec) and model architectures (LSTM, BiLSTM, BiLSTM-Attention). Class imbalance is addressed using SMOTE, and models are evaluated using Stratified K-Fold cross-validation. Results show that Doc2Vec consistently outperforms FastText and Word2Vec as an embedding method, especially when combined with the BiLSTM-Attention architecture. The use of SMOTE and Stratified K-Fold also proves effective in improving model performance on imbalanced datasets. The study concludes that selecting appropriate word embedding methods and model architectures, together with applying techniques for handling class imbalance, is crucial to developing effective and robust sentiment analysis models for hotel reviews.
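
The abstract names SMOTE and Stratified K-Fold but does not describe how they are combined. The following is a minimal, standard-library-only sketch of one common arrangement, not the authors' implementation. It assumes that SMOTE is applied only to the training portion of each fold, so synthetic samples never reach the test split. The function names `stratified_k_fold` and `smote`, the toy 2-D vectors standing in for review embeddings, and the parameters `k=5` and `n_neighbors=5` are all illustrative assumptions.

```python
import random
from collections import defaultdict


def stratified_k_fold(labels, k, seed=0):
    """Yield (train_idx, test_idx) pairs whose class ratios mirror the full set."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)
    folds = [[] for _ in range(k)]
    for idxs in by_class.values():
        rng.shuffle(idxs)
        # Deal each class's indices round-robin so every fold gets its share.
        for j, i in enumerate(idxs):
            folds[j % k].append(i)
    for f in range(k):
        test = sorted(folds[f])
        train = sorted(i for g in range(k) if g != f for i in folds[g])
        yield train, test


def smote(X, y, n_neighbors=5, seed=0):
    """Oversample smaller classes up to the largest class size by interpolation.

    Classes with fewer than two samples are left unchanged.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for x, label in zip(X, y):
        by_class[label].append(x)
    target = max(len(pts) for pts in by_class.values())
    X_out, y_out = list(X), list(y)
    for label, pts in by_class.items():
        if len(pts) < 2:
            continue  # no same-class neighbour to interpolate towards
        for _ in range(target - len(pts)):
            base = rng.choice(pts)
            # Rank the other same-class points by squared Euclidean distance.
            others = sorted(
                (p for p in pts if p is not base),
                key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
            )
            neighbor = rng.choice(others[:n_neighbors])
            gap = rng.random()
            # The synthetic point lies on the segment between base and neighbor.
            X_out.append([a + gap * (b - a) for a, b in zip(base, neighbor)])
            y_out.append(label)
    return X_out, y_out


# Toy imbalanced data: 20 "pos" and 5 "neg" vectors (hypothetical embeddings).
labels = ["pos"] * 20 + ["neg"] * 5
X = [[float(i), float(i % 3)] for i in range(25)]

fold_summaries = []
for train_idx, test_idx in stratified_k_fold(labels, k=5):
    X_train = [X[i] for i in train_idx]
    y_train = [labels[i] for i in train_idx]
    # Resample the training split only; the test split stays untouched.
    X_res, y_res = smote(X_train, y_train)
    fold_summaries.append((y_res.count("pos"), y_res.count("neg"), len(test_idx)))
```

In this sketch each test fold keeps the original 4:1 class ratio, while each resampled training split is balanced. Oversampling before splitting would instead let interpolated copies of test reviews leak into training, which tends to inflate the reported scores.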

Journal Info

Building of Informatics, Technology and Science

Abbrev

bits

Publisher

Forum Kerjasama Pendidikan Tinggi

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open-access medium for publishing scientific articles that report research results in information technology and computing. Papers submitted to this journal are checked for plagiarism and peer-reviewed first to maintain quality. ...

Article Info

Abstract

Enhancing Sentiment Analysis Effectiveness with LSTM Variants, and Stratified K-Fold on Imbalanced Dataset
