Compiler
Vol 14, No 1 (2025): May

Enhancing Sentiment and Emotion Classification with LSTM-Based Semi-Supervised Learning

Husaini, Rochmat
Cahyana, Nur Heri
Wisnalmawati, Wisnalmawati
Mardiana, Tri
Fauziah, Yuli



Article Info

Publish Date
13 Jun 2025

Abstract

Sentiment analysis increasingly relies on semi-supervised learning (SSL) models, largely because they can exploit large amounts of unlabeled data efficiently. This study used four Indonesian datasets: Sentiment Ridife (sentiment classification), Emotion IndoNLU (emotion classification), Sentiment IndoNLU (sentiment classification), and Hate Speech (offensive content detection). An LSTM model was trained on labeled data and then used to generate pseudo-labels for unlabeled data over three iterations. The pseudo-labels were evaluated using Random Forest, Logistic Regression, and Support Vector Machine (SVM) classifiers. The effectiveness of the LSTM model varied across datasets. On the Sentiment Ridife dataset, LSTM achieved an accuracy of 70.23%, slightly lower than Random Forest but higher than Logistic Regression and SVM. On the Sentiment IndoNLU dataset, LSTM reached an accuracy of 86.12%, a strong result but slightly below Random Forest and Logistic Regression. On the Emotion IndoNLU dataset, all models performed similarly, while on the Hate Speech dataset LSTM performed well with an accuracy of 86.49%. These results indicate that LSTM-based SSL can generate effective pseudo-labels and enhance model performance, but that its effectiveness depends on the dataset and task. The study underscores the need for further research into optimizing pseudo-labeling techniques and exploring advanced NLP models to improve sentiment and emotion analysis in diverse languages.
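The iterative pseudo-labeling procedure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. To stay dependency-free, a toy nearest-centroid classifier (`train_centroid`) stands in for the LSTM. The confidence threshold and all function names are assumptions introduced here for illustration. Only the three-iteration loop comes from the abstract.

```python
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Predictor = Callable[[Vector], Tuple[int, float]]  # returns (label, confidence)


def train_centroid(xs: List[Vector], ys: List[int]) -> Predictor:
    """Toy stand-in for the LSTM: a nearest-centroid classifier (assumption)."""
    sums, counts = {}, {}
    for x, y in zip(xs, ys):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    centroids = {y: [v / counts[y] for v in acc] for y, acc in sums.items()}

    def predict(x: Vector) -> Tuple[int, float]:
        # Inverse squared distance to each centroid, normalised to sum to 1,
        # serves as a rough confidence score.
        weights = {
            y: 1.0 / (1e-9 + sum((a - b) ** 2 for a, b in zip(x, c)))
            for y, c in centroids.items()
        }
        label = max(weights, key=weights.get)
        return label, weights[label] / sum(weights.values())

    return predict


def pseudo_label(labeled_x, labeled_y, unlabeled_x,
                 train=train_centroid, iterations=3, threshold=0.8):
    """Self-training loop: retrain, label confident samples, repeat.

    `iterations=3` follows the abstract; `threshold` is a hypothetical
    confidence cut-off that the abstract does not specify.
    """
    xs, ys = list(labeled_x), list(labeled_y)
    pool = list(unlabeled_x)
    for _ in range(iterations):
        if not pool:
            break
        model = train(xs, ys)  # retrain on labeled plus pseudo-labeled data
        remaining = []
        for x in pool:
            label, confidence = model(x)
            if confidence >= threshold:
                xs.append(x)   # accept the pseudo-label
                ys.append(label)
            else:
                remaining.append(x)  # keep for a later iteration
        pool = remaining
    return xs, ys, pool


if __name__ == "__main__":
    xs, ys, pool = pseudo_label(
        [[0.0, 0.0], [10.0, 10.0]], [0, 1],
        [[1.0, 1.0], [9.0, 9.0], [5.0, 5.0]],
    )
    print(len(xs), ys, pool)
```

In this toy run the two points near a labeled example are absorbed in the first iteration, while the point equidistant from both classes never reaches the threshold and stays unlabeled. In a setup like the one the abstract describes, the augmented training set would then be passed to downstream classifiers such as Random Forest, Logistic Regression, or SVM for evaluation.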

Copyright © 2025






Journal Info

Abbrev

compiler

Publisher

Subject

Computer Science & IT

Description

The "COMPILER" journal, with print ISSN 2252-3839 and online ISSN 2549-2403, is published by the Department of Informatics, Sekolah Tinggi Teknologi Adisutjipto Yogyakarta. The journal publishes articles reporting research results in fields of study including Discrete Structures, Ilmu ...