Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control
Vol. 11, No. 3, August 2026 (Article in Progress)

Comparison of Word2Vec and GloVe performance in Bi-LSTM models for Indonesian news classification

Muhammad Faris Wafda (Universitas Trunojoyo Madura)
Husni (Universitas Trunojoyo Madura)
Ika Oktavia Suzanti (Unknown)
Firdaus Solihin (Universitas Trunojoyo Madura)
Mula'ab (Universitas Trunojoyo Madura)
Army Justitia (National Cheng Kung University)



Article Info

Publish Date
07 Jun 2026

Abstract

The explosion in the volume of textual data from digital news presents challenges in classifying content automatically and efficiently. For the task of classifying Indonesian-language news, this study aims to compare the performance of several word embeddings specifically Word2Vec using CBOW and Skip-Gram architectures and GloVe when applied to a Bidirectional Long Short-Term Memory (Bi-LSTM) model. This study uses a dataset consisting of 6,715 news articles from the Indonesian news portal that have undergone pre-processing, divided into five categories. The model was trained using 80% of the training data with K-Fold Cross Validation (K=5), while the remaining 20% of the data was used for testing. The experimental findings indicate that the Bi-LSTM model, when combined with CBOW embedding, yielded the best performance, achieving 95.16% accuracy and a 95.15% F1-Score. The Skip-Gram model followed with solid performance, achieving an accuracy of 93.30% and the fastest computation time. Conversely, the model that used pre-trained GloVe embedding delivered the poorest performance, achieving 88.98% accuracy. This result suggests that training embeddings on a specific domain is more effective at capturing local context. The conclusion of this study confirms that selecting a word embedding method specifically trained on local datasets is also an important step in achieving optimal accuracy in Indonesian news text classification.

Copyrights © 2026






Journal Info

Abbrev

kinetik

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering Energy Engineering

Description

Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control was published by Universitas Muhammadiyah Malang. journal is open access journal in the field of Informatics and Electrical Engineering. This journal is available for researchers who want to improve ...