Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI)
Vol. 13 No. 2 (2024)

Improving Sentiment Analysis and Topic Extraction in Indonesian Travel App Reviews Through BERT Fine-Tuning

Irmawan, Oky Ade
Budi, Indra
Santoso, Aris Budi
Putra, Prabu Kresna



Article Info

Publish Date
27 Jul 2024

Abstract

The growing use of the internet in Indonesia has driven the rise of Online Travel Agents (OTA). Through OTA applications, users can book transportation tickets and accommodation more easily and quickly. Intensifying competition requires companies such as PT XYZ to respond to their customers' needs and problems in online ticket booking. Many customers review the PT XYZ application on the Play Store and App Store, so a technique is needed to group thousands of reviews and automatically detect the topics customers discuss. In this study, we classified reviews of the Android and iOS applications using a BERT model fine-tuned from IndoBERT, and modeled topics with LDA, using the coherence score to evaluate the topics for each sentiment class. Among the hyperparameter settings compared, the best classification result was obtained with 4 epochs and a learning rate of 5e-5, giving an accuracy of 0.91 and an F1-score of 0.74. We also compared BERT with traditional machine learning algorithms. The best performing of these was Logistic Regression with TF-IDF features, which achieved an accuracy of 0.890 and an F1-score of 0.865. We therefore infer that the accuracy of the fine-tuned IndoBERT classification model is sufficiently high for classifying PT XYZ reviews. Based on the coherence score, the most optimal numbers of topics were 29 for positive reviews, 6 for neutral reviews, and 3 for negative reviews. These findings can serve as evaluation material for PT XYZ in providing the best service to its customers.
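The abstract reports both accuracy and F1-score, and the two can diverge noticeably when sentiment classes are imbalanced. The following is a minimal sketch of how that can happen. It assumes macro-averaged F1 (the abstract does not state which averaging was used), and the labels and predictions are made-up toy data, not results from the study; the helper names `accuracy` and `macro_f1` are hypothetical.

```python
# Toy illustration: accuracy vs. macro-averaged F1 on imbalanced classes.
# All data below is invented for illustration and is NOT from the study.
# Assumption: F1 is macro-averaged (unweighted mean of per-class F1).

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

# 50 hypothetical reviews: 40 positive, 5 neutral, 5 negative.
y_true = ["positive"] * 40 + ["neutral"] * 5 + ["negative"] * 5

# A hypothetical classifier that handles the majority class well
# but sends most neutral reviews to "positive".
y_pred = (
    ["positive"] * 40
    + ["neutral"] * 1 + ["positive"] * 4
    + ["negative"] * 4 + ["positive"] * 1
)

acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred)
print(f"accuracy = {acc:.3f}")
print(f"macro F1 = {f1:.3f}")
```

In this toy case the majority class dominates accuracy, while the poorly recognized neutral class pulls the macro F1 down, which is one reason to read the two metrics together when comparing models.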

Copyrights © 2024






Journal Info

Abbrev

janapati

Publisher

Subject

Computer Science & IT, Education, Engineering

Description

Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI) is a collection of scientific articles in the broad field of Informatics / ICT Education and in the field of Information Technology, published and managed by Jurusan Pendidikan Teknik Informatika, Fakultas Teknik dan Kejuruan, Universitas ...