The rapid adoption of e-government services in Indonesia has increased the importance of understanding public sentiment toward digital platforms. This study presents a comparative analysis of five models—IndoBERT, mBERT, XLM-R, CNN, and BiLSTM—for sentiment classification on user reviews of NEWSAKPOLE, a public service application for vehicle tax and licensing. A custom dataset of 11,000+ reviews was scraped from the Google Play Store and labeled using a hybrid rating-based and manual validation approach. Each model was evaluated using accuracy, precision, recall, and F1-score. IndoBERT achieved the highest performance with an F1-score of 0.882, outperforming multilingual and classical deep learning models. Confusion matrix analysis showed that transformer-based models were more effective in detecting neutral and mixed sentiments, while CNN and BiLSTM struggled with misclassification. The results highlight IndoBERT's robustness in low-resource sentiment analysis and its potential to enhance public service monitoring and policy feedback mechanisms in Indonesian digital governance.
Copyrights © 2025