bit-Tech
Vol. 8 No. 2 (2025): bit-Tech

Automatic Classification of Public Complaints Using Naive Bayes

Rico Andrean Hardiansyah (Universitas Buana Perjuangan Karawang)
Jamaludin Indra (Universitas Buana Perjuangan Karawang)
Dwi Sulistya Kusumaningrum (Universitas Buana Perjuangan Karawang)
Tohirin Al Mudzakir (Universitas Buana Perjuangan Karawang)



Article Info

Publish Date
10 Dec 2025

Abstract

Public complaint services are essential for improving government service quality by providing a direct channel for citizens to report issues. In Karawang Regency, the Tanggap Karawang (TANGKAR) platform serves this function; however, the manual classification of complaints causes delays and potential misrouting, especially due to the highly imbalanced distribution of complaint categories. This study develops an automatic classification model for public complaints in eight categories economy, education, health, social, infrastructure, security, environment, and transportation by integrating Term Frequency–Inverse Document Frequency (TF–IDF), Multinomial Naive Bayes, and Synthetic Minority Oversampling Technique (SMOTE). This integration addresses domain-specific class imbalance challenges, combining the computational efficiency of Naive Bayes, the feature representation strength of TF–IDF, and the improved minority class recognition from SMOTE. A dataset of 800 complaint records from TANGKAR underwent preprocessing, including cleaning, case folding, normalization, tokenizing, stemming, and stopword removal. TF–IDF with unigram and bigram features was used for feature extraction, followed by classification under two scenarios: original unbalanced data and balanced data via SMOTE. Evaluation metrics included accuracy, precision, recall, F1-score, and confusion matrix. The model achieved 85.09% accuracy without SMOTE and 83.40% with SMOTE, with notable improvement in detecting minority categories after balancing. Although overall accuracy slightly decreased, SMOTE enhanced equitable prediction across all categories. This approach advances current public complaint classification methods by adapting to the linguistic diversity and uneven category distribution in actual e-government data, supporting faster and more accurate decision-making in public complaint management systems.

Copyrights © 2025






Journal Info

Abbrev

bt

Publisher

Subject

Computer Science & IT

Description

The bit-Tech journal was developed with the aim of accommodating the scientific work of Lecturers and Students, both the results of scientific papers and research in the form of literature study results. It is hoped that this journal will increase the knowledge and exchange of scientific ...