Indonesia has experienced a surge in the spread of political hoax news, posing a potential threat to democratic and social stability. This study aims to develop a model for detecting political hoax news in the Indonesian language using IndoBERT, a language model optimized for Indonesian text. The dataset was sourced from Kaggle and comprises 20,928 factual news articles and 2,251 hoax news articles from major Indonesian media outlets, including CNN, Kompas, Tempo, and Turnbackhoax. The imbalance between factual and hoax news articles was addressed through undersampling, resulting in 1,302 samples for each class. The research stages include data collection, preprocessing, IndoBERT model training, and model evaluation. Results indicate that fine-tuning IndoBERT can detect political hoax news with an accuracy of 94.1% and an ROC AUC of 0.991, demonstrating high performance in accuracy and generalization capability. This research is expected to contribute to minimizing the spread of political hoax news in Indonesia and enhance media literacy among the public.
Copyrights © 2025