The authenticity of former President Joko Widodo's diploma has become a hot topic on the digital space, especially in the comments section of Kompas TV's YouTube channel. The wide diversity of opinions reflects a polarization of public opinion that is worth further analysis. Given the large volume of text data from public comments, manual analysis is ineffective; a technology-based approach is needed to systematically group opinions. Therefore, this study was conducted to analyze public opinion polarization using a machine learning approach. Two classification algorithms, Naive Bayes and Random Forest, were used to distinguish between pro and con public comments on the issue. Data were obtained through an automated collection process (web scraping), followed by text pre-processing and TF-IDF (Term Frequency–Inverse Document Frequency) word weighting. The test results showed that the Random Forest algorithm performed best, with an accuracy of 91%, while Naïve Bayes achieved only 74%. This shows that the Random Forest method is more effective than the Naïve Bayes approach in detecting unstructured text patterns. This study concludes that machine learning can be used effectively to identify trends in public opinion on social media and can serve as a basis for further research using word embedding and deep learning models.
Copyrights © 2026