Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Jurnal Riset Informatika

COMPARATIVE MACHINE LEARNING ALGORITHMS FOR YOUTUBE SENTIMENT ANALYSIS ON DPR DEMONSTRATION 2025 USING LEXICON Samsudin, Syafri; Abdul Chamid, Ahmad; Jazuli, Ahmad
Jurnal Riset Informatika Vol. 8 No. 1 (2025): Desember 2025
Publisher : Kresnamedia Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34288/jri.v8i1.470

Abstract

The high volume of public comments on YouTube regarding the DPR Demonstrasion August 2025, which reached 43,910 raw data, presents a significant challenge in conducting efficient sentiment analysis. Time and cost limitations in manual labeling for large-scale datasets are a major obstacle in the development of predictive models. This study aims to address this problem by proposing a hybrid approach that integrates Lexicon-Based auto-labeling with a comparative evaluation of five Machine Learning algorithms. The research methodology included a text preprocessing stage that generated 40,097 unique comments, feature extraction using TF-IDF, and data sharing with an 80:20 ratio. The performance of the Support Vector Machine algorithm was comprehensively compared to Random Forest, Decision Tree, K-Nearest Neighbors, and Naive Bayes. The results of the experiment showed that the SVM model recorded the most superior performance with an accuracy of 96.5% and a weighted F1-Score of 0.966. This score significantly outperformed other benchmarking algorithms, where Random Forest came in second place with 89.2% accuracy, followed by Decision Tree at 85.6%, KNN at 84.6%, and Naive Bayes at the lowest with 84.0%. These findings validate that the integration of Lexicon-Based labeling with SVM classification is a highly accurate, robust, and efficient solution for handling sentiment analysis on large-scale social media data in Indonesia.
TOPIC MODELING OF PUBLIC DISCOURSE ON TWITTER ABOUT THE ASSET CONFISCATION BILL USING LATENT DIRICHLET ALLOCATION (LDA) Azka Bima Aditya; Ahmad Abdul Chamid; Rizkysari Mei Maharani
Jurnal Riset Informatika Vol. 8 No. 2 (2026): Maret 2026
Publisher : Kresnamedia Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34288/jri.v8i2.477

Abstract

This study examines the structure of public discourse on Twitter regarding the Indonesian Asset Confiscation Bill, a policy initiative aimed at strengthening anti corruption enforcement and ensuring legal certainty. Moving beyond conventional sentiment classification, this research identifies how substantive public concerns are thematically organized within digital debate. A total of 14,319 cleaned and deduplicated tweets collected between January and September 2025 were analyzed using Latent Dirichlet Allocation with the optimal model configuration of nine topics selected based on coherence evaluation to ensure semantic interpretability. The findings reveal nine dominant thematic clusters, with law enforcement and regulatory enactment emerging as the primary focus, followed by legislative process dynamics, protest mobilization, party politics, and institutional accountability. These results indicate that online discourse is structured around normative concerns, particularly procedural clarity, fairness, and institutional legitimacy, rather than driven solely by emotional polarity. Scientifically, this study contributes by shifting the analytical emphasis from sentiment polarity toward systematic thematic mapping of digital political discourse using an optimized LDA framework tailored to Indonesian Twitter data characteristics. Practically, the findings provide policymakers with an evidence based monitoring instrument to identify priority public concerns, strengthen legislative communication strategies, and reduce interpretive ambiguity in sensitive regulatory deliberations.