Claim Missing Document
Check
Articles

Found 2 Documents
Search

Klasifikasi Cyberbullying Pada Tweet Bahasa Sunda Dengan Menggunakan Hybrid Learning Model Setyaningrum, Anisa Putri; Nadhif, Muhammad Fahmy
Rekayasa Hijau : Jurnal Teknologi Ramah Lingkungan Vol 9, No 1 (2025)
Publisher : Institut Teknologi Nasional, Bandung

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26760/jrh.v9i1.58-69

Abstract

ABSTRAKCyberbullying dalam bahasa Sunda semakin marak di media sosial, dengan kasus seperti penghinaan fisik, body shaming, dan ancaman yang dapat berdampak negatif pada korban. Namun, deteksi otomatis masih menghadapi tantangan, terutama dalam keterbatasan dataset dan efektivitas metode pemrosesan bahasa alami. Penelitian ini bertujuan untuk mengembangkan sistem deteksi cyberbullying bahasa Sunda menggunakan gabungan model stemming dan hybrid learning. Peneliti menerapkan beberapa model machine learning yaitu random forest dan Support Vector Machine (SVM) serta model deep learning yaitu convolutional neural network-bidirectional long short-term memory (CNN-BiLSTM), CNN, dan BiLSTM. Peneliti melakukan eksperimen untuk mengevaluasi kinerja masing-masing model dengan mengukur akurasi dan F1-score. Berdasarkan hasil penelitian, model hybrid learning memperoleh kinerja terbaik dengan akurasi sebesar 97,3% dan F1-score sebesar 97%. Selain itu, waktu pelatihan pada CNN-BiLSTM lebih cepat dibandingkan dengan model lainnya yaitu sekitar 30 detik per epoch.Kata kunci: Bahasa Sunda, Cyberbullying, Hybrid LearningABSTRACTCyberbullying in the Sundanese language is becoming more common on social media, with cases like physical insults, body shaming, and threats that can seriously affect victims. However, detecting it automatically remains challenging, mainly due to limited datasets and the difficulty of processing the language effectively. This study aims to develop a Sundanese cyberbullying detection system using a combination of stemming and hybrid learning models. The researchers applied several machine learning models, namely random forest and Support Vector Machine (SVM), and deep learning models, namely convolutional neural network-bidirectional long short-term memory (CNN-BiLSTM), CNN, and BiLSTM. The researchers conducted experiments to evaluate the performance of each model by measuring the accuracy and F1-score. Based on the results, the hybrid learning model achieved the best performance, with an accuracy of 97.3% and an F1-score of 97%. Besides that, the training time on CNN-BiLSTM is faster than the others which is approximately 30 seconds per epoch.Keywords: Sundanese, Cyberbullying, Hybrid Learning
Evaluating Single and Hybrid Feature Selection for Rainfall Prediction Using XGBoost Widoyono, Bambang; Nadhif, Muhammad Fahmy; Eryadi, Ridha Adjie
Indonesian Journal of Artificial Intelligence and Data Mining Vol 9, No 1 (2026): March 2026
Publisher : Universitas Islam Negeri Sultan Syarif Kasim Riau

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24014/ijaidm.v9i1.39110

Abstract

Rainfall prediction is challenging due to the complex and nonlinear nature of meteorological data. Previous studies using XGBoost with feature selection have demonstrated superior performance compared to other models, but evaluations have focused solely on error metrics (RSME, SME, MAE). Recent research suggests that predictive models should be evaluated for generalization, stability, interpretability, and computational efficiency to ensure their reliability. To close this gap, this study uses 8,750 hourly records obtained from Open-Meteo with 81 engineered features to evaluate XGBoost under three scenarios: without feature selection, single feature selection (MI, Boruta, SHAP, mRMR, ReliefF), and hybrid feature selection. Our findings demonstrate that accuracy is not always increased by feature selection. It does, however, increase interpretability, decrease overfitting, and improve computational efficiency. SHAP provides the most reliable performance among single methods, achieving lower RMSE (0.72632) and improved stability. Hybrid feature selection produces the most balanced performance gap = 0.01325, and stable variance = 0.03315 while reducing feature complexity to 35 variables. This study theoretically shows the value of multidimensional evaluation that goes beyond error metrics. In practical terms, this study suggests a feature selection method for rainfall prediction systems that are effective, reliable, and simple to understand.