Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Jurnal Riset Informatika

Machine Learning for Stroke Prediction: Evaluating the Effectiveness of Data Balancing Approaches Muhamad Indra; Siti Ernawati; Ilham Maulana
Jurnal Riset Informatika Vol. 6 No. 4 (2024): September 2024
Publisher : Kresnamedia Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34288/jri.v6i4.344

Abstract

Stroke occurs due to disrupted blood flow to the brain, either from a blood clot (ischemic) or a ruptured blood vessel (hemorrhagic), leading to brain tissue damage and neurological dysfunction. It remains a leading cause of death and disability worldwide, making early prediction crucial for timely intervention. This study evaluates the impact of data balancing techniques on stroke prediction performance across different machine learning models. Random Forest (RF) consistently achieves the highest accuracy (98%) but struggles with precision and recall variations depending on the balancing method. Decision Tree (DT) and K-Nearest Neighbors (KNN) benefit most from SMOTE and SMOTETomek, improving their F1-scores (11.21% and 9.18%), indicating better balance between precision and recall. Random Under Sampling enhances recall across all models but reduces precision, leading to lower overall predictive reliability. SMOTE and SMOTETomek emerge as the most effective balancing techniques, particularly for DT and KNN, while RF remains the most accurate but requires further optimization to improve precision and recall balance.
Enhancing Obesity Risk Classification: Tackling Data Imbalance with SMOTE and Deep Learning Muhammad Syofian; Ilham Maulana
Jurnal Riset Informatika Vol. 6 No. 4 (2024): September 2024
Publisher : Kresnamedia Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (3011.529 KB) | DOI: 10.34288/jri.v6i4.349

Abstract

Data imbalance is a significant challenge in classification models, often leading to suboptimal performance, especially for minority classes. This study explores the effectiveness of the Synthetic Minority Over-sampling Technique (SMOTE) in improving classification model performance by balancing data distribution. The evaluation was conducted using a confusion matrix to measure prediction accuracy for each class. The results indicate that SMOTE successfully enhances minority class representation and improves prediction balance, although some misclassifications remain. Therefore, in addition to oversampling, additional approaches such as class weighting or ensemble learning are required to further improve model accuracy. This study provides deeper insights into the role of SMOTE in addressing data imbalance and its impact on classification model performance.