Random Forest (RF) is a popular Machine Learning (ML) approach extensively employed for addressing classification issues. Nevertheless, the RF method for classification problems demonstrates suboptimal performance in cases of data imbalance. There are several approaches to enhance RF performance when coping with data imbalance issues, such as using weighting and oversampling. This research explores the intervention of RF in addressing data imbalances, focusing on case studies of health problem classification in Java This study aims to develop models to analyze the health status of regions using RF, WRF, SMOTE-RF, and SMOTE-WRF methods. The objective is to compare the performance of these models and identify the best model for classifying DBK and Non-DBK categories in Java. The research results show that SMOTE-WRF is the most effective model in classifying DBK, achieving an accuracy level of 93.62%, sensitivity of 85.71%, precision of 75%, F-score of 80%, and AUC of 93.57%. The three key variables in the SMOTE-WRF model entail access to adequate sanitation, egg and milk consumption, and the number of doctors
Copyrights © 2025