Diabetes is a chronic metabolic disease and one of the leading causes of death worldwide, with the number of sufferers projected to reach 1.3 billion by 2050. Delayed diagnosis remains a primary challenge, as nearly half of those affected are unaware of their condition in the early stages, thereby increasing the risk of fatal complications. Data mining approaches using classification algorithms have been widely utilized for early screening. However, the development of medical record models is often hindered by imbalanced data, which causes models to be biased toward the majority class and reduces detection sensitivity for the minority class (patients with diabetes). Furthermore, there is a lack of research integrating these predictive models into responsive application interfaces for end-users. Consequently, this study implements Random Forest optimized with the SMOTE (Synthetic Minority Over-sampling Technique) into a web-based application to serve as a practical early detection tool. Random Forest was selected for its ability to handle complex data and reduce the risk of overfitting. The research stages include data preprocessing, balancing training data using SMOTE, model parameter adjustment through hyperparameter tuning with Grid Search, and the development of a client-server architecture using AstroJS and Flask. The evaluation results demonstrate that the use of SMOTE significantly improves the model's ability to identify the minority class. The model achieved a Recall of 75.0% and an overall accuracy of 95.8%, effectively minimizing False Negative errors. The developed application was verified through Black Box Testing and was declared successful as a responsive and accessible early detection tool for both healthcare professionals and the general public.
Copyrights © 2026