Diabetes is a chronic disease posing significant health challenges globally, with rising prevalence due to genetic, lifestyle, and environmental factors. This research employs the Random Forest methodology to analyze diabetes risk factors and predict outcomes using a dataset of 768 patient records. Key attributes such as glucose levels, BMI, blood pressure, and age were evaluated to uncover their contribution to diabetes risk. The study achieved an overall accuracy of 72%, with glucose emerging as the most influential predictor, followed by BMI and age. While the model showed strong performance in identifying non-diabetic cases, moderate precision and recall for diabetic cases highlighted the impact of class imbalance. Feature importance analysis provided actionable insights, emphasizing glucose and BMI monitoring in diabetes management. Despite its strengths, challenges such as class imbalance and feature redundancy were noted, suggesting the need for oversampling techniques, additional variables, and advanced feature engineering. These findings demonstrate the utility of Random Forest in healthcare analytics, supporting predictive and preventive care strategies. Future research should focus on integrating lifestyle factors, expanding datasets, and exploring advanced machine learning models to enhance predictive accuracy and real-world applicability.
Copyrights © 2023