This study explores the effectiveness of Logistic Regression in predicting heart disease using a dataset derived from multiple international databases. Employing a 5-fold cross-validation method, the research aimed to evaluate the model's accuracy, precision, recall, and F1-score. Results indicated that Logistic Regression performs robustly, with accuracy ranging from 80% to 88.29%, and high recall rates, highlighting its potential as a valuable tool in medical diagnostics. Despite some variability in precision, which may lead to higher false positive rates, the model's high recall is crucial in clinical settings where missing a diagnosis can have dire consequences. The research confirmed the applicability of Logistic Regression to binary classification problems in healthcare, aligning with existing literature that supports its use in similar contexts. The study contributes to the field by demonstrating the model's consistency and reliability across diverse data subsets, reinforcing the potential for machine learning applications in healthcare diagnostics. Future research should focus on integrating Logistic Regression with other models to improve accuracy and testing the model on more current, varied datasets to enhance its generalizability and effectiveness in real-world settings.
Copyrights © 2024