Indonesian Journal of Electrical Engineering and Computer Science
Vol 34, No 2: May 2024

Heart disease prediction using ML through enhanced feature engineering with association and correlation analysis

Annemneedi Lakshmanarao (Aditya Engineering College)
Thotakura Venkata Sai Krishna (QIS College of Engineering and Technology (Autonomous))
Tummala Srinivasa Ravi Kiran (P.B. Siddhartha College of Arts and Science Vijayawada)
Chinta Venkata Murali krishna (NRI Institute of Technology)
Samsani Ushanag (University College of Engineering Kakinada)
Nandikolla Supriya (Malla Reddy University)



Article Info

Publish Date
01 May 2024

Abstract

Heart disease remains a prevalent and critical health concern globally. This paper addresses the critical task of heart disease prediction through the utilization of advanced machine learning techniques. Our approach focuses on the enhancement of feature engineering by incorporating a novel integration of association and correlation analyses. A heart disease dataset from Kaggle was used for the experiments. Association analysis was applied to the categorical and binary features in the dataset. Correlation analysis was applied to the numerical features in the dataset. Based on the insights from association analysis and correlation analysis, a new dataset was created with combinations of features. Later, newly created features are integrated with the original dataset, and classification algorithms are applied. Five machine learning (ML) classifiers, namely decision tree, k-nearest neighbors (KNN), random forest, XG-Boost, and support vector machine (SVM), were applied to the final dataset and achieved a good accuracy rate for heart disease detection. By systematically exploring associations and relationships with categorical, binary, and numerical features, this paper unveils innovative insights that contribute to a more comprehensive understanding of the heart disease dataset.

Copyrights © 2024