Cardiovascular disease is a disorder of the heart and blood vessels that can lead to heart attacks, strokes, and heart failure, so early detection is essential. This study compares Multilayer Perceptron (MLP) and Random Forest for risk classification in a Kaggle dataset containing 70,000 samples with balanced targets. Pre-processing included age conversion, outlier cleaning, standardization, and feature selection based on feature importance. Both models were optimized using RandomizedSearchCV and evaluated using accuracy, precision, recall, F1-score, AUC-ROC, confusion matrix, and k-fold cross-validation. The results show that the accuracy of MLP is 73.90% and Random Forest is 74.23% with an AUC of 0.80 for both. Random Forest is more stable across all folds and performs better on the negative class, while MLP is slightly more sensitive to the positive class. Independent t-test and Mann-Whitney U tests show p>0.05, indicating that the difference in performance is not significant. The most influential features were diastolic blood pressure, age, cholesterol, and systolic blood pressure. The non-clinical Streamlit prototype demonstrated the model's potential for education and initial decision support.
Copyrights © 2025