Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Vol 9 No 5 (2025): October 2025

Early Stroke Disease Prediction Based on Lifestyle Factors Applied with Machine Learning

Suastika Yulia Riska
Lia Farokhah



Article Info

Publish Date
25 Oct 2025

Abstract

Stroke prediction draws on many supporting features and variables. Some prediction approaches focus on clinical health indicators or on conditions that are already present. Predicting stroke risk from habitual (lifestyle) factors offers more scope for preventive action. The number of features or variables is also a concern, because it adds complexity to stroke risk prediction. In this study, we used a public dataset from Kaggle with 10 features. We propose combining classification algorithms with preprocessing that applies feature selection using Pearson Correlation and dimension reduction using Principal Component Analysis (PCA), in order to reduce both the complexity of the variables and the computational cost of data processing. The aim is to predict stroke risk more simply. The experimental results show that feature selection using the Pearson Correlation between features and labels gives the best results with 5 of the 10 available features. This approach yields the best performance with Naïve Bayes, Iterative Dichotomiser 3 (ID3), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Logistic Regression, reaching 100% accuracy while reducing the number of features by 50%, which lowers the complexity of the prediction variables and the data processing computation.
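As an illustration of the feature selection step described in the abstract, the following is a minimal sketch that ranks features by the absolute Pearson correlation between each feature and the label, then keeps the top 5 of 10. It is not the authors' code and does not use the Kaggle dataset: the synthetic data, the column names (`signal_0`, `noise_0`, and so on), and the helpers `pearson`, `select_top_k`, and `make_toy_data` are all hypothetical. The PCA step and the classifiers are not shown.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def select_top_k(columns, labels, k):
    """Return the k feature names with the largest |r| against the label."""
    scores = {name: abs(pearson(values, labels)) for name, values in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


def make_toy_data(n_rows=500, seed=0):
    """Hypothetical stand-in data: 5 label-related columns and 5 noise columns."""
    rng = random.Random(seed)
    labels = [rng.randint(0, 1) for _ in range(n_rows)]
    columns = {}
    for i in range(5):
        # Label plus Gaussian noise, so the column correlates with the label.
        columns[f"signal_{i}"] = [y + rng.gauss(0, 0.5) for y in labels]
    for i in range(5):
        # Pure Gaussian noise, independent of the label.
        columns[f"noise_{i}"] = [rng.gauss(0, 1) for _ in labels]
    return columns, labels


columns, labels = make_toy_data()
selected = select_top_k(columns, labels, k=5)
print(selected)
```

Ranking by the absolute value of the coefficient treats strong negative and strong positive correlations as equally informative, which is a common choice but an assumption here; the abstract does not state how the 5 features were chosen beyond using Pearson Correlation between features and labels.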

Copyright © 2025






Journal Info

Abbrev

RESTI

Publisher

Subject

Computer Science & IT Engineering

Description

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) is intended as a medium for scholarly study of research results, ideas, and critical-analytical studies concerning research in Systems Engineering, Informatics Engineering/Information Technology, Informatics Management, and Information Systems. As part of the spirit ...