Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control
Vol. 10, No. 4, November 2025 (Article in Progress)

A Metaheuristic wrapper approach to feature selection with genetic algorithm for enhancing XGBoost classification in diabetes prediction

Alamsyah, Nur (Unknown)
Budiman (Unknown)
Danestiara, Venia Restreva (Unknown)
Yoga, Titan Parama (Unknown)
Nursyanti, Reni (Unknown)
Kaunang, Valencia (Unknown)



Article Info

Publish Date
16 Oct 2025

Abstract

This study addressed the problem of selecting the most relevant features for improving the accuracy of diabetes classification using health indicator data. The research focused on a binary classification task based on the Behavioral Risk Factor Surveillance System dataset, which comprised over seventy thousand records and twenty-one predictive features related to individual health behaviors and conditions. A metaheuristic wrapper approach was developed by integrating a Genetic Algorithm for feature selection with an XGBoost classifier to evaluate the predictive quality of each feature subset. The fitness function was defined as the average classification accuracy obtained through cross-validation. In addition to feature selection, hyperparameter optimization of the XGBoost model was carried out using a Bayesian-based search strategy to further enhance performance. The proposed method successfully identified a subset of fourteen optimal features that contributed most significantly to the prediction of diabetes. The final model, combining the selected features and optimized parameters, achieved an accuracy of 0.753, outperforming baseline models trained with all features and models using features selected by deterministic methods. These results confirmed the effectiveness of combining evolutionary feature selection with model tuning to build efficient and interpretable predictive models for medical data classification. This approach demonstrated a practical solution for managing high-dimensional data in the context of chronic disease prediction.

Copyrights © 2025






Journal Info

Abbrev

kinetik

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering Energy Engineering

Description

Kinetik: Game Technology, Information System, Computer Network, Computing, Electronics, and Control was published by Universitas Muhammadiyah Malang. journal is open access journal in the field of Informatics and Electrical Engineering. This journal is available for researchers who want to improve ...