Infolitika Journal of Data Science
Vol. 2 No. 2 (2024): November 2024

Advanced Anemia Classification Using Comprehensive Hematological Profiles and Explainable Machine Learning Approaches

Noviandy, Teuku Rizky (Unknown)
Idroes, Ghifari Maulana (Unknown)
Suhendra, Rivansyah (Unknown)
Bakri, Tedy Kurniawan (Unknown)
Idroes, Rinaldi (Unknown)



Article Info

Publish Date
29 Nov 2024

Abstract

Anemia is a common health issue with serious clinical effects, making timely and accurate diagnosis essential to prevent complications. This study explores the use of machine learning (ML) methods to classify anemia and its subtypes using detailed hematological data. Six ML models were tested: Gradient Boosting, Random Forest, Naive Bayes, Logistic Regression, Support Vector Machine, and K-Nearest Neighbors. The dataset was preprocessed using feature standardization and the Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance. Gradient Boosting delivered the highest accuracy, sensitivity, and F1-score, establishing itself as the top-performing model. SHapley Additive exPlanations (SHAP) analysis was applied to enhance model interpretability, identifying key predictive features. This study highlights the potential of explainable ML to develop efficient, accurate, and scalable tools for anemia diagnosis, fostering improved healthcare outcomes globally.

Copyrights © 2024






Journal Info

Abbrev

ijds

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Engineering

Description

Infolitika Journal of Data Science is a distinguished international scientific journal that showcases high caliber original research articles and comprehensive review papers in the field of data science. The journals core mission is to stimulate interdisciplinary research collaboration, facilitate ...