Infolitika Journal of Data Science
Vol. 3 No. 2 (2025): November 2025

An Interpretable Machine Learning Framework for Predicting Advanced Tumor Stages

Noviandy, Teuku Rizky
Patwekar, Mohsina
Patwekar, Faheem
Idroes, Rinaldi



Article Info

Publish Date
29 Nov 2025

Abstract

Accurate identification of advanced tumor stages is essential for timely clinical decision-making and personalized treatment planning. This study proposes an explainable ensemble learning framework for predicting advanced tumor stage using a dataset containing 10,000 samples with 18 clinical and radiological features. Four machine learning models, namely Logistic Regression, Naïve Bayes, AdaBoost, and LightGBM, were evaluated using stratified train–test splits along with standard performance metrics. LightGBM achieved the highest performance, with an accuracy of 86.05% and an F1-score of 76.61%, outperforming linear and probabilistic classifiers. ROC–AUC and precision–recall analyses further confirmed the superior discriminative ability of ensemble methods. SHAP explainability techniques highlighted mitotic count, Ki-67 index, enhancement, and necrosis as the most influential predictors of advanced stage. The proposed framework demonstrates strong predictive capability and provides clinically interpretable insights, underscoring its potential as a decision-support tool in oncological diagnostics. Future work will involve external validation and integration of additional multimodal data to enhance generalizability.
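
The evaluation protocol named in the abstract (a stratified train-test split scored with accuracy and F1) can be illustrated with a minimal sketch. This is not the authors' code: the abstract does not say which libraries or split ratio were used, so the helper names `stratified_split` and `binary_metrics`, the 80/20 ratio, and the toy labels below are all assumptions made for illustration, written with the Python standard library only.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Split indices so each class keeps roughly the same share in train and test.

    Hypothetical helper; the 0.2 default is an assumption, not the study's ratio.
    """
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)  # shuffle within each class only
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 for a binary task (hypothetical helper)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy labels (made up, not the study's data): 70 non-advanced (0), 30 advanced (1).
labels = [0] * 70 + [1] * 30
train_idx, test_idx = stratified_split(labels, test_fraction=0.2, seed=42)

# Hypothetical predictions for ten held-out cases, three of them advanced.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
scores = binary_metrics(y_true, y_pred)
```

In this toy case the split keeps the 70/30 class ratio in both partitions, and the predictions score 0.80 accuracy but only about 0.67 F1, because F1 ignores the easy true negatives. A gap in the same direction appears in the reported LightGBM results (86.05% accuracy, 76.61% F1), although the abstract does not state the class balance that would explain it.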

Copyright © 2025






Journal Info

Abbrev

ijds

Publisher

Subject

Computer Science & IT; Decision Sciences, Operations Research & Management; Engineering

Description

Infolitika Journal of Data Science is a distinguished international scientific journal that showcases high-caliber original research articles and comprehensive review papers in the field of data science. The journal's core mission is to stimulate interdisciplinary research collaboration, facilitate ...