Breast cancer remains a major health challenge, affecting approximately 1.7 million individuals annually and often leading to severe complications. Predicting survival outcomes is difficult because the available data are highly imbalanced: the dataset contains 3,408 death cases but only 616 survival cases, a ratio of roughly 5.5 to 1. To address this issue, we applied Firefly Algorithm-based under-sampling (FAUS) to balance the dataset and combined it with three machine learning classifiers: Random Forest (RF), Decision Tree (DT), and K-Nearest Neighbor (KNN). Experimental results show that FAUS substantially improves predictive performance over conventional under-sampling. Among the tested models, RF achieved the highest F1-score (0.79), followed by DT (0.72) and KNN (0.68). These results indicate that FAUS is effective in preserving representative samples, thereby enhancing model performance in breast cancer survival prediction.
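To make the comparison baseline and the evaluation metric concrete, the following is a minimal sketch of conventional random under-sampling and of the F1-score, using only the class counts stated above. It is not FAUS itself, which selects majority samples with a firefly search rather than at random. The label encoding (0 for death, 1 for survival), the choice of positive class, and the function names `random_under_sample` and `f1_score` are assumptions made for illustration.

```python
import random
from collections import Counter


def random_under_sample(labels, seed=0):
    """Return sorted indices that keep every sample of the smallest class
    and an equally sized random subset of each larger class."""
    rng = random.Random(seed)
    by_class = {}
    for i, y in enumerate(labels):
        by_class.setdefault(y, []).append(i)
    n_min = min(len(idx) for idx in by_class.values())
    kept = []
    for idx in by_class.values():
        # Classes already at the minimum size are kept whole.
        kept.extend(idx if len(idx) == n_min else rng.sample(idx, n_min))
    return sorted(kept)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the chosen positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Class counts from the abstract; the 0/1 encoding is an assumption.
labels = [0] * 3408 + [1] * 616
kept = random_under_sample(labels)
balanced = Counter(labels[i] for i in kept)
```

After this step `balanced` holds 616 samples per class, so 2,792 majority samples are discarded without regard to how representative they are. That loss is what a guided selection such as FAUS aims to reduce.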
Copyrights © 2026