Prostate cancer remains a leading cause of cancer-related deaths among men globally, emphasizing the critical need for accurate diagnostic tools. This study investigates the application of Gradient Boosting Machines (GBMs) for prostate cancer detection using a dataset with key tumor characteristics such as radius, texture, area, and symmetry. Data preprocessing included normalization, missing value handling, and the Synthetic Minority Oversampling Technique (SMOTE) to address class imbalance. The GBM model demonstrated an accuracy of 75%, with high precision (82%) and recall (88%) for malignant cases, underscoring its potential as a reliable diagnostic tool. However, the model's performance for benign cases was limited by severe class imbalance, reflected in a precision of 33% and recall of 25%. Interpretability was enhanced using SHAP values, identifying key predictors like tumor perimeter and compactness. While GBMs show promise in prostate cancer diagnostics, future research should incorporate multimodal data, advanced balancing techniques, and rigorous validation frameworks to enhance generalizability and fairness. This study highlights the value of machine learning in healthcare, contributing to improved diagnostic accuracy and patient outcomes.
Copyrights © 2025