Claim Missing Document
Check
Articles

Found 2 Documents
Search

Komparasi Algoritma Machine Learning dalam Klasifikasi Kanker Payudara Afiatuddin, Nurfadlan; Wicaksono, M Teguh; Akbar, Vitto Rezky; Rahmaddeni, Rahmaddeni; Wulandari, Denok
JURNAL MEDIA INFORMATIKA BUDIDARMA Vol 8, No 2 (2024): April 2024
Publisher : Universitas Budi Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30865/mib.v8i2.7457

Abstract

Every year, millions of women are faced with a serious global health issue: breast cancer. This research aims to improve the efficiency of breast cancer classification using machine learning. One of the main challenges encountered is the imbalance between the number of malignant and benign cases in the dataset. Therefore, this study aims to compare the performance of several machine learning algorithms in classifying breast cancer, such as Decision Tree, Naive Bayes, K-Nearest Neighbors, Logistic Regression, and Random Forest. Preprocessing data, dividing data with various ratios, and testing various classification algorithms are the techniques used in this research. The dataset used originates from the Wisconsin Breast Cancer Diagnosis dataset from the Kaggle platform. The Synthetic Minority Over-Sampling Technique (SMOTE) is used to achieve balance in the proportions of imbalanced classes. After hyperparameter tuning, Logistic Regression showed the best performance with accuracy reaching 100% in several situations. This study concludes that the use of machine learning, especially with techniques for handling class imbalance, can improve the ability to detect breast cancer early. Additionally, this research also helps understand the best algorithms to improve accuracy in classifying breast cancer, providing support for healthcare professionals in early diagnosis, and enhancing the quality of patient care.
Evaluation of Data Mining in Heart Failure Disease Classfication: Afiatuddin, Nurfadlan; Rahmaddeni, Rahmaddeni; Pratiwi, Fitri; Septia, Rapindra; Hendrawan, Heri
CogITo Smart Journal Vol. 10 No. 2 (2024): Cogito Smart Journal
Publisher : Fakultas Ilmu Komputer, Universitas Klabat

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31154/cogito.v10i2.726.460-473

Abstract

This study evaluates the effectiveness of data mining algorithms in heart failure disease classification. Various algorithms, including Random Forest, Decision Tree C4.5, Gradient Boosted Machine (GBM), and XGBoost, were applied to a heart failure dataset. The dataset was collected from multiple sources and preprocessed to address imbalances using the SMOTE (Synthetic Minority Over-sampling Technique) technique. The results indicate that employing SMOTE and parameter optimization through grid search significantly enhances the performance of these algorithms. XGBoost and GBM demonstrated superior accuracy, precision, and recall in both balanced and imbalanced data scenarios. In balanced data scenarios, XGBoost achieved an accuracy of 98.75% with an error rate of 1.25%, while GBM achieved an accuracy of 98.60% with an error rate of 1.40%. The study confirms that appropriate data preprocessing and parameter optimization are crucial for improving the accuracy of medical data analysis. These findings suggest that XGBoost and GBM are highly effective for heart disease prediction, supporting early diagnosis and timely medical intervention. Future research should explore alternative preprocessing techniques and additional algorithms to further improve prediction outcomes.