TIN: TERAPAN INFORMATIKA NUSANTARA
Vol 5 No 2 (2024): July 2024

Optimasi Metode Support Vector Machine Menggunakan Seleksi Fitur Recursive Feature Elimination dan Forward Selection untuk Klasifikasi Kanker Payudara

Septiany, Eva Senia (Unknown)
Handayani, Hanny Hikmayanti (Unknown)
Mudzakir, Tohirin Al (Unknown)
Masruriyah, Anis Fitri Nur (Unknown)



Article Info

Publish Date
31 Jul 2024

Abstract

Cancer, the leading cause of global death, results from abnormal cell proliferation that spreads beyond the boundaries of normal tissue. Breast cancer is one of the most common types of cancer, with approximately 2.26 million cases reported in 2020. This research aims to develop a more effective Support Vector Machine (SVM) algorithm for breast cancer classification through efficient feature selection techniques. Previous research has used various algorithms such as K-Nearest Neighbor and Logistic Regression for breast cancer identification. This research focuses on improving accuracy by using alternative feature selection methods such as Recursive Feature Elimination (RFE) and Forward Selection. The dataset used consists of 569 instances with 32 features sourced from the UCI Machine Learning Repository, and classified into benign and malignant categories. Data pre-processing methods, including data cleaning, coding, and feature selection, were applied to the dataset. RFE and Forward Selection techniques were used to identify the most important features for model training. Evaluation of the improved SVM model shows a training accuracy of nearly 100% and a Cross Validation accuracy of 97%, demonstrating the effectiveness of the proposed approach in the context of breast cancer. In addition, the Learning Curve and testing showed the stability of the SVM model with no signs of overfitting or underfitting. Thus, this study developed an SVM algorithm with a feature selection method that produces better accuracy results in breast cancer classification.

Copyrights © 2024