Journal : JAMBURA JOURNAL OF PROBABILITY AND STATISTICS

Comparing Logistic Regression and Support Vector Machine in Breast Cancer Problem
Caecilia Bintang Girik Allo; Leonardus Sandy Ade Putra; Nicea Roona Paranoan; Vincentius Abdi Gunawan
Jambura Journal of Probability and Statistics Vol 4, No 1 (2023): Jambura Journal Of Probability and Statistics
Publisher : Department of Mathematics, Universitas Negeri Gorontalo

DOI: 10.34312/jjps.v4i1.19246

Abstract

Many methods are available for classification problems, which arise in many different fields. Support Vector Machine (SVM) is now a popular classification method that many researchers have adopted. Even with the same method and the same dataset, different ways of splitting the data into training and testing sets can yield different prediction accuracy, which is crucial in classification. In this paper, we compare the prediction accuracy of SVM and Logistic Regression to determine which method better classifies the current condition of a patient after undergoing treatment. Several data-processing steps are applied, including feature selection, feature extraction, and separating the training and testing data using holdout and k-fold cross-validation (CV). Stepwise selection is used to reduce the number of features. The training and testing datasets are obtained using five stratified and non-stratified holdout splits and five-fold stratified and non-stratified cross-validation. The results show that the best method for classifying the cancer dataset is SVM with a radial kernel under five-fold stratified cross-validation, with an accuracy of 81.816% and a variance of 0.94%.
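
The stratified five-fold split mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `stratified_kfold`, the toy labels, and the seed are all assumptions made for illustration, and the sketch uses only the Python standard library. It deals the samples of each class round-robin into the folds, so every test fold keeps roughly the class proportions of the full dataset.

```python
import random
from collections import defaultdict


def stratified_kfold(labels, k=5, seed=0):
    """Return k (train_idx, test_idx) pairs whose test folds keep the class mix.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for i, y in enumerate(labels):
        by_class[y].append(i)

    # Shuffle within each class, then deal indices round-robin into the folds
    # so that each fold receives a near-equal share of every class.
    folds = [[] for _ in range(k)]
    for idxs in by_class.values():
        rng.shuffle(idxs)
        for pos, i in enumerate(idxs):
            folds[pos % k].append(i)

    # Each fold serves once as the test set; the remaining folds form the training set.
    splits = []
    for f in range(k):
        test = sorted(folds[f])
        train = sorted(i for g in range(k) if g != f for i in folds[g])
        splits.append((train, test))
    return splits


# Assumed toy labels: 70 samples of class 0 and 30 samples of class 1.
labels = [0] * 70 + [1] * 30
splits = stratified_kfold(labels, k=5, seed=42)
for train, test in splits:
    positives = sum(labels[i] for i in test)
    print(len(train), len(test), positives)
```

With these toy labels, every test fold holds 20 samples, 6 of them from class 1, which matches the 30% share in the full set. A non-stratified split would shuffle all indices together, so the class share in each fold could drift from 30%, which is one way the choice of splitting scheme can change the measured accuracy.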