JAMBURA JOURNAL OF PROBABILITY AND STATISTICS
Vol 4, No 1 (2023): Jambura Journal Of Probability and Statistics

Comparing Logistic Regression and Support Vector Machine in Breast Cancer Problem

Caecilia Bintang Girik Allo (Universitas Cenderawasih)
Leonardus Sandy Ade Putra (Universitas Tanjungpura)
Nicea Roona Paranoan (Universitas Cenderawasih)
Vincentius Abdi Gunawan (Universitas Palangka Raya)



Article Info

Publish Date
11 Jun 2023

Abstract

Many methods are available for classification problems, and they are applied across many different fields. Support Vector Machine (SVM) is currently a popular classification method that has been studied by many researchers. Even with the same method and the same dataset, different ways of dividing the data into training and testing sets can yield different prediction accuracy, which is crucial in classification. In this paper, we compare the prediction accuracy of SVM and Logistic Regression to determine which method better classifies the current condition of the patient after undergoing some treatment. Several treatments are applied in this paper, including feature selection, feature extraction, and separating the training and testing data using Holdout and K-Fold cross validation (CV). Stepwise selection is used to reduce the number of features. Training and testing datasets are obtained using five stratified and non-stratified holdout splits and five-fold stratified and non-stratified cross validation. The results show that the best method for classifying the cancer dataset is SVM with a radial kernel under five-fold stratified cross validation. The obtained accuracy is 81.816% with a variance of 0.94%.
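As a rough illustration of the stratified five-fold splitting mentioned in the abstract, the sketch below deals sample indices into folds so that each fold keeps the overall class proportions, and then summarizes per-fold accuracies by their mean and variance. It is only a sketch under stated assumptions, not the authors' implementation: the function names `stratified_k_fold` and `summarize`, the 70/30 toy labels, and the use of the population variance are all hypothetical choices made for this example.

```python
import random
from collections import defaultdict
from statistics import mean, pvariance


def stratified_k_fold(labels, k=5, seed=0):
    """Split sample indices into k folds that preserve class proportions."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal each class's indices round-robin so every fold gets a fair share.
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)
    return folds


def summarize(fold_accuracies):
    """Return the mean and population variance of per-fold accuracies.

    Assumption: the abstract does not say whether the reported variance
    is the population or the sample variance.
    """
    return mean(fold_accuracies), pvariance(fold_accuracies)


# Hypothetical toy labels (70 of class 0, 30 of class 1), not the paper's data.
labels = [0] * 70 + [1] * 30
folds = stratified_k_fold(labels, k=5)

for i, test_idx in enumerate(folds):
    # Every fold serves once as the test set; the rest form the training set.
    train_idx = [j for f in folds if f is not test_idx for j in f]
    n_class_1 = sum(labels[j] for j in test_idx)
    print(f"fold {i}: train={len(train_idx)} test={len(test_idx)} class_1_in_test={n_class_1}")
```

In a non-stratified split the indices would be shuffled and divided without regard to class, so the share of each class can differ from fold to fold. A classifier such as SVM or logistic regression would be trained on `train_idx` and scored on `test_idx` in each round, and the resulting accuracies passed to `summarize`.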

Copyrights © 2023






Journal Info

Abbrev

jps

Subject

Computer Science & IT; Decision Sciences, Operations Research & Management; Environmental Science; Social Sciences

Description

Probability Theory, Mathematical Statistics, Computational Statistics, Stochastic Processes, Financial Statistics, Bayesian Analysis, Survival Analysis, Time Series Analysis, Neural Network, Another field which is related to statistics and the applications, Another field which is related to Probability and ...