Journal : Inferensi

Model Evaluation for Logistic Regression and Support Vector Machines in Diabetes Problem
Baiq Siska Febriani Astuti; Neni Alya Firdausanti; Santi Wulan Purnami
Inferensi Vol 1, No 2 (2018): Inferensi
Publisher : Department of Statistics ITS

DOI: 10.12962/j27213862.v1i2.6728

Abstract

Machine learning is a family of computational algorithms that solve problems based on data already available in a database. Classification is one of the important supervised learning methods in machine learning. Support Vector Machines (SVMs) and logistic regression are supervised learning methods that can be used for both classification and regression. In the data mining process, preprocessing is an important step before further analysis, and feature selection and the division of the data into training and testing sets are important parts of it. This research compares several model evaluation schemes for dividing the data into training and testing sets, namely Random Repeated Holdout, Stratified Repeated Holdout, Random Cross-Validation, and Stratified Cross-Validation. These schemes are applied to logistic regression and SVMs. The analysis concludes that feature selection can improve the classification accuracy of logistic regression, but has the opposite effect on SVMs. As for the partitioning of training and testing data, it cannot be determined which method is better, because every partitioning method relies on random selection. In this particular case, the choice of model evaluation scheme could not be shown to improve the performance of the SVM model.
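
The four partitioning schemes named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the function names (`holdout_split`, `repeated_holdout`, `kfold_split`), the test fraction, the number of folds, and the seeding strategy are all assumptions chosen for illustration, and only the standard library is used.

```python
import random
from collections import defaultdict


def _index_groups(labels, stratified):
    """Group sample indices by class if stratified, else use one group."""
    if not stratified:
        return [list(range(len(labels)))]
    groups = defaultdict(list)
    for i, y in enumerate(labels):
        groups[y].append(i)
    return list(groups.values())


def holdout_split(labels, test_fraction=0.3, stratified=False, seed=0):
    """One holdout split; returns (train_indices, test_indices)."""
    rng = random.Random(seed)
    train, test = [], []
    for idx in _index_groups(labels, stratified):
        idx = idx[:]
        rng.shuffle(idx)
        # With stratification, the test fraction is taken per class.
        n_test = round(len(idx) * test_fraction)
        test.extend(idx[:n_test])
        train.extend(idx[n_test:])
    return sorted(train), sorted(test)


def repeated_holdout(labels, repeats=10, **kwargs):
    """Repeat the holdout split, using a different seed for each repeat."""
    return [holdout_split(labels, seed=s, **kwargs) for s in range(repeats)]


def kfold_split(labels, k=5, stratified=False, seed=0):
    """k-fold cross-validation; returns a list of (train, test) index pairs."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    pos = 0
    for idx in _index_groups(labels, stratified):
        idx = idx[:]
        rng.shuffle(idx)
        # Deal indices round-robin so each fold gets an even share per group.
        for i in idx:
            folds[pos % k].append(i)
            pos += 1
    everything = set(range(len(labels)))
    return [(sorted(everything - set(f)), sorted(f)) for f in folds]
```

In this sketch, "random" corresponds to `stratified=False` and "stratified" to `stratified=True`. A classifier would be fitted on each training index set and scored on the matching test set, with the scores averaged over repeats or folds.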

Comparisons of Logistic Regression and Support Vector Machines in Classification of Echocardiogram Dataset
Neni Alya Firdausanti; Ratih Ardiati Ningrum; Siti Qomariyah
Inferensi Vol 5, No 2 (2022)
Publisher : Department of Statistics ITS

DOI: 10.12962/j27213862.v5i2.14121

Abstract

Echocardiography is a test that uses sound waves to produce an image of the heart. This image is called an echocardiogram. This paper uses the Echocardiogram Dataset, in which the problem is to classify, from 7 features, whether the patient will survive or not. Several classification methods can be applied to a categorical response variable, such as logistic regression and Support Vector Machines (SVM). Accuracy was estimated using holdout and cross-validation. Before classification, several preprocessing procedures were applied to the dataset: missing value imputation using the median, univariate and multivariate outlier detection, and feature selection using the backward method. The classification results showed that SVM with unstratified holdout gave the best accuracy, 91.54%.
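
The median imputation step mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `impute_median` and the convention that missing values are stored as `None` are assumptions, and the toy rows are made up rather than taken from the Echocardiogram Dataset.

```python
from statistics import median


def impute_median(rows):
    """Replace None entries in each column with that column's median.

    `rows` is a list of equal-length lists of numbers or None.
    The median is computed from the observed (non-missing) values only.
    """
    n_cols = len(rows[0])
    medians = []
    for j in range(n_cols):
        observed = [row[j] for row in rows if row[j] is not None]
        medians.append(median(observed))
    return [
        [medians[j] if value is None else value for j, value in enumerate(row)]
        for row in rows
    ]


# Toy data (hypothetical), with one missing value per column.
toy = [[1.0, None], [3.0, 4.0], [None, 6.0]]
filled = impute_median(toy)
```

The median is less sensitive to extreme values than the mean, which matters when outliers have not yet been handled at the point of imputation.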