Inferensi
Vol 5, No 2 (2022)

Comparisons of Logistic Regression and Support Vector Machines in Classification of Echocardiogram Dataset

Neni Alya Firdausanti (Institut Teknologi Sepuluh Nopember)
Ratih Ardiati Ningrum (Data Science Technology, Airlangga University, Surabaya, Indonesia)
Siti Qomariyah (Institut Agama Islam Negeri Kudus, Kudus, Indonesia)



Article Info

Publish Date
30 Sep 2022

Abstract

Echocardiography is a test that uses sound waves to produce an image of our heart. This image is called an echocardiogram. This paper uses Echocardiogram Dataset, in which the problem is to classify from 7 features whether the patient will survive or not. In this study, the classification method is used to solve this problem. Some classification methods can be applied to classify category response variables, such as Logistic regression and Support Vector Machines (SVM). The method for predicting best accuracy used holdout and cross-validation. Before doing classification, some preprocessing procedures were applied to this dataset. The preprocessing procedures include missing value imputation using median imputation, outliers detection in univariate and multivariate procedures, and feature selection using the backward method. The result of classification in the analysis showed that SVM with unstratified holdout gave the best accuracy, that is 91.54%.

Copyrights © 2022






Journal Info

Abbrev

inferensi

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Engineering Mathematics Social Sciences

Description

The aim of Inferensi is to publish original articles concerning statistical theories and novel applications in diverse research fields related to statistics and data science. The objective of papers should be to contribute to the understanding of the statistical methodology and/or to develop and ...