Prosiding Seminar Nasional Sisfotek (Sistem Informasi dan Teknologi Informasi)
Vol 5 No 1 (2021): SISFOTEK V 2021

Kombinasi Algoritma Sampling dengan Algoritma Klasifikasi untuk Meningkatkan Performa Klasifikasi Dataset Imbalance

Gagah Gumelar (Universitas AMIKOM Yogyakarta)
Norlaila2 (Universitas AMIKOM Yogyakarta)
Quratul Ain (Universitas AMIKOM Yogyakarta)
Riza Marsuciati (Universitas AMIKOM Yogyakarta)
Silvi Agustanti Bambang (Universitas AMIKOM Yogyakarta)
Andi Sunyoto (Universitas AMIKOM Yogyakarta)
M. Syukri Mustafa (Universitas AMIKOM Yogyakarta)



Article Info

Publish Date
25 Sep 2021

Abstract

A class to be imbalanced when there is a class that has more data than other classes. A comparison between minority classes and the majority class is called Imbalance Ratio (IR). The greater the difference between the minority class and the majority class the value of the Imbalance Ratio (IR) is getting larger. Dataset imbalance in data mining is a serious problem. The application of the classification algorithm regardless of class balance resulted in a good prediction for the majority class and a neglected minority class. Therefore, in this research, the SMOTE algorithm was applied to balance the dataset. The study used 4 datasets with different Imbalance Ratio and used classification algorithms, C45, Naïve Bayes, K-NN, and SVM. Then compared before and after using SMOTE. The research results that have been done accuracy value and value G-mean Naïve Bayes algorithm is consistent with its performance at each level of imbalance ratio, before the implementation has no good performance, whereas after the implemented SMOTE algorithm Naïve Bayes has a consistent increase in accuracy. So it can be concluded that the combination SMOTE + Naïve Bayes most effectively used in the imbalance dataset with different levels in the scheme of 10 fold cross validation and 80% data testing tested as much as 50 times.

Copyrights © 2021






Journal Info

Abbrev

SISFOTEK

Publisher

Subject

Computer Science & IT

Description

Seminar Nasional Sistem Informasi dan Teknologi (SISFOTEK) merupakan ajang pertemuan ilmiah, sarana diskusi dan publikasi hasil penelitian maupun penerapan teknologi terkini dari para praktisi, peneliti, akademisi dan umum di bidang sistem informasi dan teknologi dalam artian ...