Informasi Interaktif
Vol 6, No 2 (2021): Jurnal Informasi Interaktif

COMPARATIVE ANALYSIS OF MULTI-CATEGORY NOMINAL FEATURE DATA USING THE ADAPTIVE SYNTHETIC NOMINAL (ADASYN-N) AND ADAPTIVE SYNTHETIC-KNN (ADASYN-KNN) METHODS

Putra, Jeffry Andhika (Universitas Janabadra)
Rahayu, Sri (Universitas Janabadra)



Article Info

Publish Date
31 May 2021

Abstract

The growing need for efficient algorithms for data manipulation, analysis, and intelligent use has made this a very active research area in the field of machine learning. However, some research areas are still not fully developed, especially the classification of imbalanced data. Class imbalance arises when the ratio between the number of instances in one class and another is unequal. It is detrimental to data mining because machine learning algorithms have difficulty classifying minority classes (those with few instances) correctly. There are several approaches to handling imbalance, one of which is to resample the original data. Under-sampling balances the classes by randomly removing majority-class instances. Over-sampling balances the class distribution by randomly replicating minority-class instances. This study compares two over-sampling techniques, Adaptive Synthetic-Nominal (ADASYN-N) and Adaptive Synthetic-kNN (ADASYN-KNN), for overcoming class imbalance in datasets with multi-category nominal features. The study uses seven datasets with multi-category nominal features and imbalanced class distributions. Each dataset is over-sampled with both methods and then classified using the Random Forest method. Finally, the accuracy obtained on the original dataset is compared with the accuracy obtained on the datasets over-sampled with ADASYN-N and ADASYN-KNN.

Keywords: ADASYN-KNN, ADASYN-N, class imbalance, nominal, multi-category, over-sampling.
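The two random resampling baselines defined in the abstract can be illustrated with a minimal Python sketch. It is not ADASYN-N or ADASYN-KNN, which generate synthetic instances adaptively. It only shows random under-sampling and random over-sampling. The function names `random_undersample` and `random_oversample` and the toy nominal data are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def _group_by_class(rows, labels):
    """Group feature rows by their class label."""
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    return by_class


def random_undersample(rows, labels, seed=0):
    """Randomly drop instances so every class matches the smallest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, labels)
    target = min(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        for row in rng.sample(members, target):  # sample without replacement
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


def random_oversample(rows, labels, seed=0):
    """Randomly replicate instances so every class matches the largest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, labels)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        # Draw duplicates with replacement to fill the gap to the target size.
        extra = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


# Toy dataset (invented for illustration): two nominal features per row,
# six majority-class ("no") instances and two minority-class ("yes") instances.
rows = [
    ("red", "small"), ("blue", "large"), ("green", "small"),
    ("red", "large"), ("blue", "small"), ("green", "large"),
    ("red", "medium"), ("blue", "medium"),
]
labels = ["no", "no", "no", "no", "no", "no", "yes", "yes"]

under_rows, under_labels = random_undersample(rows, labels)
over_rows, over_labels = random_oversample(rows, labels)

print("original:     ", Counter(labels))
print("under-sampled:", Counter(under_labels))
print("over-sampled: ", Counter(over_labels))
```

Because random over-sampling only duplicates existing rows, it adds no new information about the minority class. Synthetic methods such as those compared in the study create new instances instead.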

Copyrights © 2021






Journal Info

Abbrev

informasiinteraktif

Publisher

Subject

Computer Science & IT

Description

Jurnal Informasi Interaktif publishes articles in the fields of information and communication technology, software engineering and systems ...