Data with a large number of observations and dimensions is known as big data. Processing big data poses several challenges, one of which is the imbalanced dataset. In classification modeling, an imbalanced dataset is a common problem: class predictions tend to be accurate for the majority class and inaccurate for the minority class. Three approaches have been extensively researched as solutions: the data-level approach, the algorithm-level approach, and the ensemble approach. Data-level methods include SMOTE, undersampling, and oversampling; NWKNN is an algorithm-level method; and UnderBagging, RUSBoost, SMOTEBoost, and SMOTEBagging are ensemble methods. The goal of this study is to determine the best method for handling each case of the imbalanced dataset. Three cases of imbalance are considered: mild, moderate, and extreme. A simulation study was conducted for each case to evaluate the performance of each method. Based on the AUC, SMOTEBagging is the best method for mild imbalance (AUC = 0.9581) and for moderate imbalance (AUC = 0.9033), while UnderBagging gives the best performance for extreme imbalance.
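To make the data-level idea concrete, the following is a minimal sketch of SMOTE-style oversampling, not the reference implementation (which is available in the `imbalanced-learn` library): a synthetic minority sample is created by interpolating between a minority point and one of its k nearest minority neighbors. The function name `smote_sketch` and all parameters are illustrative assumptions.

```python
import numpy as np

def smote_sketch(X_min, n_new, k=3, seed=0):
    """Illustrative SMOTE-style oversampling: generate n_new synthetic
    minority samples by linear interpolation between a randomly chosen
    minority point and one of its k nearest minority neighbors."""
    rng = np.random.default_rng(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.integers(len(X_min))
        # Euclidean distances from sample i to every minority sample
        d = np.linalg.norm(X_min - X_min[i], axis=1)
        neighbors = np.argsort(d)[1:k + 1]  # skip the point itself
        j = rng.choice(neighbors)
        gap = rng.random()                  # interpolation factor in [0, 1)
        synthetic.append(X_min[i] + gap * (X_min[j] - X_min[i]))
    return np.vstack(synthetic)

# Toy minority class: 5 points in 2-D; generate 10 synthetic samples
X_min = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]])
X_new = smote_sketch(X_min, n_new=10)
print(X_new.shape)  # (10, 2)
```

Because each synthetic point lies on a segment between two existing minority points, the new samples stay inside the region already occupied by the minority class; SMOTEBoost and SMOTEBagging combine this resampling step with boosting and bagging, respectively.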
Copyright © 2026