Sistemasi: Jurnal Sistem Informasi
Vol 13, No 6 (2024): Sistemasi: Jurnal Sistem Informasi

Optimization of the Naive Bayes Algorithm with SMOTETomek Combination for Imbalance Class Fraud Detection

Arsanto, Arief Tri (Unknown)
Faizin, Arif (Unknown)
Lutfi, Moch (Unknown)
Saadah, Zulfatun Nikmatus (Unknown)



Article Info

Publish Date
27 Nov 2024

Abstract

The use of credit cards is increasing in the modern era, and card fraud therefore needs to be prevented with technologies such as address verification systems (AVS), card verification methods (CVM), and personal identification numbers (PIN). In addition, the history of past transactions needs to be analyzed. The fraud detection dataset shows a class imbalance: the majority class contains far more samples than the minority class. Class imbalance is a significant problem in machine learning because it can affect overall model performance. This research focuses on card fraud classification on an imbalanced dataset, using Naive Bayes as the classification algorithm. To overcome the imbalance, a combined resampling technique, SMOTETomek, is applied. SMOTETomek oversamples the minority class with SMOTE and then uses Tomek links, which are pairs of adjacent samples from the minority and majority classes, to remove samples. Because the performance of Naive Bayes suffered from the data imbalance in this study, resampling was proposed in the hope of improving it. Measured by the area under the ROC curve (ROC-AUC), the SMOTETomek method improved the performance of the Naive Bayes algorithm: the higher the ROC-AUC score, the better the model distinguishes between the two classes. The accuracy results, however, did not change significantly.
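The abstract does not show the study's implementation, so the following is only a minimal sketch of the two resampling steps it names, written in plain Python on made-up toy data. The function names (`nearest`, `smote`, `remove_tomek_links`), the neighbor count `k=3`, and the choice to drop both members of each Tomek link are assumptions for illustration, not the authors' settings. In practice a library such as imbalanced-learn offers a ready-made `SMOTETomek` resampler.

```python
import math
import random

def nearest(idx, points, candidates, k=1):
    """Return up to k indices from `candidates` closest to points[idx]."""
    others = [j for j in candidates if j != idx]
    others.sort(key=lambda j: math.dist(points[idx], points[j]))
    return others[:k]

def smote(X, y, minority=1, k=3, seed=0):
    """Add synthetic minority points until both classes have equal size."""
    rng = random.Random(seed)
    min_idx = [i for i, label in enumerate(y) if label == minority]
    n_new = (len(y) - len(min_idx)) - len(min_idx)
    X_new, y_new = list(X), list(y)
    for _ in range(max(n_new, 0)):
        i = rng.choice(min_idx)                    # a random minority sample
        j = rng.choice(nearest(i, X, min_idx, k))  # one of its minority neighbors
        gap = rng.random()                         # position along the segment i -> j
        X_new.append(tuple(a + gap * (b - a) for a, b in zip(X[i], X[j])))
        y_new.append(minority)
    return X_new, y_new

def remove_tomek_links(X, y):
    """Drop both members of every Tomek link (assumed cleaning strategy)."""
    everyone = range(len(X))
    nn = [nearest(i, X, everyone)[0] for i in everyone]
    # A Tomek link: two mutual nearest neighbors with different labels.
    drop = {i for i in everyone if nn[nn[i]] == i and y[i] != y[nn[i]]}
    keep = [i for i in everyone if i not in drop]
    return [X[i] for i in keep], [y[i] for i in keep]

# Toy data (hypothetical): 8 legitimate (0) and 3 fraudulent (1) transactions.
X = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (0.3, 0.2),
     (0.0, 0.4), (0.4, 0.0), (0.3, 0.4), (0.2, 0.5),
     (2.0, 2.0), (2.3, 2.1), (0.5, 0.45)]
y = [0] * 8 + [1] * 3

X_bal, y_bal = smote(X, y)                      # oversampling step
X_res, y_res = remove_tomek_links(X_bal, y_bal) # cleaning step
print(len(X), len(X_bal), len(X_res))
```

The resampled `X_res`, `y_res` would then be used to train the classifier. Resampling is normally applied to the training split only, so that ROC-AUC is measured on untouched data.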

Copyrights © 2024






Journal Info

Abbrev

stmsi

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

Sistemasi is a scientific journal in the field of computer science, published by the Information Systems study program of Universitas Islam Indragiri, Tembilahan, Riau. Sistemasi is published three times a year, in January, May, and September. The general focus and scope of Sistemasi is the field of Information Systems, ...