Garuda - Garba Rujukan Digital

p-Index (2021 - 2026): 0.23

This author has published in the following journal:

Telematika

Maulidha, Khusnul Rahmi

Lambung Mangkurat University

Author-ID : 8294838

Education

Published: 1 document

Articles

Comparative Analysis of Distance Metrics in KNN and SMOTE Algorithms for Software Defect Prediction
Maulidha, Khusnul Rahmi; Faisal, Mohammad Reza; Saputro, Setyo Wahyu; Abadi, Friska; Nugrahadi, Dodon Turianto; Adi, Puput Dani Prasetyo; Hariyady, Hariyady
Telematika Vol 18, No 1: February (2025)
Publisher : Universitas Amikom Purwokerto

DOI: 10.35671/telematika.v18i1.3008

As software projects grow in complexity and scale, handling software defects becomes more challenging. One solution is machine learning-based software defect prediction, for example with the K-Nearest Neighbors (KNN) algorithm. However, KNN’s performance can be hindered by its majority-vote mechanism and by the choice of distance/similarity metric, especially on imbalanced datasets. This research compares the effect of the Euclidean, Hamming, Cosine, and Canberra distance metrics on KNN performance, both before and after applying SMOTE (Synthetic Minority Over-sampling Technique). Results show significant improvements in AUC and F1 values across various datasets after SMOTE is applied. With SMOTE, Euclidean distance produced an AUC of 0.7752 and an F1 of 0.7311 on the EQ dataset. With Canberra distance and SMOTE, the JDT dataset produced an AUC of 0.7707 and an F1 of 0.6342. The LC dataset improved to 0.6752 and 0.3733, and the ML dataset climbed to 0.6845 and 0.4261 with Canberra distance. Lastly, with SMOTE, the PDE dataset improved to 0.6580 and 0.3957 with Canberra distance. The findings confirm that SMOTE, combined with a suitable distance metric, significantly boosts KNN’s prediction accuracy, with a p-value of 0.0001.
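To make the comparison in the abstract concrete, the sketch below implements the four distance metrics it names, a plain majority-vote KNN, and a simplified SMOTE-style interpolation. It is not the authors' code: the function names, the toy data, the handling of zero vectors and zero denominators, and the simplified oversampling step are all assumptions made for illustration.

```python
import math
import random
from collections import Counter

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def hamming(a, b):
    # Fraction of positions whose values differ (degenerate on continuous data).
    return sum(x != y for x, y in zip(a, b)) / len(a)

def cosine(a, b):
    # Cosine distance = 1 - cosine similarity.
    # Assumption: a zero vector is given distance 1.0.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    if na == 0 or nb == 0:
        return 1.0
    return 1.0 - dot / (na * nb)

def canberra(a, b):
    # Assumption: terms with a zero denominator contribute 0.
    total = 0.0
    for x, y in zip(a, b):
        denom = abs(x) + abs(y)
        if denom:
            total += abs(x - y) / denom
    return total

METRICS = {"euclidean": euclidean, "hamming": hamming,
           "cosine": cosine, "canberra": canberra}

def knn_predict(train_X, train_y, query, k, dist):
    # Sort training points by distance to the query and take a majority vote.
    neighbours = sorted(zip(train_X, train_y),
                        key=lambda pair: dist(pair[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]

def smote(minority, n_new, k, rng):
    # Simplified SMOTE: pick a minority point, pick one of its k nearest
    # minority neighbours, and interpolate a new point between the two.
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: euclidean(p, base))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic

# Hypothetical toy data: label 0 = clean module, label 1 = defective module.
clean = [[1.0, 1.0], [1.2, 0.8], [0.9, 1.1], [1.1, 1.3], [0.8, 0.9], [1.3, 1.0]]
defective = [[3.0, 3.0], [3.2, 2.9]]

rng = random.Random(0)
synthetic = smote(defective, n_new=4, k=1, rng=rng)

X = clean + defective + synthetic
y = [0] * len(clean) + [1] * (len(defective) + len(synthetic))

for name, dist in METRICS.items():
    print(name, knn_predict(X, y, [2.9, 3.1], k=3, dist=dist))
```

The metrics need not agree on the same query, which is the effect the study measures at scale. A real experiment would more likely use an established library and evaluate AUC and F1 under cross-validation rather than a single toy prediction.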

Co-Authors: Abadi, Friska; Adi, Puput Dani Prasetyo; Nugrahadi, Dodon Turianto; Faisal, Mohammad Reza; Hariyady, Hariyady; Saputro, Setyo Wahyu
