Claim Missing Document
Check
Articles

Found 2 Documents
Search
Journal : Indonesian Journal of Electronics, Electromedical Engineering, and Medical Informatics

Hybrid Feature Selection and Balancing Data Approach for Improved Software Defect Prediction Febrian, Muhamad Michael; Saputro, Setyo Wahyu; Saragih, Triando Hamonangan; Abadi, Friska; Herteno, Rudy
Indonesian Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol. 7 No. 2 (2025): May
Publisher : Jurusan Teknik Elektromedik, Politeknik Kesehatan Kemenkes Surabaya, Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/ijeeemi.v7i2.67

Abstract

Software Defect Prediction (SDP) plays a vital role in identifying defects within software modules. Accurate early detection of software defects can reduce development costs and enhance software reliability. However, SDP remains a significant challenge in the software development lifecycle. This study employs Particle Swarm Optimization (PSO) and addresses several challenges associated with its application, including noisy attributes, high-dimensional data, and imbalanced class distribution. To address these challenges, this study proposed a hybrid filter-based feature selection and class balancing method. The feature selection process incorporates Chi-Square (CS), Correlation-Based Feature Selection (CFS), and Correlation Matrix-Based Feature Selection (CMFS), which have been proven effective in reducing noisy and redundant attributes. Additionally, the Synthetic Minority Over-sampling Technique (SMOTE) is applied to mitigate class imbalance in the dataset. The K-Nearest Neighbors (KNN) algorithm is employed as the classification model due to its simplicity, non-parametric nature, and suitability for handling the feature subsets produced. Performance evaluation is conducted using the Area Under Curve (AUC) metric with a significance threshold of 0.05 to assess classification capability.  The proposed method achieved an AUC of 0.872, demonstrating its effectiveness in enhancing predictive performance. The proposed method was also superior to other combinations such as PSO SMOTE (0.0043), PSO SMOTE CS (0.0091), PSO SMOTE CFS (0.0111), and PSO SMOTE CFS CMFS (0.0007). The findings of this study show that the proposed method significantly enhances the efficiency and accuracy of PSO in software defect prediction tasks. This hybrid strategy demonstrates strong potential as a robust solution for future research and application in predictive software quality assurance.
Enhancing Software Defect Prediction: HHO-Based Wrapper Feature Selection with Ensemble Methods Fauzan Luthfi, Achmad; Herteno, Rudy; Abadi, Friska; Adi Nugroho, Radityo; Itqan Mazdadi, Muhammad; Athavale, Vijay Anant
Indonesian Journal of Electronics, Electromedical Engineering, and Medical Informatics Vol. 7 No. 2 (2025): May
Publisher : Jurusan Teknik Elektromedik, Politeknik Kesehatan Kemenkes Surabaya, Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35882/f2140043

Abstract

The growing complexity of data across domains highlights the need for effective classification models capable of addressing issues such as class imbalance and feature redundancy. The NASA MDP dataset poses such challenges due to its diverse characteristics and highly imbalanced classes, which can significantly affect model accuracy. This study proposes a robust classification framework integrating advanced preprocessing, optimization-based feature selection, and ensemble learning techniques to enhance predictive performance. The preprocessing phase involved z-score standardization and robust scaling to normalize data while reducing the impact of outliers. To address class imbalance, the ADASYN technique was employed. Feature selection was performed using Binary Harris Hawk Optimization (BHHO), with K-Nearest Neighbor (KNN) used as an evaluator to determine the most relevant features. Classification models including Random Forest (RF), Support Vector Machine (SVM), and Stacking were evaluated using performance metrics such as accuracy, AUC, precision, recall, and F1-measure. Experimental results indicated that the Stacking model achieved superior performance in several datasets, with the MC1 dataset yielding an accuracy of 0.998 and an AUC of 1.000. However, statistical significance testing revealed that not all observed improvements were meaningful; for example, Stacking significantly outperformed SVM but did not show a significant difference when compared to RF in terms of AUC. This underlines the importance of aligning model choice with dataset characteristics. In conclusion, the integration of advanced preprocessing and metaheuristic optimization contributes positively to software defect prediction. Future research should consider more diverse datasets, alternative optimization techniques, and explainable AI to further enhance model reliability and interpretability.
Co-Authors A.A. Ketut Agung Cahyawan W AA Sudharmawan, AA Abdullayev, Vugar Achmad Zainudin Nur Adi Mu'Ammar, Rifqi Aflaha, Rahmina Ulfah Ahmad Juhdi Alfando, Muhammad Alvin Amalia, Raisa Andi Farmadi Andi Farmandi Arif, Nuuruddin Hamid Athavale, Vijay Anant budiman, irwan Deni Kurnia Dodon Turianto Nugrahadi Dwi Kartini Dwi Kartini, Dwi Emma Andini Faisal, Mohammad Reza Fathmah, Siti Fatma Indriani Fauzan Luthfi, Achmad Febrian, Muhamad Michael Halimah Halimah Halimah Hartati Hartati Herteno, Rudy Indriani, Fatma Irwan Budiman Irwan Budiman Itqan Mazdadi, Muhammad M Kevin Warendra Mafazy, Muhammad Meftah Martalisa, Asri Mera Kartika Delimayanti Muhamad Fawwaz Akbar Muhammad Alkaff Muhammad Azmi Adhani Muhammad Denny Ersyadi Rahman Muhammad Haekal Muhammad Itqan Mazdadi Muhammad Khairin Nahwan Muhammad Mirza Hafiz Yudianto Muhammad Nazar Gunawan Muhammad Noor Muhammad Reza Faisal, Muhammad Reza Muhammad Sholih Afif Muliadi Muliadi Muliadi Aziz Muliadi Muliadi Nabella, Putri Nor Indrani Nugrahadi, Dodon Nurlatifah Amini Nursyifa Azizah Prastya, Septyan Eka Pratama, Muhammad Yoga Adha Putri Nabella Radityo Adi Nugroho Rahman Hadi Rahman Rahmat Ramadhani Reina Alya Rahma Rinaldi Riza Susanto Banner Rizal, Muhammad Nur Rizky Ananda, Muhammad Rizky, Muhammad Hevny Rudy Herteno SALLY LUTFIANI Saputro, Setyo Wahyu Saragih, Triando Hamonangan Sarah Monika Nooralifa Sa’diah, Halimatus Septyan Eka Prastya Setyo Wahyu Saputro Siti Napi'ah Tri Mulyani Ulya, Azizatul Umar Ali Ahmad Vina Maulida, Vina Wahyu Dwi Styadi Wahyu Saputro, Setyo Yunida, Rahmi