IAES International Journal of Artificial Intelligence (IJ-AI)
Vol 14, No 6: December 2025

Enhancing software fault prediction through data balancing techniques and machine learning

Raj, Akshat (Unknown)
Chavan, Durva Mahadeo (Unknown)
Agarwal, Priyal (Unknown)
Gigi, Jestin (Unknown)
Rao, Madhuri (Unknown)
Musale, Vinayak (Unknown)
Chanchlani, Akshita (Unknown)
Dholkawala, Murtaza Shabbirbhai (Unknown)
Kumar, Kulamala Vinod (Unknown)



Article Info

Publish Date
01 Dec 2025

Abstract

Software fault prediction is essential for ensuring the reliability and quality of software systems by identifying potential defects early in the development lifecycle. However, the presence of imbalanced datasets poses a significant challenge to the effectiveness of fault prediction models. In this paper, we investigate the impact of different data balancing techniques, including generative adversarial networks (GANs), synthetic minority over-sampling technique (SMOTE), and NearMiss, on machine learning (ML) model performance for software fault prediction. Through a comparative analysis across multiple datasets commonly used in software engineering research, we evaluate the efficacy of these techniques in addressing class imbalance and improving predictive accuracy. Our findings provide insights into the most effective approaches for handling imbalanced data in software fault prediction tasks, thereby advancing the state-of-the-art in software engineering research and practice. An extensive experimentation is performed and analyzed in this study here that includes 8 datasets, 4 data balancing techniques, and 4 ML techniques in order to demonstrate the efficacy of various models in software fault prediction.

Copyrights © 2025






Journal Info

Abbrev

IJAI

Publisher

Subject

Computer Science & IT Engineering

Description

IAES International Journal of Artificial Intelligence (IJ-AI) publishes articles in the field of artificial intelligence (AI). The scope covers all artificial intelligence area and its application in the following topics: neural networks; fuzzy logic; simulated biological evolution algorithms (like ...