Jurnal Informatika
Vol 12, No 2 (2025): October

SemetonBug: A Machine Learning Model for Automatic Bug Detection in Python Code Based on Syntactic Analysis

Imran, Bahtiar
Riadi, Selamet
Suryadi, Emi
Zulpahmi, M.
Zaeniah, Zaeniah
Wahyudi, Erfan



Article Info

Publish Date
28 Aug 2025

Abstract

Detecting bugs in Python code is a crucial part of software development. This study develops an automated bug detection system that combines feature extraction from the Abstract Syntax Tree (AST) with a Random Forest classifier. The dataset consists of 100 manually classified bugged files and 100 non-bugged files. The model is trained on structural code features such as the number of functions, classes, variables, conditions, and exception-handling constructs. Evaluation yields an accuracy of 86.67%, with balanced precision and recall across both classes. Confusion matrix analysis shows that false positives and false negatives occur, albeit in relatively low numbers. The accuracy curve suggests potential overfitting, as training accuracy is higher than testing accuracy. The study demonstrates that combining AST-based feature extraction with a Random Forest can be an effective approach to automated bug detection, and that model optimization and a larger dataset could improve it further.
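
The structural features named in the abstract can be illustrated with Python's standard `ast` module. The following is a minimal sketch, not the authors' implementation: the function names `extract_features` and `to_vector`, the feature keys, the choice to count only `if` statements as conditions and only assigned names as variables, and the `parse_error` flag for files that fail to parse are all assumptions made for illustration.

```python
import ast

# Fixed feature order so every file maps to a vector of the same shape.
# These keys are illustrative; the paper does not list exact feature names.
FEATURE_ORDER = [
    "functions", "classes", "variables",
    "conditions", "try_blocks", "parse_error",
]


def extract_features(source: str) -> dict:
    """Count structural elements of a Python source string via its AST."""
    counts = {name: 0 for name in FEATURE_ORDER}
    try:
        tree = ast.parse(source)
    except SyntaxError:
        # Assumption: an unparsable file is kept and flagged, not dropped.
        counts["parse_error"] = 1
        return counts

    assigned_names = set()
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            counts["functions"] += 1
        elif isinstance(node, ast.ClassDef):
            counts["classes"] += 1
        elif isinstance(node, ast.If):
            counts["conditions"] += 1
        elif isinstance(node, ast.Try):
            counts["try_blocks"] += 1
        elif isinstance(node, ast.Name) and isinstance(node.ctx, ast.Store):
            # Distinct plain names that are assigned somewhere in the file.
            assigned_names.add(node.id)
    counts["variables"] = len(assigned_names)
    return counts


def to_vector(features: dict) -> list:
    """Turn the feature dict into a list in FEATURE_ORDER."""
    return [features[name] for name in FEATURE_ORDER]


SAMPLE = '''
class Stack:
    def __init__(self):
        self.items = []

    def pop(self):
        if not self.items:
            return None
        try:
            value = self.items.pop()
        except IndexError:
            value = None
        return value
'''

if __name__ == "__main__":
    print(extract_features(SAMPLE))
    print(to_vector(extract_features("def broken(:")))
```

Vectors produced this way, one per file and paired with a bugged or non-bugged label, are the kind of input a Random Forest classifier could be trained on. The actual feature set and preprocessing used in the study may differ.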

Copyrights © 2025






Journal Info

Abbrev

ji

Subject

Computer Science & IT

Description

Jurnal Informatika serves researchers in the field of technology and information, in the spirit of disseminating scientific results and community service, and makes its journal articles free to download both nationally and internationally. The editorial board welcomes innovative manuscripts from Technology ...