Jurnal Informatika
Vol. 12 No. 2 (2025): October

SemetonBug: A Machine Learning Model for Automatic Bug Detection in Python Code Based on Syntactic Analysis

Bahtiar Imran (Universitas Teknologi Mataram)
Selamet Riadi (Universitas Teknologi Mataram)
Emi Suryadi (Universitas Teknologi Mataram)
M. Zulpahmi (Universitas Teknologi Mataram)
Zaeniah Zaeniah (Universitas Teknologi Mataram)
Erfan Wahyudi (Universitas Teknologi Mataram)



Article Info

Publish Date
01 Oct 2025

Abstract

Bug detection is a crucial part of Python software development. This study develops an automated bug detection system that combines feature extraction based on the Abstract Syntax Tree (AST) with a Random Forest classifier. The dataset consists of 100 manually classified bugged files and 100 non-bugged files. The model is trained on structural code features such as the number of functions, classes, variables, conditions, and exception handlers. Evaluation yields an accuracy of 86.67%, with balanced precision and recall across both classes. Confusion matrix analysis shows a relatively small number of false positives and false negatives. The accuracy curve suggests possible overfitting, as training accuracy is higher than testing accuracy. The study demonstrates that combining AST-based feature extraction with Random Forest can be an effective approach to automated bug detection, with room for improvement through model optimization and a larger dataset.
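
The kind of AST-based feature extraction described in the abstract can be sketched with Python's standard ast module. This is a minimal illustration, not the authors' implementation: the function name extract_features, the feature keys, the syntax_error flag, and the choice to count only simple assigned names as "variables" are all assumptions, since the abstract does not give the exact feature definitions.

```python
import ast

# Hypothetical feature names; the abstract lists the categories but not
# how each one is defined or counted.
FEATURES = ("functions", "classes", "variables", "conditions", "exception_handlers")


def extract_features(source: str) -> dict:
    """Count simple structural features of Python source using its AST."""
    counts = {name: 0 for name in FEATURES}
    try:
        tree = ast.parse(source)
    except SyntaxError:
        # Assumption: files that do not parse get an all-zero vector and a flag.
        counts["syntax_error"] = 1
        return counts

    assigned_names = set()
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            counts["functions"] += 1
        elif isinstance(node, ast.ClassDef):
            counts["classes"] += 1
        elif isinstance(node, ast.Name) and isinstance(node.ctx, ast.Store):
            # Only plain names that are assigned to; attributes such as
            # self.items are not counted in this sketch.
            assigned_names.add(node.id)
        elif isinstance(node, (ast.If, ast.IfExp)):
            counts["conditions"] += 1
        elif isinstance(node, ast.ExceptHandler):
            counts["exception_handlers"] += 1

    counts["variables"] = len(assigned_names)
    counts["syntax_error"] = 0
    return counts


# A small made-up input, used only to exercise the function.
sample = '''
class Stack:
    def __init__(self):
        self.items = []

    def pop(self):
        try:
            return self.items.pop()
        except IndexError:
            return None

def main():
    s = Stack()
    total = 0
    if s.pop() is None:
        total += 1
'''

features = extract_features(sample)
print(features)
```

Each file's dictionary could then be turned into a fixed-order numeric vector and passed, together with its bugged or non-bugged label, to a classifier such as scikit-learn's RandomForestClassifier. The train/test split and hyperparameters used in the study are not stated in the abstract.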

Copyrights © 2025






Journal Info

Abbrev

ji

Publisher

Subject

Computer Science & IT

Description

Jurnal Informatika, first published in 2014 (e-ISSN 2528-2247, p-ISSN 2355-6579), is a scientific research journal in Informatics Engineering, Informatics Management, and Information Systems, published by Universitas Bina Sarana Informatika. It publishes articles that have never been published before, online or in print. The ...