Garuda - Garba Rujukan Digital

p-Index (2021-2026): 0.23

Journals this author has published in: Jurnal Accounting Information System (AIMS)

Atmadja, Aldi Rialdy

Unknown Affiliation

Author-ID : 9747662

Subject areas: Computer Science & IT; Economics, Econometrics & Finance; Education; Other

Published: 1 document

Articles

Classification of Delayed Students Graduation Risk : A Comparative Analysis of Naive Bayes, XGBoost, and Random Forest
Fadillah, Khafka; Atmadja, Aldi Rialdy; Nur Lukman, Nur Lukman
Jurnal Accounting Information System (AIMS) Vol. 9 No. 1 (2026)
Publisher : Ma'soem University

DOI: 10.32627/aims.v9i1.1843

Delayed student graduation is a critical challenge for higher education systems: it lowers institutional performance and increases the financial and psychological burden on students. This study aims to classify the risk of delayed graduation by developing and evaluating machine learning models based on new student admission data. The dataset was obtained from the New Student Admission Center of UIN Sunan Gunung Djati Bandung and consists of students’ biodata, including socioeconomic characteristics and the educational background of students and their parents. The research followed the CRISP-DM framework, encompassing business understanding, data understanding, data preparation, modeling, evaluation, and deployment planning. During the data preparation stage, preprocessing techniques such as data cleaning, encoding of categorical variables, and feature selection were applied to improve data quality. Three machine learning algorithms (Naïve Bayes, Random Forest, and XGBoost) were implemented and optimized using hyperparameter tuning. The models were compared using accuracy, precision, recall, F1-score, and ROC-AUC. The experimental results show that the Random Forest algorithm outperformed the other models, achieving an accuracy of 0.633, precision of 0.677, recall of 0.694, F1-score of 0.685, and ROC-AUC of 0.668. These findings indicate that machine learning models based on admission data can provide a reasonably effective early prediction of delayed graduation risk. Nevertheless, model performance could be further improved by incorporating academic performance variables recorded during the study period. This study is expected to support higher education institutions in formulating data-driven strategies and early intervention programs for students at high risk of delayed graduation.
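The F1-score reported for Random Forest can be cross-checked against the reported precision and recall, since F1 is their harmonic mean. The following is a minimal sketch, not the authors' code: the function name `f1_from_precision_recall` and the `reported` dictionary are illustrative assumptions, and only the three figures quoted in the abstract are taken from the source. Because the inputs are already rounded to three decimals, agreement is expected only to within rounding.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the F1-score)."""
    if precision + recall == 0:
        return 0.0  # conventional value when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Random Forest figures as quoted in the abstract (hypothetical container).
reported = {"precision": 0.677, "recall": 0.694, "f1": 0.685}

recomputed = f1_from_precision_recall(reported["precision"], reported["recall"])
print(f"recomputed F1 = {recomputed:.3f}, reported F1 = {reported['f1']:.3f}")
```

Under these assumptions the recomputed value matches the reported F1-score to three decimals, so the three figures are internally consistent.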

Co-Authors: Fadillah, Khafka; Nur Lukman, Nur Lukman
