Jurnal Pendidikan Sains dan Komputer
Vol. 6 No. 01 (2026): Call for Papers, February 2026

Model Prediktif Kelulusan Mahasiswa Berbasis Machine Learning Menggunakan Pipeline Terintegrasi dan Hyperparameter Tuning: A Machine Learning-Based Student Graduation Prediction Model Using an Integrated Pipeline and Hyperparameter Tuning

Mundirin, Mundirin (Unknown)
Hedin, Deden (Unknown)
Idawati, Idawati (Unknown)
Latief, Ibrahim (Unknown)
Lili, Mohamad (Unknown)



Article Info

Publish Date
04 Feb 2026

Abstract

Delayed graduation is a critical issue in higher education because it affects academic efficiency, program accreditation, and graduate quality. This study develops a machine-learning model for predicting student graduation using an integrated pipeline that covers data preprocessing, model building, and hyperparameter optimisation. The dataset was drawn from eight semesters of student academic records, totalling 146 credits. It includes both numeric and categorical variables, such as GPA, number of credits passed per semester, study load, and student background characteristics. Preprocessing was performed with ColumnTransformer, which combined StandardScaler for numeric features and OneHotEncoder for categorical features. The classifier was built with the Random Forest algorithm and tuned with GridSearchCV to identify the best hyperparameter settings. The model was evaluated using accuracy, a confusion matrix, and a classification report. It achieves an accuracy of 81% in classifying students as on-time or late graduates. Feature analysis shows that mean Grade Point Average (GPA), the number of credits completed each semester, and consistency of study load are the main factors influencing on-time completion. The integrated pipeline was found to be effective in keeping preprocessing consistent and reducing the risk of data leakage. The model can be deployed as an early warning system to support data-driven academic decision-making.

Copyrights © 2026






Journal Info

Abbrev

jpsk

Publisher

Subject

Agriculture, Biological Sciences & Forestry; Humanities; Biochemistry, Genetics & Molecular Biology; Chemical Engineering, Chemistry & Bioengineering; Chemistry; Computer Science & IT; Decision Sciences, Operations Research & Management; Education; Energy; Immunology & microbiology; Materials Science & Nanotechnology; Mathematics; Physics; Other

Description

Jurnal Pendidikan Sains dan Komputer (JPSK) is a national open-access journal that publishes original, interdisciplinary scientific studies reviewed by peer reviewers who are competent in their fields. The scope of the journal covers science education, in both theory and practice, with the fields of education ...