The manual assessment of programming assignments remains a significant challenge in educational settings due to its time-consuming nature and susceptibility to human error. Observational studies of course instructors reveal that over 40% have made grading mistakes, often due to fatigue or inconsistent evaluation standards. This study aims to develop an automated assessment system using artificial intelligence to enhance both objectivity and efficiency in the evaluation process. The method employed is the K-Means clustering algorithm, chosen for its ability to group answers by similarities in logic and code structure rather than mere textual similarity. Five assessment categories were used as clustering criteria: Logic and Algorithm, Data Structures, Object-Oriented Programming (OOP), Implementation, and Error Handling. The system was developed using an Agile Development approach and evaluated with student responses from programming courses. System performance was validated quantitatively by comparing cluster results against ground-truth labels from manual grading. The system achieved 87% clustering accuracy, reduced the average grading time to 4.5 seconds per answer (compared to 13 seconds manually, a 65% efficiency gain), and decreased the inter-rater score standard deviation from 7.5 to 2.8 points. The results indicate that the system can deliver accurate real-time feedback. This study focused on programming questions spanning easy to hard difficulty levels. In future work, the system could be enhanced by integrating advanced syntax analysis and expanding the evaluation criteria to support large-scale deployment.
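The clustering step described above can be illustrated with a minimal sketch. The abstract does not specify how answers are encoded, so the sketch assumes each answer has already been scored on the five rubric dimensions (Logic and Algorithm, Data Structures, OOP, Implementation, Error Handling) and represented as a 5-dimensional feature vector; the vectors, the deterministic center initialization, and the toy data are all illustrative, not the study's actual pipeline.

```python
# Hypothetical rubric-score vectors (0-10 on each of the five criteria
# from the abstract). These are made-up values, not the study's data.
ANSWERS = [
    (9, 8, 9, 8, 7),   # strong answers
    (8, 9, 8, 9, 8),
    (5, 5, 4, 6, 5),   # middling answers
    (6, 4, 5, 5, 4),
    (2, 1, 2, 3, 1),   # weak answers
    (1, 2, 1, 2, 2),
]

def dist2(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iters=20):
    """Plain k-means. Evenly spaced initial centers are a simplification;
    a real system would use a scheme such as k-means++."""
    step = max(1, len(points) // k)
    centers = [points[i * step] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: attach each answer to its nearest center.
        labels = [min(range(k), key=lambda c: dist2(p, centers[c]))
                  for p in points]
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(dim) / len(members)
                                   for dim in zip(*members))
    return labels

labels = kmeans(ANSWERS, k=3)
# Answers with similar rubric profiles land in the same cluster, so a
# grader can review one representative per cluster instead of every answer.
```

On this toy data the three quality tiers separate cleanly, which mirrors the paper's premise: grouping by logic and structure lets one manual judgment per cluster propagate to all similar answers.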