The formation of study groups at the junior high school level plays an important role in improving the quality of learning and promoting equality in student learning outcomes. However, the process of grouping students is still largely carried out manually based on teachers’ intuition, subjective observations, or attendance data, which may lead to mismatches in students’ abilities and hinder the optimal achievement of learning objectives within the school environment. This study aims to identify study groups based on students’ academic data at SMP Muhammadiyah 5 Samarinda. The data used include scores in science (exact) and non-science (non-exact) subjects, exam results, assignment scores, attendance records, and parents’ educational backgrounds. The research stages consist of data cleaning, feature engineering, standardization, the application of the DBSCAN algorithm, and evaluation using the Silhouette Score. The analysis results reveal three main clusters: cluster 0 with 89 students (medium achievement), cluster 1 with 50 students (high achievement), and cluster 2 with 5 students (low achievement). In addition, 14 students (8.9%) were identified as noise. The Silhouette Score value of 0.217 indicates that the cluster separation quality is relatively weak; however, DBSCAN successfully detected outliers that may not be identified by other algorithms. These findings suggest that, although the cluster quality is not yet optimal, the applied algorithm remains useful for exploring students’ learning patterns and can serve as a basis for more targeted learning interventions.
Copyrights © 2025