Jurnal Profesi Keguruan
Vol. 11 No. 1 (2025)

Can We Trust AI to Assess Writing? An Analysis of Scoring Reliability and Feedback Consistency

Fitriani, Fitriani (Unknown)
Eko Rini, Puput Zuli (Unknown)



Article Info

Publish Date
30 Jun 2025

Abstract

This study analyzes AI-generated writing assessments' scoring reliability and feedback consistency using ChatGPT. Adopting a mixed-methods approach, 23 student descriptive texts were evaluated across three assessment rounds. Quantitative findings showed high scoring reliability, with an Intraclass Correlation Coefficient (ICC) of 0.93, indicating excellent consistency across repeated evaluations. Qualitative analysis revealed that ChatGPT consistently addressed five core writing criteria—content, organization, vocabulary, language use, and mechanics. However, the feedback varied in focus and detail across rounds, and the absence of reference to prior feedback limited its support for revision as a recursive process. The findings suggest that although ChatGPT demonstrates reliable scoring and generally stable feedback themes, it lacks the continuity to facilitate sustained writing development. To enhance its pedagogical value, AI-based feedback systems should be designed to build upon previous responses, thereby enabling more effective support for students' progressive improvement in writing.

Copyrights © 2025






Journal Info

Abbrev

jpk

Publisher

Subject

Education

Description

Jurnal profesi keguruan this is a scientific publication that contains about the works in the world of education by teachers, lecturers, and society. Scientific work in this published journal is an article of research or innovative and progressive ideas in the field of formal education. Articles of ...