Garuda - Garba Rujukan Digital

Jurnal Profesi Keguruan

Vol. 11 No. 1 (2025)

Fitriani, Fitriani (Unknown)
Eko Rini, Puput Zuli (Unknown)

Publish Date
30 Jun 2025

This study analyzes AI-generated writing assessments' scoring reliability and feedback consistency using ChatGPT. Adopting a mixed-methods approach, 23 student descriptive texts were evaluated across three assessment rounds. Quantitative findings showed high scoring reliability, with an Intraclass Correlation Coefficient (ICC) of 0.93, indicating excellent consistency across repeated evaluations. Qualitative analysis revealed that ChatGPT consistently addressed five core writing criteria—content, organization, vocabulary, language use, and mechanics. However, the feedback varied in focus and detail across rounds, and the absence of reference to prior feedback limited its support for revision as a recursive process. The findings suggest that although ChatGPT demonstrates reliable scoring and generally stable feedback themes, it lacks the continuity to facilitate sustained writing development. To enhance its pedagogical value, AI-based feedback systems should be designed to build upon previous responses, thereby enabling more effective support for students' progressive improvement in writing.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Jurnal Profesi Keguruan

Website

Abbrev

jpk

Publisher

Universitas Negeri Semarang

Subject

Education

Description

Jurnal profesi keguruan this is a scientific publication that contains about the works in the world of education by teachers, lecturers, and society. Scientific work in this published journal is an article of research or innovative and progressive ideas in the field of formal education. Articles of ...

Article Info

Abstract

Can We Trust AI to Assess Writing? An Analysis of Scoring Reliability and Feedback Consistency

Article Info

Abstract