Garuda - Garba Rujukan Digital

Article Per Year (5 Year)

p-Index From 2021 - 2026

0.23

P-Index

This Author published in this journals

All Journal Journal of General Education and Humanities

Kamalatuzzahroh, Aliza

Unknown Affiliation

Author-ID : 9743427

Humanities Languange, Linguistic, Communication & Media Mathematics Other

Published : 1 Documents Claim Missing Document

Claim Missing Document

Articles

The Use of Artificial Intelligence in Assessing the IELTS Academic Writing Task Essays Kamalatuzzahroh, Aliza; Priyana, Joko
Journal of General Education and Humanities Vol. 5 No. 1 (2026): February
Publisher : MASI Mandiri Edukasi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.58421/gehu.v5i1.1189

This study investigates the accuracy of Artificial Intelligence (AI) in assessing the IELTS Academic Writing Task essays by comparing AI-generated and human examiner scores and feedback. Despite the increasing adoption of AI-based assessment tools, limited empirical evidence exists regarding their validity and reliability in high-stakes IELTS writing evaluation. Therefore, this study aims to determine whether significant differences exist between AI and human scoring and to examine the qualitative characteristics of the feedback provided. This research employed a mixed-method explanatory design involving ten participants who completed a computer-based IELTS prediction test. Their essays were independently evaluated by an AI scoring system and a human rater using IELTS band descriptors. Quantitative analysis using a paired-sample t-test measured differences in assigned scores, while qualitative content analysis examined patterns, depth, and focus of the feedback provided. The findings indicate a statistically significant difference between AI-generated and human-assigned scores (p = 0.022), with a mean difference of 0.4 points, suggesting that AI tended to assign higher scores. The feedback analysis reveals that AI primarily focuses on technical aspects such as grammar, vocabulary, and sentence structure, offering general improvement suggestions, whereas human feedback demonstrates greater depth and personalization. These results suggest that while AI enhances scoring efficiency, it cannot fully replace human evaluative judgment in complex academic writing assessment.

Co-Authors Joko Priyana

Title

Found 1 Documents
Search

Abstract

Title Search

Found 1 Documents Search

Abstract

Title

Found 1 Documents
Search