Journal of Progressive Information, Security, Computer and Embedded System
Vol. 4, No. 1, March (2026)

A Comparison of the Effectiveness of Back Translation and Easy Data Augmentation for Indonesian Automatic Short Answer Scoring

Nur Fadilah (Universitas Negeri Makassar)
Khawaritzmi Abdallah Ahmad (Universitas Negeri Makassar)
Muh. Isbar Pratama (Universitas Negeri Makassar)



Article Info

Publish Date
01 Mar 2026

Abstract

Automatic Short Answer Scoring (ASAS) is a Natural Language Processing (NLP) application that automatically assesses short-answer responses. A primary challenge in developing ASAS systems is the limited size and diversity of available datasets, which can harm a model's ability to generalize. Previous studies have shown that Easy Data Augmentation (EDA) based on IndoBERT-generated synonyms can improve model performance on the UKARA dataset; however, this approach is limited because augmentation is performed only at the word level. This study compares the effectiveness of Back Translation and IndoBERT-based Synonym EDA for Indonesian ASAS systems using the UKARA dataset. To ensure a fair comparison, the dataset, preprocessing procedures, FastText-based text representation, BiLSTM architecture, and evaluation methods were kept consistent across experiments, so that performance differences can be attributed to the augmentation technique. The experiments were conducted under both a non-K-fold evaluation scenario and a 3-fold cross-validation scenario. The results indicate that Back Translation outperformed IndoBERT-based Synonym EDA in most experimental settings, achieving the highest accuracy of 89.00% on Dataset A. Furthermore, the findings suggest that the quality and semantic diversity of the generated data affect model performance more than the sheer amount of additional training data. Back Translation can therefore serve as an effective alternative for enhancing dataset quality and improving the performance of Indonesian ASAS systems.

Keywords: Automatic Short Answer Scoring, Back Translation, Easy Data Augmentation, IndoBERT, BiLSTM, FastText.
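
The word-level limitation of synonym-based EDA mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the synonym table `TOY_SYNONYMS`, the function `synonym_replace`, and its parameters are hypothetical, and a small hand-written dictionary stands in for the IndoBERT-generated synonym candidates used in the study. The sketch only shows that this kind of augmentation swaps individual words while leaving sentence structure untouched, whereas Back Translation can rephrase the whole sentence.

```python
import random

# Hypothetical toy synonym table (assumption for illustration only);
# the study obtains synonym candidates from IndoBERT instead.
TOY_SYNONYMS = {
    "besar": ["luas", "agung"],
    "cepat": ["lekas", "kilat"],
    "bagus": ["baik", "indah"],
}


def synonym_replace(sentence, synonyms, n_replace=1, seed=0):
    """Replace up to n_replace words that have an entry in `synonyms`.

    Word order and sentence length are preserved, which is the
    word-level behaviour the abstract describes as a limitation.
    """
    rng = random.Random(seed)
    tokens = sentence.split()
    # Indices of tokens for which at least one synonym is known.
    candidates = [i for i, tok in enumerate(tokens) if tok.lower() in synonyms]
    rng.shuffle(candidates)
    for i in candidates[:n_replace]:
        tokens[i] = rng.choice(synonyms[tokens[i].lower()])
    return " ".join(tokens)


original = "rumah itu besar dan bagus"
augmented = synonym_replace(original, TOY_SYNONYMS, n_replace=1)
print(original)
print(augmented)
```

With `n_replace=1`, the augmented answer differs from the original in exactly one word, so the number of distinct sentence structures in the training set does not grow no matter how many augmented copies are generated.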

Copyrights © 2026






Journal Info

Abbrev

PISCES

Publisher

Subject

Computer Science & IT Engineering

Description

Focus and Scope: The PISCES scientific journal covers all aspects of the latest outstanding research and developments in the field of computer science, including artificial intelligence, data science, databases, computer performance analysis, computer security and cryptography, computer networks, ...