IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
Vol 16, No 4 (2022): October

Automatic Essay Scoring Using Data Augmentation in Bahasa Indonesia

Nur Fadilah (Master Program of Computer Science, FMIPA UGM, Yogyakarta)
Sigit Priyanta (Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta)



Article Info

Publish Date
31 Oct 2022

Abstract

Essay is one of the assessments to find out the abilities of students in depth.  UKARA is an automatic essay scoring development that combines NLP and machine learning.  This study uses the datasets provided for the UKARA challenge which consists of 2 types, datasets A and B. The dataset provided is still small for the model creation  process so that it is one of the causes of the resulting model is not optimal. This research focuses on the process of adding or augmenting data using EDA (Easy Data Augmentation Techniques). There are four methods applied, namely Synonym Replacement (SR), Random Insertion (RI), Random Swab (RS), and Random Deletion (RD).  The data is used for model creation by using the BiLSTM method. Performa model evaluated using confusion matrix with nilai accyouracy, precision, recall dan f-measure.The results showed that the dataset A without augmentation using k-fold cross validation produced the highest accuracy value with a value of 85.07%. While the results in data B show EDA insert with k-fold cross validation of 72.78%.

Copyrights © 2022






Journal Info

Abbrev

ijccs

Publisher

Subject

Computer Science & IT Control & Systems Engineering

Description

Indonesian Journal of Computing and Cybernetics Systems (IJCCS), a two times annually provides a forum for the full range of scholarly study . IJCCS focuses on advanced computational intelligence, including the synergetic integration of neural networks, fuzzy logic and eveolutionary computation, so ...