Journal of Electrical, Electronic, Information, and Communication Technology (JEEICT)
Vol 5, No 2 (2023): JOURNAL OF ELECTRICAL, ELECTRONIC, INFORMATION, AND COMMUNICATION TECHNOLOGY

Grammatical Error Correction (GEC) of Indonesian Text Based on Neural Machine Translation (NMT)

Nike Sartika (UIN Sunan Gunung Djati Bandung)
Yuda Sukmana (Institut Teknologi Bandung)



Article Info

Publish Date
30 Nov 2023

Abstract

Writing errors in Indonesian are often found in various writings made in educational, government and mass media environments. The most dominant error is in spelling. This research proposes a Grammatical Error Correction (GEC) for Indonesian using the Neural Machine Translation (NMT) method, namely seq2seq, which is popularly used for English and has achieved the best performance approaching human capabilities. The model developed is made into a web-based service that is easy for users to access. The datasets used in this experiment are artificial datasets sourced from several studies regarding error analysis in Indonesian. The research results show that with the help of currently available open-source tools such as OpenNMT-py, it is possible to simplify the training process of NMT-based GEC models. Unfortunately, the small number of datasets leads to poor predictions for random sentences.

Copyrights © 2023






Journal Info

Abbrev

jeeict

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering

Description

Journal of Electrical, Electronic, Information and Communication Technology (JEEICT) is a peer-reviewed open-access journal in English published twice a year by the Department of Electrical Engineering, Sebelas Maret University, Indonesia. The JEEICT aims to provide a leading-edge medium for ...