JOURNAL OF INFORMATICS AND TELECOMMUNICATION ENGINEERING
Vol. 7 No. 1 (2023): Issues July 2023

Machine Learning Model for Language Classification: Bag-of-words and Multilayer Perceptron

Devi Hawana Lubis (Universitas Sumatera Utara)
Sawaluddin Sawaluddin (Unknown)
Ade Candra (Unknown)



Article Info

Publish Date
28 Jul 2023

Abstract

The availability of data today has become a great asset for research that is used for various purposes such as for machine learning. One of the basic machine learning methods for natural language processing is bag-of-words. The problem in this study is the difficulty in classifying texts because texts still have unstructured characteristics, so this study will apply a model to classify the language of texts. Texts will be placed in four categories, English, Indonesian, German and French. Research was conducted using Bag-of-words and Multilayer Perceptron to solve this supervised machine learning problem. The use of Bag-of-words to perform text representation for simple patterns, easy processing and good performance. On the other hand, a multilayer perceptron has the ability to study complex data patterns in the form of images, text or videos. This study will collect data using text mining techniques, namely crawling Twitter social media as many as 4000 data records. This study produces a model with an accuracy of 98 percent with a loss of 0.14 percent which shows good model performance in classifying languages based on text data.

Copyrights © 2023






Journal Info

Abbrev

jite

Publisher

Subject

Computer Science & IT Engineering

Description

JURNAL TEKNIK INFORMATIKA, JITE (Journal of Informatics and Telecommunication Engineering) is a journal that contains articles / publications and research results of scientific work related to the field of science of Informatics Engineering such as Software Engineering, Database, Data Mining, ...