Lontar Komputer: Jurnal Ilmiah Teknologi Informasi
Vol. 15, No. 2 (August 2024)

The BERT Uncased and LSTM Multiclass Classification Model for Traffic Violation Text Classification

Komang Ayu Triana Indah (Politeknik Negeri Bali)
I Ketut Gede Darma Putra (Information Technology Department Udayana University)
I Made Sudarma (Information Technology Department Udayana University)
Rukmi Sari Hartati (Electrical Engineering Department Udayana University)
Minho Jo (Department of Computer and Information Science, Korea University)



Article Info

Publish Date
31 Jan 2025

Abstract

The growing volume of internet content makes it difficult for users to find information through search. One way to address this is to classify news by context, which avoids material that is open to multiple interpretations. This research combines the Bidirectional Encoder Representations from Transformers (BERT) Uncased model with a Long Short-Term Memory (LSTM) architecture to train a text classification model that categorizes news articles about traffic violations. Data were collected by crawling an online media application's API and were used as both unmodified and modified datasets. The BERT Uncased-LSTM model with the best hyperparameter scenario (batch size 16, learning rate 2e-5, and average pooling) obtained Precision, Recall, and F1 values of 97.25%, 96.90%, and 98.10%, respectively. The results show that test scores on the unmodified dataset are higher than on the modified dataset, because selecting only words with high information value in the modified dataset makes it harder for the model to understand the context during text classification.
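
The abstract reports Precision, Recall, and F1 for a multiclass classifier but does not say how the per-class scores were averaged. As a rough illustration only, the sketch below computes macro-averaged scores in plain Python. The macro-averaging choice, the function names, and the example labels (`speeding`, `red_light`, `no_helmet`) are assumptions made for this example and are not taken from the paper.

```python
from collections import Counter


def per_class_counts(y_true, y_pred):
    """Count true positives, false positives, and false negatives per label."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for true_label, pred_label in zip(y_true, y_pred):
        if true_label == pred_label:
            tp[true_label] += 1
        else:
            fp[pred_label] += 1  # predicted this label, but it was wrong
            fn[true_label] += 1  # missed the actual label
    return tp, fp, fn


def macro_scores(y_true, y_pred):
    """Return macro-averaged (precision, recall, F1) over all observed labels.

    Assumption: macro averaging (unweighted mean of per-class scores);
    the source abstract does not state which averaging it used.
    """
    tp, fp, fn = per_class_counts(y_true, y_pred)
    labels = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f1s = [], [], []
    for label in labels:
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        p = tp[label] / p_den if p_den else 0.0
        r = tp[label] / r_den if r_den else 0.0
        f1 = 2 * p * r / (p + r) if (p + r) else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f1)
    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


# Hypothetical traffic-violation labels, used purely for illustration.
y_true = ["speeding", "speeding", "red_light", "no_helmet", "no_helmet", "red_light"]
y_pred = ["speeding", "red_light", "red_light", "no_helmet", "no_helmet", "red_light"]

precision, recall, f1 = macro_scores(y_true, y_pred)
print(f"precision={precision:.4f} recall={recall:.4f} f1={f1:.4f}")
```

Macro averaging weights every class equally, which is a common choice when class sizes differ. A weighted or micro average would give different numbers on imbalanced data.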

Copyrights © 2024






Journal Info

Abbrev

lontar

Subject

Computer Science & IT

Description

Lontar Komputer [ISSN Print 2088-1541] [ISSN Online 2541-5832] is a journal that focuses on the theory, practice, and methodology of all aspects of technology in the field of computer science and engineering as well as productive and innovative ideas related to new technology and information ...