Putra, Anugrah Dwiatmaja
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Analysis of named-entity effect on text classification of traffic accident data using machine learning Putra, Anugrah Dwiatmaja; Girsang, Abba Suganda
Indonesian Journal of Electrical Engineering and Computer Science Vol 25, No 3: March 2022
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijeecs.v25.i3.pp1672-1678

Abstract

With the rising number of accidents in Indonesia, it is still necessary to evaluate and analyze accident data. The categorization of traffic accident data has been developed using word embedding, however additional work is needed to achieve better results. Several informative named entities are frequently sufficient to differentiate whether or not information on a traffic accident exists. Named-entities are informational characteristics that can offer details about a text. The influence of named-entities on thematic text categorization is examined in this paper. The information was collected using a Twitter social media crawl. Preprocessing is done at the beginning of the process to modify and delete useful text as well as label specified entities. On Support Vector Machine (SVM), scheme comparisons were performed for (i) Word Embedding, (ii) the number of occurrences of Named Entities, and (iii) the combination of the two is known as a Hybrid. The Hybrid scheme produced an improvement in classification accuracy of 90.27 percent when compared to Word Embedding scheme and occurrences of named entities scheme, according to tests conducted using 1.885 data consisting of 788 accident data and 1.067 non-accident data.