JuTISI (Jurnal Teknik Informatika dan Sistem Informasi)
Vol 8 No 3 (2022): JuTISI

Serverless Named Entity Recognition untuk Teks Instruksional Pertanian Kota

Trisna Gelar (Politeknik Negeri Bandung)
Aprianti Nanda (Politeknik Negeri Bandung)
Akhmad Bakhrun (Politeknik Negeri Bandung)



Article Info

Publish Date
21 Dec 2022

Abstract

The evolution of document documentation, classification, and information retrieval includes named entity recognition (NER). The implementation of NER in the agricultural domain, in particular instructional texts or transcriptions of tutorial videos, will make it easier for the general public to understand the specific concepts and terms of urban agricultural activities such as crop production processes and procedures, agricultural methods and tools, harvest cycles, and handling plant pests or diseases. Spacy is an NLP tool, has two methods of developing NER models, namely with Toc2Vec and Transformer. Both methods have advantages and disadvantages, namely different sizes, performance and prediction speeds according to needs. The NER model can be implemented into a Serverless application, using the Functional as Services (FaaS) and Backend as Services (BaaS) approaches. For the subtopic of cultivating fruit crops in agricultural instructional literature, three NER models have been built in this study. First, the IndoBERT-based model, the Toc2Vec-based model with efficiency optimization, and the Toc2Vec-based model with accuracy optimization. The most efficient toc2vec model, with a f1-score of 0.71, is followed by the effective toc2vec model, with a f1-score of 0.60. The COUNT, PERIOD, and VERIETAS entities are consistently predicted incorrectly by the Toc2Vec model, which is unable to forecast numeric entities well. In addition, the Toc2Vec Model's better efficiency optimization directly relates the size of the model to the speed of word prediction per second, and the model is simple to integrate into a FaaS- and BaaS-based Serverless. The capabilities of Serverless M have been successfully tested using the black box method.

Copyrights © 2022






Journal Info

Abbrev

jutisi

Publisher

Subject

Computer Science & IT

Description

Paper topics that can be included in JuTISI are as follows, but are not limited to: • Artificial Intelligence • Business Intelligence • Cloud & Grid Computing • Computer Networking & Security • Data Analytics • Datawarehouse & Datamining • Decision Support System • E-Systems (E-Gov, ...