TELKOMNIKA (Telecommunication Computing Electronics and Control)
Vol 19, No 3: June 2021

Bigram feature extraction and conditional random fields model to improve text classification clinical trial document

Jasmir Jasmir (Universitas Sriwijaya)
Siti Nurmaini (Universitas Dinamika Bangsa)
Reza Firsandaya Malik (Universitas Sriwijaya)
Bambang Tutuko (Universitas Sriwijaya)



Article Info

Publish Date
01 Jun 2021

Abstract

In the field of health and medicine, there is a very important term known as clinical trials. Clinical trials are a type of activity that studies how the safest way to treat patients is. These clinical trials are usually written in unstructured free text which requires translation from a computer. The aim of this paper is to classify the texts of cancer clinical trial documents consisting of unstructured free texts taken from cancer clinical trial protocols. The proposed algorithm is conditional random Fields and bigram features. A new classification model from the cancer clinical trial document text is proposed to compete with other methods in terms of precision, recall, and f-1 score. The results of this study are better than the previous results, namely 88.07 precision, 88.05 recall and f-1 score 88.06.

Copyrights © 2021






Journal Info

Abbrev

TELKOMNIKA

Publisher

Subject

Computer Science & IT

Description

Submitted papers are evaluated by anonymous referees by single blind peer review for contribution, originality, relevance, and presentation. The Editor shall inform you of the results of the review as soon as possible, hopefully in 10 weeks. Please notice that because of the great number of ...