Makara Journal of Technology
Vol. 14, No. 1

Ontology-Based Automatic Classification for News Articles in Indonesian Language

Basnur, Prajna Wira (Unknown)
Sensuse, Dana Indra (Unknown)



Article Info

Publish Date
01 Apr 2010

Abstract

Ontology-Based Automatic Classification for News Articles in Indonesian Language. Searching specific information will be difficult if relying only on query. Choosing less specific queries will result in a lot of irrelevant information fetched by the system. One of the most successful ways to overcome this problem is to perform document classification based on the topic. There are many methods that can be used to conduct such a classification, such as using statistical and machine learning approaches. However, those document classification methods require training the data or learning the documents. In this study, the authors attempted to classify documents using a method that doesn’t require learning the documents. This classification method uses ontology to classify documents. Document classification using ontology is done by comparing the value of similarity among documents and existing node in the ontology. A document is classified into a category or a node if it has the highest similarity value in one of the nodes in the ontology. The results show that ontology can be used to perform document classification. The recall value is 97.03%, the precision is 91.63%, and the f-measure is 94.02%.

Copyrights © 2010






Journal Info

Abbrev

publication:mjt

Publisher

Subject

Chemical Engineering, Chemistry & Bioengineering Civil Engineering, Building, Construction & Architecture Electrical & Electronics Engineering Engineering Materials Science & Nanotechnology Mechanical Engineering

Description

MAKARA Journal of Technology is a peer-reviewed multidisciplinary journal committed to the advancement of scholarly knowledge and research findings of the several branches of Engineering and Technology. The Journal publishes new results, original articles, reviews, and research notes whose content ...