TELKOMNIKA (Telecommunication Computing Electronics and Control)
Vol 18, No 4: August 2020

Semantics-based clustering approach for similar research area detection

Marion Oluwabunmi Adebiyi (Landmark University Omu Aran)
Emmanuel B. Adigun (Covenant University Ota)
Roseline Oluwaseun Ogundokun (Landmark University Omu Aran)
Abidemi Emmanuel Adeniyi (Landmark University Omu Aran)
Peace Ayegba (Landmark University Omu Aran)
Olufunke O. Oladipupo (Covenant University Ota)



Article Info

Publish Date
01 Aug 2020

Abstract

The manual process of searching out individuals in an already existing research field is cumbersome and time-consuming. Prominent and rookie researchers alike are predisposed to seek existing research publications in a research field of interest before coming up with a thesis. From extant literature, automated similar research area detection systems have been developed to solve this problem. However, most of them use keyword-matching techniques, which do not sufficiently capture the implicit semantics of keywords thereby leaving out some research articles. In this study, we propose the use of Ontology-based pre-processing, Latent Semantic Indexing and K-Means Clustering to develop a prototype similar research area detection system, that can be used to determine similar research domain publications. Our proposed system solves the challenge of high dimensionality and data sparsity faced by the traditional document clustering technique. Our system is evaluated with randomly selected publications from faculties in Nigerian universities and results show that the integration of ontologies in preprocessing provides more accurate clustering results.

Copyrights © 2020






Journal Info

Abbrev

TELKOMNIKA

Publisher

Subject

Computer Science & IT

Description

Submitted papers are evaluated by anonymous referees by single blind peer review for contribution, originality, relevance, and presentation. The Editor shall inform you of the results of the review as soon as possible, hopefully in 10 weeks. Please notice that because of the great number of ...