Journal of Development Research
Vol. 5 No. 1 (2021): Volume 5, Number 1, May 2021

Implementation of Weighted Tree Similarity and Cosine Sorensen-Dice Algorithms for Semantic Search in Document Repository Information System

Abdurrosyiid Amrullah (Universitas Muhammadiyah Gresik)
Indra Gita Anugrah (Universitas Muhammadiyah Gresik)



Article Info

Publish Date
31 May 2021

Abstract

Document search has several approaches, including full-text search, plain metadata search and semantic search. This study uses the Weighted Tree Similarity algorithm with the Cosine Sorensen Dice algorithm to calculate the semantic search similarity. In this study, document metadata is represented in the form of a tree that has labeled nodes, labeled branches and weighted branches. The similarity calculation on the subtree edge label uses Cosine Sorensen Dice, while the total similarity of a document uses the weighted tree similarity. The metadata structure of the document uses the taxonomy owner, description, title, disposition content and type. The result of this research is a document search application with taxonomic weight on file storage.

Copyrights © 2021






Journal Info

Abbrev

JDR

Publisher

Subject

Civil Engineering, Building, Construction & Architecture Education Social Sciences

Description

Journal of Development Research, a journal runs by the Institute for Research and Community Services Universitas Nahdlatul Ulama Blitar. Journal facilitates publication of the results of research in the field of human and region development. The journal is published in two versions, the print and ...