Jurnal Transformatika
Vol. 18 No. 1 (2020): July 2020

Sensitivitas Sistem Pencarian Artikel Bahasa Indonesia Menggunakan Metode n-gram Dan Tanimoto Cosine

Supriadi, Candra (Unknown)
Purnomo, Hidriyanto Dwi (Unknown)
Sembiring, Irwan (Unknown)



Article Info

Publish Date
29 Jul 2020

Abstract

The human need for technology and the availability of adequate infrastructure is evidence that technology is now a part of basic human needs. The increasing number of journals and scientific papers, it must be more selective in selecting and sorting even though there are already many online service providers and journal portals. Research on search engines and plagiarism and recommendation systems has been carried out with various methods deemed appropriate to improve the performance of the system itself, this paper has the purpose of calculating the similarity between one article with another article by implementing n-gram and tanimoto cosine. The number of articles tested was forty-three titles and abstracts, tested fifty times with randomly selected keywords, by breaking down each title and abstract sentence into n characters (n = 2 to 8) including spaces and punctuation, then counted similarity with the query or keyword used for system testing. The test was conducted using several threshold variations from n = 2 to 8. After observing fifty times the threshold test of 0.15 has the highest accuracy at n = 4 at 0.92, the highest precision at n = 3 at 0.42 and the highest recall at the test n = 2 = 0.44 .

Copyrights © 2020






Journal Info

Abbrev

TRANSFORMATIKA

Publisher

Subject

Computer Science & IT

Description

Transformatika is a peer reviewed Journal in Indonesian and English published two issues per year (January and July). The aim of Transformatika is to publish high-quality articles of the latest developments in the field of Information Technology. We accept the article with the scope of Information ...