Jurnal Teknik Informatika (JUTIF)
Vol. 3 No. 4 (2022): JUTIF Volume 3, Number 4, August 2022

JARO WINKLER ALGORITHM FOR MEASURING SIMILARITY ONLINE NEWS

Teguh Efriyanto (Program Studi Informatika, Fakultas Ilmu Komputer, Universitas Amikom Yogyakarta, Indonesia)
Mardhiya Hayaty (Program Studi Informatika, Fakultas Ilmu Komputer, Universitas Amikom Yogyakarta, Indonesia)



Article Info

Publish Date
20 Aug 2022

Abstract

Online news is a source of information for people; this impacts journalists as news writers who can find news information quickly and accurately every day. Journalists can plagiarise other journalists or take news material from other news media sites and use it to publish in the media without including the source. An algorithm is needed to measure the similarity of online news. This work proposed the Jaro Winkler algorithm, with the value obtained from the calculation normalised so that the value 0 means there is no resemblance, and one means it has the exact resemblance. The data used is 20 online news media sites in the Central Kalimantan area. The Scraping process utilised the Custome Search JSON API and used keywords to get the news on the same topic. The results of the calculation of news similarity with the Jaro Winkler algorithm obtained an average value of online news similarity of 74.49%, with 43 news data with severe plagiarism levels and 12 news data with moderate plagiarism levels. There are weaknesses in the Jaro Winkler algorithm in calculating the similarity value in the data obtained. Some undetected data should have a heavy plagiarism level but not severe and vice versa.

Copyrights © 2022






Journal Info

Abbrev

jurnal

Publisher

Subject

Computer Science & IT

Description

Jurnal Teknik Informatika (JUTIF) is an Indonesian national journal, publishes high-quality research papers in the broad field of Informatics, Information Systems and Computer Science, which encompasses software engineering, information system development, computer systems, computer network, ...