PIKSEL : Penelitian Ilmu Komputer Sistem Embedded and Logic
Vol. 12 No. 2 (2024): September 2024

Naive Bayes Algorithm and TF-IDF for Detecting Plagiarism in Journal Articles

Azzahrah, Ladysa
Lindawati, Lindawati
Sholihin, Sholihin



Article Info

Publish Date
30 Sep 2024

Abstract

This study examines the implementation of a combination of Naïve Bayes, TF-IDF, and cosine similarity for detecting plagiarism in journal articles. Technological advances have increased the risk of plagiarism, which poses a serious threat to the integrity of science. The study aims to explain in detail how these algorithms are implemented to detect plagiarism and to measure the effectiveness of the combination. The method involves developing a Python-based system that is deployed as a website. The dataset consists of one hundred abstracts of Indonesian-language journal articles on the Internet of Things (IoT), taken from the Mendeley software. The plagiarism limit is set at a maximum threshold of 20%. Implementation proceeds through three stages: data preprocessing, text feature extraction using a combination of Naïve Bayes and TF-IDF, and similarity measurement with cosine similarity. The results show that this combination is effective in detecting the level of plagiarism in journal article abstracts and measures text similarity with high accuracy. The developed system extracts text features more effectively through the combination of Naïve Bayes and TF-IDF and measures text similarity accurately across various test scenarios. This research contributes to the development of fast and accurate plagiarism detection technology, especially in fields that require complex text analysis.
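
To make the TF-IDF and cosine similarity stages concrete, the following is a minimal sketch in plain Python, not the authors' system. Several things here are assumptions: the function names (`tokenize`, `tfidf_vectors`, `cosine_similarity`, `check`), the simple regex tokenizer standing in for the paper's preprocessing, the smoothed IDF formula, the toy abstracts, and the reading of the 20% limit as "flag a text when its highest similarity score exceeds 0.20". The Naïve Bayes component is left out because the abstract does not say how it is combined with TF-IDF.

```python
import math
import re
from collections import Counter

THRESHOLD = 0.20  # assumed reading of the 20% limit stated in the abstract


def tokenize(text):
    """Lowercase and keep alphabetic tokens (a stand-in for real preprocessing)."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, using a smoothed IDF."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {t: math.log((1 + n) / (1 + c)) + 1 for t, c in df.items()}
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({t: (cnt / len(tokens)) * idf[t] for t, cnt in tf.items()})
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def check(query, corpus, threshold=THRESHOLD):
    """Return (highest similarity to any corpus text, whether it exceeds the threshold)."""
    vectors = tfidf_vectors(corpus + [query])
    query_vec = vectors[-1]
    scores = [cosine_similarity(query_vec, v) for v in vectors[:-1]]
    best = max(scores, default=0.0)
    return best, best > threshold


if __name__ == "__main__":
    # Toy texts invented for illustration; they are not from the paper's dataset.
    corpus = [
        "sistem monitoring suhu ruangan berbasis iot",
        "sistem irigasi otomatis berbasis iot",
    ]
    score, flagged = check("monitoring suhu ruangan dengan sensor iot", corpus)
    print(f"highest similarity: {score:.2%}, flagged: {flagged}")
```

Because cosine similarity depends only on the direction of the vectors, normalizing term frequency by document length does not change the score; it is kept here only to make the weights easier to inspect.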

Copyrights © 2024






Journal Info

Abbrev

piksel

Publisher

Subject

Computer Science & IT; Decision Sciences, Operations Research & Management

Description

Jurnal PIKSEL is published by Universitas Islam 45 Bekasi as a venue for research results in the fields of computing and informatics. The journal was first published in 2013 and appears twice a year, in January and September. Starting in 2014, Jurnal PIKSEL underwent ...