This Author published in this journals
All Journal Jurnal Infra
Tomy Widjaja
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Deteksi Plagiarisme pada Kode Bahasa Pemrograman Java menggunakan XGBoost Tomy Widjaja; Andre Gunawan; Liliana Liliana
Jurnal Infra Vol 10, No 2 (2022)
Publisher : Universitas Kristen Petra

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

With the ease of access to information and cloud server technology, it makes it easier for anyone to access the code data. Coupled with the industry 4.0 era, the number of informatics students is also increasing rapidly. This makes code plagiarism easier to do, especially in academic environment Manual checking of plagiarism is repetitive, difficult, and time-consuming task. Therefore, automation for high quality source code plagiarism detection is needed. The dataset used in this research was collected from “Dasar Pemrograman” class at Petra Christian University. After that the code will continue to tokenization preprocessing using java grammar stage. Then, the algorithm will calculate pairwise features using 3 main algorithms, namely levenshtein distance, greedy string tiling, and bigram which will produce 12 features and a collection of statistic features. Finally, the features will be used for the training and inference process on the XGBoost model. The test result shows that the proposed features have better performance metrics than previous research, it has f1-score of 99%. Implementation of preprocessing can also improve performance metrics on the features proposed in this study and in previous research.