Dary Ardiansyah Haryono
Fakultas Ilmu Komputer, Universitas Brawijaya

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Klasifikasi Ulasan Palsu menggunakan MDLText dengan Seleksi Fitur IG dan SMOTE Dary Ardiansyah Haryono; Fitra Abdurrachman Bachtiar
Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer Vol 4 No 5 (2020): Mei 2020
Publisher : Fakultas Ilmu Komputer (FILKOM), Universitas Brawijaya

Show Abstract | Download Original | Original Source | Check in Google Scholar

Abstract

Advancement of technology make buying product/service easier because the order can be made via online. Most people will consider reviews before buying product/service, so reviews greatly affect the number of product/service purchases. Therefore, fake review classification system was created using MDLText algorithm. In this research, comparing the accuracy, precision, recall and F-measure of 4 methods namely: MDLText algorithm, and MDLText algorithm with IG feature selection, MDLText algorithm with SMOTE, MDLText algorithm with SMOTE and IG feature selection. The data used are imbalanced data with a ratio of 1 fake review to 6 original reviews. Based on the test results, for the MDLText algorithm the results obtained from the change in the parameter α with a pretty good accuracy of 49,67% but with relatively low recall of 19,35%, precision 10,53%, and F-measure 13,64%. While the use of MDLText algorithm with IG feature selection is better when the threshold is 60% in terms of accuracy that is 77,48% but in terms of precision, recall and F-measure threshold 90% has a better value of 20,94%, 100% and 34,64%. In the use of MDLText algorithm with SMOTE, the most balanced composition values are at SMOTE 200% with an accuracy value of 60.93%, recall 9.68%, precision 8,82% and F-measure 9.23%. It can be concluded that the MDLText method is better with 200% SMOTE and 60% IG threshold feature selection in terms of accuracy, precision, recall and F-measure. For the k-fold validation testing the best result for accuracy, recall and F-measure at 2 fold with 81,55%, 12,5%, 7,69% and 9,52%.