IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
Vol 13, No 3 (2019): July

Detection Of Spam Comments On Instagram Using Complementary Naïve Bayes

Nur Azizul Haqimi (Master Program of Computer Science and Electronics, FMIPA UGM, Yogyakarta)
Nur Rokhman (Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta)
Sigit Priyanta (Department of Computer Science and Electronics, FMIPA UGM, Yogyakarta)



Article Info

Publish Date
31 Jul 2019

Abstract

Instagram (IG) is a web-based and mobile social media application where users can share photos or videos with available features. Upload photos or videos with captions that contain an explanation of the photo or video that can reap spam comments. Comments on spam containing comments that are not relevant to the caption and photos. The problem that arises when identifying spam is non-spam comments are more dominant than spam comments so that it leads to the problem of the imbalanced dataset. A balanced dataset can influence the performance of a classification method. This is the focus of research related to the implementation of the CNB method in dealing with imbalance datasets for the detection of Instagram spam comments. The study used TF-IDF weighting with Support Vector Machine (SVM) as a comparison classification. Based on the test results with 2500 training data and 100 test data on the imbalanced dataset (25% spam and 75% non-spam), the CNB accuracy was 92%, precision 86% and f-measure 93%. Whereas SVM produces 87% accuracy, 79% precision, 88% f-measure. In conclusion, the CNB method is more suitable for detecting spam comments in cases of imbalanced datasets.

Copyrights © 2019






Journal Info

Abbrev

ijccs

Publisher

Subject

Computer Science & IT Control & Systems Engineering

Description

Indonesian Journal of Computing and Cybernetics Systems (IJCCS), a two times annually provides a forum for the full range of scholarly study . IJCCS focuses on advanced computational intelligence, including the synergetic integration of neural networks, fuzzy logic and eveolutionary computation, so ...