Jurnal ULTIMATICS
Vol 9 No 1 (2017): Ultimatics: Jurnal Ilmu Teknik Informatika

Deteksi Komentar Spam Bahasa Indonesia Pada Instagram Menggunakan Naive Bayes

Antonius Rachmat C (Universitas Kristen Duta Wacana Yogyakarta)
Yuan Lukito (Universitas Kristen Duta Wacana Yogyakarta)



Article Info

Publish Date
26 Apr 2017

Abstract

Instagram is the most famous pictures and videos media sharing based on the web & mobile application. Instagram users can have picture posts that can be commented by their followers. Indonesian public figures such as actors, actresses, musicians use Instagram to promote their activities to their followers. Unfortunately, there are a lot of spam comments in Instagram that need special attention and have to be removed. This research grabs Instagram comments and builds the dataset from Indonesian public figures who have more than one million followers. By using preprocessing (tokenization, stop words removal, and stemming), TF-IDF weighting, and supervised learning, Naive Bayes method is used to detect spam comments in Indonesian. Naive Bayes produces 74,31% accuracy rate on unbalanced datasets and 77,25% accuracy rate on balanced datasets. This result shows that Naïve Bayes can be used to build an automatic Indonesian spam comments detector on Instagram with high accuracy rate. The novelty of this research is that Naive Bayes can be used to detect spam comment on our Indonesian Instagram comments dataset. Index Terms—Instagram, Naive Bayes, Indonesian spam comments, spam comments detection.

Copyrights © 2017






Journal Info

Abbrev

TI

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering Engineering

Description

Jurnal ULTIMATICS merupakan Jurnal Program Studi Teknik Informatika Universitas Multimedia Nusantara yang menyajikan artikel-artikel penelitian ilmiah dalam bidang analisis dan desain sistem, programming, algoritma, rekayasa perangkat lunak, serta isu-isu teoritis dan praktis yang terkini, mencakup ...