EMITTER International Journal of Engineering Technology
Vol 5 No 2 (2017)

Classification of Radical Web Content in Indonesia using Web Content Mining and k-Nearest Neighbor Algorithm

Muh Subhan (Electronics Engineering Polytechnic Institute of Surabaya)
Amang Sudarsono (Electronics Engineering Polytechnic Institute of Surabaya)
Ali Ridho Barakbah (Electronics Engineering Polytechnic Institute of Surabaya)



Article Info

Publish Date
13 Jan 2018

Abstract

Radical content in procedural meaning is content which have provoke the violence, spread the hatred and anti nationalism. Radical definition for each country is different, especially in Indonesia. Radical content is more identical with provocation issue, ethnic and religious hatred that is called SARA in Indonesian languange. SARA content is very difficult to detect due to the large number, unstructure system and many noise can be caused multiple interpretations. This problem can threat the unity and harmony of the religion. According to this condition, it is required a system that can distinguish the radical content or not. In this system, we propose text mining approach using DF threshold and Human Brain as the feature extraction. The system is divided into several steps, those are collecting data which is including at preprocessing part, text mining, selection features, classification for grouping the data with class label, simillarity calculation of data training, and visualization to the radical content or non radical content. The experimental result show that using combination from 10-cross validation and k-Nearest Neighbor (kNN) as the classification methods achieve 66.37% accuracy performance with 7 k value of kNN method[1].

Copyrights © 2017






Journal Info

Abbrev

EMITTER

Publisher

Subject

Computer Science & IT

Description

EMITTER International Journal of Engineering Technology is a BI-ANNUAL journal published by Politeknik Elektronika Negeri Surabaya (PENS). It aims to encourage initiatives, to share new ideas, and to publish high-quality articles in the field of engineering technology and available to everybody at ...