Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer
Vol 2 No 10 (2018): Oktober 2018

Pengelompokan Artikel Berbahasa Indonesia Dengan Menggunakan Reduksi Fitur Information Gain Thresholding Dan K-Means

Novia Agusvina (Fakultas Ilmu Komputer, Universitas Brawijaya)
Indriati Indriati (Fakultas Ilmu Komputer, Universitas Brawijaya)
Nurudin Santoso (Fakultas Ilmu Komputer, Universitas Brawijaya)



Article Info

Publish Date
14 Feb 2018

Abstract

The increasing number of articles spread on the internet site, making it difficult for users to find the desired article. One of the online article service providers is Kompas.com. To face the competition among mass media industry, Kompas.com step is to provide features that facilitate the user, such as features related article recommendations. However, in its application Kompas.com is still less than the maximum so it remains inferior to other online mass media. In this study, researchers implemented a method of reducing the features of Information Gain Thresholding and K-Means to create a group of related articles. The purpose of this study is to improve the system related articles from Kompas.com. In implementing the use of java language. In the early stages of preprocessing to reduce the disturbance in the data, then the feature reduction is done to reduce the features used for faster process, then weighted as the basis for calculating the distance between documents, after finding the distance of the initial distance or centroid, grouping can be done. The results show that the clustering of articles using Information Gain Threshold and K-Means is good enough, has criteria of silhouette coefficient of 0.9595 and a purity measure of 0.75 with 3 clusters and 0.04 threshold limit, this conclude that it gives better purity compared to without feature reduction.

Copyrights © 2018






Journal Info

Abbrev

j-ptiik

Publisher

Subject

Computer Science & IT Control & Systems Engineering Education Electrical & Electronics Engineering Engineering

Description

Jurnal Pengembangan Teknlogi Informasi dan Ilmu Komputer (J-PTIIK) Universitas Brawijaya merupakan jurnal keilmuan dibidang komputer yang memuat tulisan ilmiah hasil dari penelitian mahasiswa-mahasiswa Fakultas Ilmu Komputer Universitas Brawijaya. Jurnal ini diharapkan dapat mengembangkan penelitian ...