Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer
Vol 1 No 9 (2017): September 2017

Analisis Perbandingan Metode K-Means Dengan Improved Semi-Supervised K-Means Pada Data Indeks Pembangunan Manusia (IPM)

Gusti Ngurah Wisnu Paramartha (Fakultas Ilmu Komputer, Universitas Brawijaya)
Dian Eka Ratnawati (Fakultas Ilmu Komputer, Universitas Brawijaya)
Agus Wahyu Widodo (Fakultas Ilmu Komputer, Universitas Brawijaya)



Article Info

Publish Date
15 Jun 2017

Abstract

At this time with the growing amount of information, the concept of data mining getting known as an important tool in the management information. Refers to the concept of data mining, the most popular concept in data mining is a clustering technique. One well known clustering method is k-means traditional. But in its application, k-means method has some problems such as determining the value of K cluster and determining the initial cluster centers were done randomly making process was inconsistent and the results of the cluster becomes worse. Therefore, there is a method to overcome these problems are improved semi-supervised k-means clustering. With improved semi-supervised method that combines the supervised and unsupervised method, users only need to label a bit of data that has not been labeled, then the labeled data is used to find the optimal value of initial cluster center and K cluster that will optimizes the process and result of clustering process. On implementation, this research combine k-means algorithm and improved semi-supervised k-means to clustering human development index (HDI) data. HDI data chosen because it has the right characteristics for clustering such amounts of data and the data is divided into several clusters. On the testing improved semi-supervised k-means method giving out the average accuracy of 90.3%, better than k-means clustering that giving 73.7% accuracy. In the second testing, improved semi-supervised k-means method produces an average time for one convergent 1222.9959 seconds, better than k-means with 1504.75 seconds. The third testing, improved semi-supervised k-means generates an average number of iterations for one convergent more efficient than k-means with the number of iterations of 7.11 compared 9.72. Last, on the cluster quality testing using silhouette coefficient, improved semi-supervised k-means method giving average value 0.69880, better than the traditional k-means with an average value of 0.62734.

Copyrights © 2017






Journal Info

Abbrev

j-ptiik

Publisher

Subject

Computer Science & IT Control & Systems Engineering Education Electrical & Electronics Engineering Engineering

Description

Jurnal Pengembangan Teknlogi Informasi dan Ilmu Komputer (J-PTIIK) Universitas Brawijaya merupakan jurnal keilmuan dibidang komputer yang memuat tulisan ilmiah hasil dari penelitian mahasiswa-mahasiswa Fakultas Ilmu Komputer Universitas Brawijaya. Jurnal ini diharapkan dapat mengembangkan penelitian ...