Journal: Journal of Software Engineering and Information System (SEIS)

Optimization of the K-Means Algorithm with a Dimensionality Reduction Method for Big Data Clustering in Cloud Computing Architecture
Putra, Bayu Anugerah; Mukhtar, Harun; Br Bangun, Elsi Titasari; Gusnanda, Alris; Maisyarah, Adila; Kurniawan, Muhammad Irgi; Pradipa, Raditya; Ali, Zurrahman Muhammad
Jurnal Rekayasa Perangkat Lunak dan Sistem Informasi Vol. 5 No. 1 (2025)
Publisher : Department of Information System Muhammadiyah University of Riau

DOI: 10.37859/seis.v5i1.7616

Abstract

In the era of big data, clustering is a major challenge because of the complexity and sheer volume of the data. The K-means algorithm is one of the most widely used clustering techniques because of its simplicity. However, K-means struggles with high-dimensional, large-volume data. This study proposes optimizing K-means with Principal Component Analysis (PCA), a dimensionality reduction method, to improve the efficiency and accuracy of big data clustering in a cloud computing architecture. The method is tested on the KDD Cup 1999 dataset. The dataset is pre-processed and reduced in dimension with PCA, and K-means clustering is then applied. The clustering results are evaluated using the Silhouette Score and the Davies-Bouldin Index. The implementation runs in the Google Colab environment to make use of cloud computing resources. The results show that dimensionality reduction with PCA significantly reduces computational complexity and improves clustering quality. The method is effective for clustering big data, making it an efficient option for data clustering in a cloud computing architecture.
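The pipeline described in the abstract (pre-processing, PCA, K-means, then evaluation with the Silhouette Score and Davies-Bouldin Index) might look roughly like the sketch below. This is an illustration under stated assumptions, not the authors' implementation: it uses scikit-learn, replaces the KDD Cup 1999 data with a small synthetic dataset from `make_blobs`, and the helper name `cluster_with_pca`, the number of components, and the number of clusters are all hypothetical choices that do not come from the paper.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import davies_bouldin_score, silhouette_score
from sklearn.preprocessing import StandardScaler


def cluster_with_pca(X, n_components, n_clusters, seed=0):
    """Standardize X, project it with PCA, cluster with K-means, and score the result.

    Hypothetical helper for illustration; parameter values are not from the paper.
    """
    # Pre-processing: put every feature on a comparable scale before PCA.
    X_scaled = StandardScaler().fit_transform(X)

    # Dimensionality reduction: keep only the leading principal components.
    X_reduced = PCA(n_components=n_components, random_state=seed).fit_transform(X_scaled)

    # Clustering in the reduced space.
    labels = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed).fit_predict(X_reduced)

    return {
        "reduced": X_reduced,
        "labels": labels,
        # Silhouette: in [-1, 1], higher means better-separated clusters.
        "silhouette": silhouette_score(X_reduced, labels),
        # Davies-Bouldin: non-negative, lower means better-separated clusters.
        "davies_bouldin": davies_bouldin_score(X_reduced, labels),
    }


# Synthetic stand-in data (assumption): 500 samples, 20 features, 4 blobs.
X, _ = make_blobs(n_samples=500, n_features=20, centers=4, random_state=0)
result = cluster_with_pca(X, n_components=5, n_clusters=4)

print("reduced shape:", result["reduced"].shape)
print("silhouette:", result["silhouette"])
print("davies_bouldin:", result["davies_bouldin"])
```

Both metrics here are computed in the PCA-reduced space. Scores computed in spaces of different dimension are not directly comparable, so a comparison against plain K-means would need the same feature space for both evaluations.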