Garuda - Garba Rujukan Digital

Journal of Software Engineering and Information System (SEIS)

Vol. 5 No. 1 (2025)

Putra, Bayu Anugerah (Unknown)
Mukhtar, Harun (Unknown)
Br Bangun, Elsi Titasari (Unknown)
Gusnanda, Alris (Unknown)
Maisyarah, Adila (Unknown)
Kurniawan, Muhammad Irgi (Unknown)
Pradipa, Raditya (Unknown)
Ali, Zurrahman Muhammad (Unknown)

Publish Date
24 Jan 2025

In the era of big data, data clustering becomes a major challenge due to the complexity and huge volume of data. The K-means algorithm is one of the clustering techniques that is often used due to its simplicity. However, K-means faces difficulties in handling high-dimensional and large-volume data. This study proposes an optimization of the K-means algorithm using the Principal Component Analysis (PCA) dimensionality reduction method to improve the efficiency and accuracy of big data clustering in cloud computing architecture. The KDD Cup 1999 dataset is used to test this method. The dataset undergoes pre-processing and dimensionality reduction using PCA, then K-means clustering is applied. The clustering results are evaluated using the Silhouette Score and Davies-Bouldin Index. The implementation is carried out in the Google Colab environment to utilize cloud computing resources. The results show that dimensionality reduction using PCA significantly reduces computational complexity and improves clustering quality. This method is effective in clustering big data, making it an efficient solution for data clustering in cloud computing architecture.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Journal of Software Engineering and Information System (SEIS)

Website

Abbrev

SEIS

Publisher

Universitas Muhammadiyah Riau

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Engineering

Description

Journal of Software Engineering and Information System (SEIS) is a peer-reviewed journal published twice a year (January and August) by the Department of Information System - Faculty of Computer Science, Universitas Muhammadiyah Riau. The scope of the journal is: Artificial Intelligent Business ...

Article Info

Abstract

OPTIMISASI ALGORITMA K-MEANS DENGAN METODE REDUKSI DIMENSI UNTUK PENGELOMPOKAN BIG DATA DALAM ARSITEKTUR CLOUD COMPUTING

Article Info

Abstract