Claim Missing Document
Check
Articles

Found 1 Documents
Search

Optimization of the K-Means Algorithm Using PCA Dimensionality Reduction for E-Commerce Customer Segmentation Bengi, Mahara; Atika, Syarifah; Gunawan, Chici Rizka; Gunawan, Chica Rizka
Journal Of Information System And Artificial Intelligence Vol. 7 No. 2 (2026): Vol.7 No. 2 (2026): Journal of Information System and Artificial Intelligence V
Publisher : Universitas Mercu Buana Yogyakarta

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26486/jisai.v7i2.306

Abstract

The rapid growth of the e-commerce industry in recent years has generated increasingly large and complex volumes of customer data. This data holds strategic potential to be analyzed in order to understand customer behavior patterns and to support data-driven decision-making. This study aims to identify customer segmentation through an unsupervised learning approach using Principal Component Analysis (PCA) and the K-Means algorithm. The dataset used in this research demonstrates good quality with no missing values, making it suitable for further analysis. Initial exploratory findings indicate that Total Spending, Number of Items Purchased, and Average Rating are the most significant variables in representing customer characteristics. The application of PCA successfully reduced data dimensionality while retaining 79.41% of the total variance, thus producing a more concise representation without compromising essential information. The clustering process using K-Means grouped customers into three clearly distinguishable clusters. The first cluster represents customers with high activity levels, the second cluster reflects customers with moderate activity, and the third cluster corresponds to customers with lower engagement intensity. Validation using the Elbow Method and Silhouette Score confirmed that k = 3 is the most optimal number of clusters. Cluster visualizations show strong separation between groups and consistent relationships among variables. This study demonstrates that the combination of PCA and K-Means is effective in producing informative and interpretable customer segmentation. These findings provide a foundation for subsequent analyses and support data-driven decision-making in e-commerce customer management.