Jurnal Varian
Vol 6 No 1 (2022)

Application of Soft-Clustering Analysis Using Expectation Maximization Algorithms on Gaussian Mixture Model

Andi Shahifah Muthahharah (Universitas Negeri Makassar, Indonesia)
Muhammad Arif Tiro (Universitas Negeri Makassar, Indonesia)
Aswi Aswi (Universitas Negeri Makassar, Indonesia)



Article Info

Publish Date
13 Nov 2022

Abstract

Research on soft-clustering has not been explored much compared to hard-clustering. Soft-clustering algorithms are important in solving complex clustering problems. One of the soft-clustering methods is the Gaussian Mixture Model (GMM). GMM is a clustering method to classify data points into different clusters based on the Gaussian distribution. This study aims to determine the number of clusters formed by using the GMM method. The data used in this study is synthetic data on water quality indicators obtained from the Kaggle website. The stages of the GMM method are: imputing the Not Available (NA) value (if there is an NA value), checking the data distribution, conducting a normality test, and standardizing the data. The next step is to estimate the parameters with the Expectation Maximization (EM) algorithm. The best number of clusters is based on the biggest value of the Bayesian Information Creation (BIC). The results showed that the best number of clusters from synthetic data on water quality indicators was 3 clusters. Cluster 1 consisted of 1110 observations with low-quality category, cluster 2 consisted of 499 observations with medium quality category, and cluster 3 consisted of 1667 observations with high-quality category or acceptable. The results of this study recommend that the GMM method can be grouped correctly when the variables used are generally normally distributed. This method can be applied to real data, both in which the variables are normally distributed or which have a mixture of Gaussian and non-Gaussian.

Copyrights © 2022






Journal Info

Abbrev

Varian

Publisher

Subject

Decision Sciences, Operations Research & Management Economics, Econometrics & Finance Mathematics Social Sciences Other

Description

Jurnal Varian adalah salah satu Jurnal Ilmiah yang terdapat di Universitas Bumigora. Jurnal ini bertujuan untuk memberikan wadah atau sarana publikasi bagi para dosen, peneliti dan praktisi baik di lingkungan internal maupun eksternal Universitas Bumigora Mataram. Jurnal ini terbit 2 (dua) kali ...