Claim Missing Document
Check
Articles

PENERAPAN TEXT MINING DAN FUZZY C-MEANS CLUSTERING UNTUK IDENTIFIKASI KELUHAN UTAMA PELANGGAN PDAM TIRTA MOEDAL KOTA SEMARANG Genisia Pramestiloka Aulia; Tatik Widiharih; Iut Tri Utami
Jurnal Gaussian Vol 12, No 1 (2023): Jurnal Gaussian
Publisher : Department of Statistics, Faculty of Science and Mathematics, Universitas Diponegoro

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.14710/j.gauss.12.1.126-135

Abstract

Customer complaints can be handled effectively by identifying the main complaints that cause customers to be dissatisfied. Many customer complaints cause difficulty for PDAM Tirta Moedal Semarang to identify problems, which are frequently the primary complaints of customers. Grouping complaints that have similarities using Fuzzy C-Means Clustering will make the identification of the main customer complaints easier. Fuzzy C-Means uses fuzzy models, allows data to be a member of all formed clusters with membership level between 0-1. Fuzzy C-Means Clustering can also introduce more flexible patterns and show results in more accurate cluster placement. Text mining is used to convert textual data into numerical data. Customer complaints received through all contacts in PDAM Tirta Moedal Semarang from October–December 2021 were used as data. The clustering process forms 6 clusters,with the number of clusters tried being 3, 4, 5, and 6, which are seen by the smallest Xie-Beni Index. The main complaints from PDAM Tirta Moedal Semarang customer that seen through Word cloud in each cluster are that the water stops running in clusters 1 and 6 and the pipes leak in clusters 4 and 5. Complaints in clusters 2 and 3 are complaints related to water meters and water flow.
ANALISIS VOLATILITAS BITCOIN MENGGUNAKAN MODEL ARCH DAN GARCH Dheanisa Widyanti; sudarno sudarno; Tatik widiharih
Jurnal Gaussian Vol 12, No 2 (2023): Jurnal Gaussian
Publisher : Department of Statistics, Faculty of Science and Mathematics, Universitas Diponegoro

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.14710/j.gauss.12.2.254-265

Abstract

The popularity of Bitcoin increased significantly in 2021. Bitcoin is considered to deliver high returns in a relatively short period, indicating that bitcoin has high volatility. Data with high volatility usually violates the Autoregresstive IntegratedinMovinginAverage (ARIMA)in homoscedasticity assumption. The Autoregressive Conditional Heteroscedasticity (ARCH) and General Autoregressive Conditional Heteroscedasticity (GARCH) model is often used to overcome the problem of heteroscedasticity in thelARIMA model. The ARCH and GARCH models canfbe used to model thefvolatilityfof data. This Research uses ARCH and GARCH models to overcome the heteroscedasticity problem caused by the high volatility of Bitcoin data for the period 30th June 2018 to 30th June 2022. The results of this study suggest that there might be a heteroscedasticity problem in Bitcoin data. The bestffiimodel for Bitcoin data ismiARIMA(1,0,[4])-GARCH(1,1) with an AIC value of -1,4263 at a 95% confidence level
Klasifikasi Menggunakan Algoritma K-Nearest Neighbor pada Imbalance Class Data dengan SMOTE. (Studi Kasus: Nasabah Bank Perkreditan Rakyat “X”) Salsabilla Rizka Ardhana; Tatik Widiharih; Bagus Arya Saputra
Indonesian Journal of Applied Statistics Vol 6, No 2 (2023)
Publisher : Universitas Sebelas Maret

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.13057/ijas.v6i2.79389

Abstract

Rural Banks (Bank Perkreditan Rakyat/BPR) provide financial services to micro-businesses and low repayment communities, especially in rural areas. The main activity of the bank is lending. Customer credit classification is expected to assist BPR in anticipating potential bad loans. K-Nearest Neighbor classify current and potential bad credit status based on customer data from BPR “X” in Central Java in October 2022. K-Nearest Neighbor is effective against a large amount of training data and works based on the nearest neighbor. There is an imbalance class data which causes the classification process to focus more on the majority class. Imbalance class data is handled using Synthetic Minority Oversampling Technique (SMOTE) as an oversampling approach. Classification with the addition of SMOTE can improve the evaluation of classification accuracy, especially G-mean. G-mean is the most comprehensive measurement in term of  accuracy, sensitivity and specificity in evaluating classification performance on imbalance class data. The results of this research were able to increase g-mean to 58.55% and sensitivity to 45.46% by implementing SMOTE. Based on the classification results, it is concluded that K-Nearest Neighbor with SMOTE at k = 19 and a proportion of training data to test data of 70:30 is a more appropriate classification model to use for customer credit status. Keywords: Credit Status; K-Nearest Neighbor; Imbalance Class Data; SMOTE