cover
Contact Name
Tessy Octavia Mukhti
Contact Email
tessyoctaviam@fmipa.unp.ac.id
Phone
+6282283838641
Journal Mail Official
tessyoctaviam@fmipa.unp.ac.id
Editorial Address
LPPM Universitas Negeri Padang, Jalan Prof. Dr. Hamka, Air Tawar Barat, Kota Padang, Sumatera Barat 25131
Location
Kota padang,
Sumatera barat
INDONESIA
UNP Journal of Statistics and Data Science
ISSN : -     EISSN : 2985475X     DOI : 10.24036/ujsds
UNP Journal of Statistics and Data Science is an open access journal (e-journal) launched in 2022 by Department of Statistics, Faculty of Science and Mathematics, Universitas Negeri Padang. UJSDS publishes scientific articles on various aspects related to Statistics, Data Science, and its application. Articles can be in the form of research results, case studies, or literature reviews. All papers were reviewed by peer reviewers consisting of experts and academicians across universities.
Articles 236 Documents
Classification of Stroke Desease Using the Learning Vector Quantization Algorithm Andriarmi Andriarmi; Chairina Wirdiastuti; Syafriandi Syafriandi
UNP Journal of Statistics and Data Science Vol. 4 No. 2 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss2/499

Abstract

Stroke is one of the leading causes of death and disability worldwide, thereby making early detection crucial for timely and appropriate medical treatment. In clinical practice, stroke diagnosis is generally carried out through medical examinations and patient history analysis, but this process is time-consuming and depends on the subjective judgment of medical personnel. Therefore, machine learning approaches can be utilized to support disease classification more quickly and objectively. This study aims to analyze the performance of the Learning Vector Quantization (LVQ) method in classifying stroke disease using a dataset obtained from Kaggle. The dataset used in this study is imbalanced;therefore, the SMOTE (Synthetic Minority Over-sampling Technique) method was applied to handle class imbalance. The research stages included data preprocessing, splitting data into training and testing sets, LVQ model training, parameter optimization using learning rate and maximum epoch, and model evaluation using accuracy and sensitivity. The results show that the LVQ model trained on the original dataset achieved an accuracy of 95,72%, but failed to detect stroke cases with a sensitivity of 0%. After applying SMOTE, the best model achived a stroke sensitivity of 90%, although the accuracy decreased to 49,49% due to the high number of false positives. These findings indicate that LVQ is highly sensitive to data distribution and model parameters, making its performance on this dataset less optimal for stroke classification and more suitable as an initial screening tool.
Fuzzy Time Series Singh Method for Forecasting Tourist Arrivals at Kinantan Wildlife and Cultural Park Bukittinggi Olivin Adelia Huqmi; Fadhilah Fitri; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/376

Abstract

Tourism is a key sector in regional development, contributing to economic growth, job creation, and cultural preservation. In Bukittinggi, West Sumatra, the Kinantan Wildlife and Cultural Park (TMSBK) is a major tourist destination, known for its historical and educational value. Tourist visits to TMSBK show fluctuating trends influenced by seasonal factors, socio-economic conditions, and national or global events. These dynamics make accurate forecasting essential for effective tourism planning and management. This study aims to forecast monthly tourist visits to TMSBK using the Fuzzy Time Series (FTS) Singh method, which is suitable for uncertain and fluctuating time series data. The research used historical visitor data from 2021 to 2024 obtained from the Central Bureau of Statistics. The forecasting process included defining the universe of discourse, forming class intervals, fuzzifying historical data, establishing fuzzy logical relationships (FLR), and generating forecasts. The accuracy of the forecasts was measured using Mean Absolute Percentage Error (MAPE), with a result of 19.8%, indicating good predictive performance. The results show that the FTS Singh method successfully follows the fluctuation pattern of actual visitor data. This method provides valuable insights for destination managers in planning operations, promotional efforts, and service improvements. Therefore, the FTS Singh method can be considered a reliable tool to support sustainable tourism development and decision-making in Bukittinggi.
Application of Fuzzy Time Series Cheng in Forecasting Bukittinggi's Consumer Price Index Afifah Nabilah; Fadhilah Fitri; Yenni Kurniawati
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/395

Abstract

The Consumer Price Index (CPI) is one of the main indicators used to measure inflation and assess the public’s purchasing power. Based on CPI monitoring in March 2025, Bukittinggi City recorded the highest year-on-year (y-o-y) inflation rate in West Sumatra at 0.50 percent, with a CPI of 106.99. This indicates significant price fluctuations, which require careful analysis and forecasting to support regional economic policymaking. This study aims to forecast the CPI of Bukittinggi City for April 2025 using the Fuzzy Time Series (FTS) Cheng method. The data used consists of monthly CPI values from January 2020 to March 2025, totaling 63 observations, obtained from the official website of Statistics Indonesia (BPS). The forecasting result using the FTS Cheng method for April 2025 shows a CPI value of 106.19. To evaluate the model's accuracy, the Mean Absolute Percentage Error (MAPE) and Mean Absolute Error (MAE) were employed, yielding values of 0.82% and 0.90%, respectively. These values fall into the “very good” category based on standard forecasting accuracy criteria. The FTS Cheng method was selected due to its ability to accommodate data fluctuations and provide weighted relationships between fuzzy intervals, thus enhancing forecasting accuracy in dynamic economic conditions. However, this study is limited to univariate data and does not compare the FTS Cheng method with other forecasting models. This research provides valuable insights for local governments in designing effective economic strategies based on reliable predictive models.
Comparison of Agglomerative Hierarchical Clustering Methods for Grouping Indonesian Provinces Based on Community Literacy Development Index Olga Afrilly Putri; Bunga Nafandra; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/470

Abstract

Community literacy development is one of the important indicators in improving the quality of human resources in Indonesia. This study aims to group provinces in Indonesia based on the Community Literacy Development Index by considering the equity of library services, the adequacy of library collections, and the level of community visits per day. The method used is agglomerative hierarchical cluster analysis. Before grouping, the data is standardized to overcome differences in units and scales between variables. The selection of the best cluster method is done using the cophenetic correlation coefficient, while the determination of the optimal number of clusters uses the silhouette method. The results of the analysis show that the Average Linkage method is the most optimal hierarchical cluster method with the best number of clusters being four clusters. Each cluster has different characteristics, reflecting variations in community literacy levels, service equity, collection adequacy, and library visit intensity between provinces. These findings indicate disparities in community literacy development between regions in Indonesia. Therefore, the results of this study are expected to serve as a basis for consideration in formulating more effective and targeted literacy and library development policies.
Spatial Autoregressive Model to Factors Poverty Gap Index in West Java, 2023 Rahmat Kurniawan; Figo Rahmatullah; Fauzan Gustiandra; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/471

Abstract

Spatial analysis is the analysis of data with spatial effects. The spatial autoregressive is used when the effect of the dependent variable at one location is influenced by the value of the dependent variable at nearby or neighboring locations. The spatial autoregressive model is more appropriate to model the factors influencing the poverty depth index in West Java in 2023. Based on the Spatial Autoregressive modeling, the variables that influence the Poverty Depth Index in West Java are Population Density, Open Unemployment Rate, and economic growth. The SAR modeling produces a higher coefficient of determination compared to the linear model, which is 68.88% with an AIC value of 18.6149.
Pengelompokan Provinsi di Indonesia Berdasarkan Indikator Pendidikan Berkualitas Tahun 2025 Menggunakan Metode Self-Organizing Maps Dinda Putri Adilla; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/472

Abstract

Access to quality education plays an essential role in improving people’s welfare and supporting sustainable development. As a fundamental component of social progress, quality education is not limited to academic attainment but also involves the development of skills, values, and character needed for meaningful participation in society. This study seeks to identify patterns and disparities in education quality among provinces by grouping regions based on multiple educational indicators. The indicators analyzed include Average Years of Schooling, Literacy Rate, Access to Information and Communication Technology, Gross Enrollment Rate, Net Enrollment Rate, and Teacher Qualifications. The data were examined using descriptive statistics, data visualization, and normalization, followed by clustering through the Self-Organizing Map (SOM) method as an unsupervised learning approach in data mining. Two clusters were formed to represent provinces with relatively higher and lower levels of educational quality. Cluster validity was assessed using internal validation measures, namely the Connectivity Index, Silhouette Index, and Dunn Index. The findings reveal that most basic education and literacy indicators show relatively favorable conditions; however, disparities remain evident in average years of schooling, ICT access, and participation in secondary and higher education. The clustering results indicate that 35 provinces fall into the group with relatively higher education quality, while 3 provinces are classified in the lower category. These results suggest that although the overall condition of education is relatively good, regional inequality in educational outcomes persists and requires targeted policy interventions to promote more balanced and inclusive development.