Claim Missing Document
Check
Articles

Found 40 Documents
Search

Fuzzy Time Series Singh Method for Forecasting Tourist Arrivals at Kinantan Wildlife and Cultural Park Bukittinggi Olivin Adelia Huqmi; Fadhilah Fitri; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/376

Abstract

Tourism is a key sector in regional development, contributing to economic growth, job creation, and cultural preservation. In Bukittinggi, West Sumatra, the Kinantan Wildlife and Cultural Park (TMSBK) is a major tourist destination, known for its historical and educational value. Tourist visits to TMSBK show fluctuating trends influenced by seasonal factors, socio-economic conditions, and national or global events. These dynamics make accurate forecasting essential for effective tourism planning and management. This study aims to forecast monthly tourist visits to TMSBK using the Fuzzy Time Series (FTS) Singh method, which is suitable for uncertain and fluctuating time series data. The research used historical visitor data from 2021 to 2024 obtained from the Central Bureau of Statistics. The forecasting process included defining the universe of discourse, forming class intervals, fuzzifying historical data, establishing fuzzy logical relationships (FLR), and generating forecasts. The accuracy of the forecasts was measured using Mean Absolute Percentage Error (MAPE), with a result of 19.8%, indicating good predictive performance. The results show that the FTS Singh method successfully follows the fluctuation pattern of actual visitor data. This method provides valuable insights for destination managers in planning operations, promotional efforts, and service improvements. Therefore, the FTS Singh method can be considered a reliable tool to support sustainable tourism development and decision-making in Bukittinggi.
Applying Robust Spatial Autoregressive Model to Analyze the Determinants of Open Unemployment in West Java Berliana Nofriadi; Suci Rahmadani; Sepniza Nasywa; Tessy Octavia Mukhti; Yenni Kurniawati
UNP Journal of Statistics and Data Science Vol. 3 No. 3 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss3/402

Abstract

Open unemployment is a critical macroeconomic challenge in developing regions like West Java, Indonesia, where spatial disparities and data anomalies complicate traditional analysis. This study addresses these limitations by employing a Robust Spatial Autoregressive (RSAR) model with M-Estimator, integrating spatial dependence and outlier resilience to enhance estimation accuracy. Using 2024 district-level data from Indonesia’s Central Bureau of Statistics (BPS) and Open Data Jabar, the research examines determinants such as labor force participation, education, and regional GDP. The methodology begins with Ordinary Least Squares (OLS) to identify initial predictors, followed by spatial diagnostics (Moran’s I, Lagrange Multiplier tests) to confirm spatial autocorrelation. A customized Queen contiguity weight matrix captures neighborhood effects, while robust M-Estimation mitigates outlier distortions. Results reveal that the RSAR model achieves superior explanatory power (R² = 0.8626) compared to OLS and standard Spatial Autoregressive (SAR) models, with labor force participation (X₄) emerging as a significant negative predictor of unemployment. Spatial effects (ρ = 0.337) though modest, underscore the importance of inter-regional dynamics. The study concludes that RSAR offers a more reliable framework for regional labor analysis, combining spatial rigor with robustness against data irregularities. Policy-wise, the findings advocate targeted interventions to boost labor participation and address localized disparities, emphasizing the need for spatially informed, outlier-resistant methodologies in economic planning.
Factors Affecting Households Program Keluarga Harapan Recipients in West Sumatra: Binary Logistic Regression Analysis Sonia Ardhi; Dodi Vionanda; Yenni Kurniawati; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 3 No. 3 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss3/406

Abstract

Poverty is still a complex issues in Indonesia. Poverty rate in West Sumatra province has increased over the past 3 years. One of the government's initiatives to address poverty is the Program Keluarga Harapan (PKH), which is a social protection program that provides conditional cash transfers to poor and vulnerable Keluarga Penerima Manfaat (KPM) on condition that they are registered in the Data Terpadu Kesejahteraan Sosial (DTKS). Although PKH has a positive impact on poverty alleviation and enhanced access to health, education, and social welfare, the implementation still faces major challenges such as data inaccuracies, particularly in targeting accuracy. Therefore, an analysis is needed to determine the factors that significantly affects PKH recipient households in West Sumatra Province. This research used variables from the DTKS variable group contained in SUSENAS 2024 using two stages one phase stratified sampling method with 11,600 observations consisting of 1,790 receiving PKH and 9,810 not receiving PKH. The dependent variable is PKH recipient status (Yes = 1, no = 0). Data were analyzed using binary logistic regression with a significance level of 5%. Based on the results of the analysis, it can be concluded that floor area of ​​the house, age of the household head, household size, education level of the household head, and floor material of the house have a significantly effect on PKH recipient households. Household size has the most influence on PKH receipt with a 40,3% probability of receiving PKH.
Stratified Cox Regression Approach to Identifying Prognostic Factors for Survival in Breast Cancer Patients Dhio Ervandi; Aisyah Novriani; Andini Diva Luthfiyah; Fauzan Al Hamdani Siregar; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 3 No. 4 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss4/418

Abstract

The most common type of cancer that affects women is Breast cancer. In 2022, 2.3 million women were diagnosed with breast cancer, and 670,000 deaths were recorded globally. By 2040, it is estimated that breast cancer will increase by 40%, reaching 3 million annually with the number of deaths increasing by 50% to 1 million in 2020. This highlights breast cancer as a serious threat to world health. This study utilized secondary data from METABRIC or the Molecular Taxonomy of Breast Cancer International Consortium obtained from the website www.kaggle.com/datasets/raghadalharbi/breast-cancer-gene-expression-profiles-metabric/data. The independent variables analyzed were, Age at Diagnosis (X­­1), Surgery Type (X­­2), Chemotherapy (X­­3), Hormone Therapy (X­­4), Tumor Size (X­­5), Radio Therapy (X­­6), Pam50. The dependent variables were Survival Time (Overall Survival Month) and Patient Status. In this study, we used the Stratified Cox model to predict the predictor variables of survival time. The total number of patients used was 18886, with 1080 censored patients and 788 uncensored patients. The Stratified Cox model without interaction revealed that the patients who underwent breast-conserving surgery had a 1.35 times higher risk of death compared to those who underwent mastectomy. Patients who received chemotherapy had a 2.01 times higher risk of death than those who did not, while patients who did hormone therapy had a 1.83 times higher risk of death than those who did not undergo this therapy.
An Examination of Determinants Affecting the Survival Duration Pediatric Brain Cancer Patients Through Stratified Cox Regression Analysis Fauzan Al-Hamdani Siregar; Andini Diva Luthfiyah; Tessy Octavia Mukhti; Dony Permana
UNP Journal of Statistics and Data Science Vol. 3 No. 4 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss4/420

Abstract

Brain cancer is the second most common pediatric malignancy and the leading cause of cancer-related mortality in children. Pediatric brain tumors (PBTs) represent around 25% of all pediatric cancers and consist of clinically and biologically diverse subtypes, with an estimated incidence of 0.3–2.9 cases per 100,000 children annually. The high prevalence emphasizes the importance of identifying factors that influence patient survival. This study aims to identify and analyze the factors that significantly affect the survival duration of pediatric brain cancer patients by applying the Stratified Cox regression model. This study utilized secondary data from the Pediatric Brain Cancer database (www.cbioportal.org). Independent variables included cancer type, ethnicity, other medical conditions, sex, tumor type, and treatment type, while the dependent variables were survival time (OS Months) and patient status (OS Status). Data were analyzed using the Stratified Cox regression method. A total of 203 patients were observed, consisting of 39 uncensored cases (19.21%) and 164 censored cases (80.79%). The majority of patients were male (58.62%), diagnosed with low-grade glioma/astrocytoma (43.35%), classified as non-Hispanic or Latino (93.52%), had no additional medical conditions (51.72%), received new treatment (85.22%), and were categorized with primary tumor type (74.38%). Results from the stratified Cox model indicated that cancer type was a significant predictor of survival. Children with embryonal tumors were found to have 8.9 times greater risk of experiencing an event compared to those with CNS cancer types, whereas children with high-grade glioma/astrocytoma had a 24.85 times higher risk compared to the CNS cancer group.
Peramalan Konsentrasi PM2.5 di Kota Medan Menggunakan Metode ARIMAX dengan Faktor Meteorologi sebagai Variabel Eksogen Fauzan Arrahman; Tessy Octavia Mukhti; Dony Permana; Fenni Kurnia Mutiya
UNP Journal of Statistics and Data Science Vol. 3 No. 4 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss4/429

Abstract

Particulate Matter 2.5 (PM2.5) is a fine particle measuring less than 2.5 micrometers which is dangerous for human health because it can penetrate the respiratory system and cause cardiovascular disorders. High PM2.5 concentrations reflect a decline in air quality, so forecasting efforts are needed to support pollution control and environmental policies. This study aims to forecast daily PM2.5 concentrations in Medan City using the Autoregressive Integrated Moving Average with Exogenous Variables (ARIMAX) method by considering meteorological factors as exogenous variables. The data used consist of PM2.5 concentrations and average temperature, humidity, rainfall, and wind speed data for the period from June 1, 2024 to June 10, 2025. The analysis results show that the best model is ARIMAX (4,1,0) with exogenous variables of average temperature and rainfall, where temperature has a positive effect and rainfall has a negative effect on PM2.5. This model meets the assumptions of white noise and residual normality, with a MAPE value of 20.635%, indicating a fairly good level of forecasting accuracy. The forecasting results show PM2.5 concentrations in the range of 19–26 µg/m³ with a downward trend at the end of June 2025, indicating improved air quality in Medan City. Thus, the ARIMAX method with meteorological factors is considered effective in modeling and forecasting PM2.5 dynamics in urban areas.
Forecasting Smallholder Oil Palm Yield in Riau Province through the SARIMA Approach Septrina Kiki Arisandi; Dony Permana; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/436

Abstract

Oil palm stands as one of Indonesia’s major agricultural sectors that plays a vital role in regional economic growth, particularly within Riau Province. However, its production often fluctuates due to seasonal and environmental factors, making accurate forecasting essential for planning and policy formulation. This study aims to forecast smallholder oil palm production in Riau Province through the Seasonal Autoregressive Integrated Moving Average (SARIMA) Approach. The data consist of monthly oil palm production from January 2006 to December 2023 obtained from the Central Bureau of Statistics (BPS) of Riau Province. The modeling process includes identifying the model structure, estimating parameters, performing diagnostic checks, and evaluating forecasting accuracy using the Mean Absolute Percentage Error (MAPE). The best model selected was SARIMA (2,0,0)(0,1,1)[12] with an AIC value of 4980.12 and a MAPE of 11.27%, indicating a good level of accuracy. The model effectively captured both seasonal and long-term trend patterns in production. The forecast results suggest that peak production typically occurs in August–September, while the lowest occurs in February–March. The study concludes that the SARIMA model provides a robust statistical framework for predicting oil palm production and can be applied as a decision-support tool in agricultural and economic planning for the province
Comparison of K-Means and Ward Methods in Clustering Indonesian Provinces Based on Household Basic Service Access Nurul Mulya; Fajri Juli Rahman Nur Zendrato; Muhammad Arief Rivano; Zamahsary Martha; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/449

Abstract

Disparities in household basic service access across provinces in Indonesia remain a key issue in regional development. Basic services such as access to improved drinking water, proper sanitation, electricity, and adequate housing are essential indicators of household welfare, making regional classification necessary to identify similarities and disparities among provinces. This study aims to cluster Indonesian provinces based on household basic service access indicators and to compare the performance of the K-Means method and Hierarchical Clustering using the Ward approach. The analysis was conducted using numerical data with Euclidean distance as a measure of similarity. The optimal number of clusters was determined using the Silhouette plot and further validated using the Silhouette Coefficient. The results indicate that both K-Means and Ward methods produce two optimal clusters representing provinces with relatively high and relatively low levels of household basic service access. Centroid analysis reveals clear differences between clusters across all indicators, particularly in electricity access and sanitation. Furthermore, the evaluation of clustering quality shows that the Ward method yields a higher Silhouette Coefficient than the K-Means method, indicating more compact clusters and better separation between clusters. Therefore, the Ward method is considered more effective in mapping patterns of household basic service access across provinces. The findings of this study can support regional planning by providing a clearer understanding of disparities in household basic service access in Indonesia.
Spatial Autoregressive Model to Factors Poverty Gap Index in West Java, 2023 Rahmat Kurniawan; Figo Rahmatullah; Fauzan Gustiandra; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/471

Abstract

Spatial analysis is the analysis of data with spatial effects. The spatial autoregressive is used when the effect of the dependent variable at one location is influenced by the value of the dependent variable at nearby or neighboring locations. The spatial autoregressive model is more appropriate to model the factors influencing the poverty depth index in West Java in 2023. Based on the Spatial Autoregressive modeling, the variables that influence the Poverty Depth Index in West Java are Population Density, Open Unemployment Rate, and economic growth. The SAR modeling produces a higher coefficient of determination compared to the linear model, which is 68.88% with an AIC value of 18.6149.
Pengelompokan Provinsi di Indonesia Berdasarkan Indikator Pendidikan Berkualitas Tahun 2025 Menggunakan Metode Self-Organizing Maps Dinda Putri Adilla; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 4 No. 1 (2026): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol4-iss1/472

Abstract

Access to quality education plays an essential role in improving people’s welfare and supporting sustainable development. As a fundamental component of social progress, quality education is not limited to academic attainment but also involves the development of skills, values, and character needed for meaningful participation in society. This study seeks to identify patterns and disparities in education quality among provinces by grouping regions based on multiple educational indicators. The indicators analyzed include Average Years of Schooling, Literacy Rate, Access to Information and Communication Technology, Gross Enrollment Rate, Net Enrollment Rate, and Teacher Qualifications. The data were examined using descriptive statistics, data visualization, and normalization, followed by clustering through the Self-Organizing Map (SOM) method as an unsupervised learning approach in data mining. Two clusters were formed to represent provinces with relatively higher and lower levels of educational quality. Cluster validity was assessed using internal validation measures, namely the Connectivity Index, Silhouette Index, and Dunn Index. The findings reveal that most basic education and literacy indicators show relatively favorable conditions; however, disparities remain evident in average years of schooling, ICT access, and participation in secondary and higher education. The clustering results indicate that 35 provinces fall into the group with relatively higher education quality, while 3 provinces are classified in the lower category. These results suggest that although the overall condition of education is relatively good, regional inequality in educational outcomes persists and requires targeted policy interventions to promote more balanced and inclusive development.