Claim Missing Document
Check
Articles

Comparison of the Fuzzy Time Series Chen Model and the Heuristic Model in Forecasting the Number of International Tourists in West Sumatra Rizki Akbar; Fitri, Fadhilah; Vionanda, Dodi; Mukhti, Tessy Octavia
Mathematical Journal of Modelling and Forecasting Vol. 2 No. 1 (2024): June 2024
Publisher : Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/mjmf.v2i1.20

Abstract

The Fuzzy Time Series Chen and Heuristic are two forecasting methods based on fuzzy logic used to predict values in time series. The FTS Chen and Heuristic models have almost identical forecasting processes, but the main difference lies in how they develop fuzzy logical relationships. The FTS Chen model uses Fuzzy Logical Relationship Groups obtained from the results of Fuzzy Logical Relationships for the forecasting process. On the other hand, the FTS Heuristic model uses Fuzzy Logical Relationships directly in the forecasting process. Fuzzy Logical Relationships are a collection of fuzzy logical relationships used to connect values in time series. By using Fuzzy Logical Relationships, the Heuristic model can predict values in time series more accurately and effectively. The forecasting is done to plan the development of tourism infrastructure, determine service needs, and optimize tourism promotion. The data shows that the number of foreign tourists visiting West Sumatra has continued to grow from 2006 to 2023. The comparison of the accuracy of the forecasting results of FTS Chen and Heuristic models for foreign tourists in West Sumatra yielded a MAPE of 0.241% for FTS model Chen and 0.194% for FTS model Heuristic. This indicates that the best forecasting model for foreign tourists is the Heuristic model due to its lower MAPE value.
Analisis Sentimen Program MSIB pada Aplikasi X (Twitter) Menggunakan Algoritma Naïve Bayes Husni, Nabila; Dodi Vionanda; Nur Leli; Syafriandi Syafriandi
UNP Journal of Statistics and Data Science Vol. 3 No. 2 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss2/361

Abstract

Certified Internships and Independent Studies (MSIB) is one of the programs of the Independent Learning-Independent Campus (MBKM) curriculum as a policy of the Kemendikbudristek. A government policy, especially in terms of education, will of course give rise to stigmas or feedback from the public regarding the policy. This research aims to find out public opinion regarding the MSIB program in the X (Twitter) application by sentiment analysis using the Naive Bayes Classifier algorithm. From this analysis, it was found that 84.6% of reviews had positive sentiments, while 16.4% of reviews had negative sentiments. Evaluation using the Naïve Bayes Classifier model shows that this model succeeded in classifying 85% of all data correctly, showing quite good performance in classifying the sentiment of these reviews.
Penerapan Algoritma Extreme Gradient Boosting dengan ADASYN untuk Klasifikasi Rumah Tangga Penerima Program Keluarga Harapan di Provinsi Sumatera Barat Susrifalah, Amelia; Vionanda, Dodi; Kurniawati, Yenni; Sulistiowati, Dwi
UNP Journal of Statistics and Data Science Vol. 3 No. 2 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss2/369

Abstract

Program Keluarga Harapan (PKH) is a form of social protection provided by the government to overcome poverty in Indonesia. However, challenges remain in accurately predicting eligible households. Therefore, a data-based classification method is needed to identify PKH recipients based on their factors. This research was conducted in West Sumatra Province using variables from the Data Terpadu Kesejahteraan Sosial (DTKS) variable group contained in SUSENAS 2024. Based on data from Badan Pusat Statistik (BPS) of West Sumatera Province, there are 1.790 PKH recipient households and 9.810 non-recipient households, indicating a class imbalance. Considering the large amount of data and complex variables, PKH can be analyzed using the Extreme Gradient Boosting (XGBoost) algorithm because of its ability to handle large-scale data and produce high classification performance. To address data imbalance, Adaptive Synthetic (ADASYN) was applied before analysis. The application of XGBoost with the scale_pos_weight parameter shows low classification performance, with sensitivity value of 12.3% and balanced accuracy of 55.2%. To overcome this, unbalanced data was handled using the ADASYN method. The application of XGBoost after data balancing with ADASYN showed significant performance improvement, with sensitivity value 80.4% and balanced accuracy 88.1%. In classifying PKH recipient households, the variables that make an important contribution are the age of the head of household, floor area, diploma of the head of household, floor material and number of household Members. This research shows that the combination of XGBoost and ADASYN is effective in overcoming data imbalance and improving PKH recipient classification performance.
Penerapan Partial Least Squares dan Pendekatan Robust dalam Analisis Diskriminan untuk Data Berdimensi Tinggi Rahmadina Adityana; Vionanda, Dodi; Permana, Dony; Fitri, Fadhilah
UNP Journal of Statistics and Data Science Vol. 3 No. 3 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss3/396

Abstract

Classical discriminant analysis, namely linear discriminant analysis and quadratic discriminant analysis, is generally known to suffer from singularity problems when exprerienced with high-dimensional data and is not robust to outliers that make the data not multivariate normally distributed. This research focuses on investigating the classification performance of discriminant analysis on high-dimensional data by applying two approaches, namely the Partial Least Square (PLS) dimension reduction approach as a solution to high-dimensional data and a robust approach with the Minimum Covariance Determinant (MCD) estimator technique that is robust to outliers. The data used for this study is Lee Silverman Voice Treatment (LSVT) data. PLS forms five optimal latent variables that represent predictor variable information. Based on the assumption test of covariance homogeneity between groups, the test statistic value is greater than the chi-square table or the p-value is smaller than the significance level, which means that the assumption is unfulfilled, so quadratic discriminant analysis is applied. The evaluation results showed that the quadratic discriminant analysis analysis model with the MCD approach on the PLS transformed data was able to achieve 81% accuracy, 71% precision, 86% recall, and 77% F1-score. These values indicate that both approaches are able to maintain the efficiency of discriminant analysis classification performance on high-dimensional and multivariate non-normally distributed data.
PayPal Usage in Indonesia with k-Nearest Neighbor Algorithm Amannia zeze; Muhammad Ravi Azzaki; Dodi Vionanda
UNP Journal of Statistics and Data Science Vol. 3 No. 3 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss3/405

Abstract

The development of information and digital technology has had a significant impact on the financial sector. In Indonesia, digital payment technologies such as PayPal, Gopay, Shopeepay, OVO, and DANA have become an integral part of the modern payment system. Since the implementation of the national electronic clearing system, RTGS, and ATMs in 2005, transactions have become increasinglyconvenient. This study analyzes user sentiment toward PayPal in Indonesia to understand user experience and provide insights for service development, marketing strategies, and brand reputation management. Review data from the PayPal app was collected from Google Plat via web scrapping and processed to yield 597 clean data points. Initial sentiment was categorized into positive, neutral, and negative, wordcloud visualization displayed positive and negative sentiment, while neutral sentiment was analyzed numerically. Automatic labeling was performed using the NLTK library based on rating values, above 3 positive, below 3 negative, and exactly 3 neutral. The results showed 146 positive reviews, 451 negative reviews, and a few neutral reviews. Sentiment classification using the K-Nearest Neighbor (K-NN) method yielded adequate accuracy, indicating that PayPal's acceptance in Indonesia is largely influenced by users' negative experiences. These findings provide a foundation for developing strategies to improve service quality and update PayPal's operational policies in the Indonesian market.
Factors Affecting Households Program Keluarga Harapan Recipients in West Sumatra: Binary Logistic Regression Analysis Ardhi, Sonia; Dodi Vionanda; Yenni Kurniawati; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 3 No. 3 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss3/406

Abstract

Poverty is still a complex issues in Indonesia. Poverty rate in West Sumatra province has increased over the past 3 years. One of the government's initiatives to address poverty is the Program Keluarga Harapan (PKH), which is a social protection program that provides conditional cash transfers to poor and vulnerable Keluarga Penerima Manfaat (KPM) on condition that they are registered in the Data Terpadu Kesejahteraan Sosial (DTKS). Although PKH has a positive impact on poverty alleviation and enhanced access to health, education, and social welfare, the implementation still faces major challenges such as data inaccuracies, particularly in targeting accuracy. Therefore, an analysis is needed to determine the factors that significantly affects PKH recipient households in West Sumatra Province. This research used variables from the DTKS variable group contained in SUSENAS 2024 using two stages one phase stratified sampling method with 11,600 observations consisting of 1,790 receiving PKH and 9,810 not receiving PKH. The dependent variable is PKH recipient status (Yes = 1, no = 0). Data were analyzed using binary logistic regression with a significance level of 5%. Based on the results of the analysis, it can be concluded that floor area of ​​the house, age of the household head, household size, education level of the household head, and floor material of the house have a significantly effect on PKH recipient households. Household size has the most influence on PKH receipt with a 40,3% probability of receiving PKH.
Nagari Tanjung Balik Menuju Digitalisasi Data Syafriandi, Syafriandi; Amalita, Nonong; Vionanda, Dodi; Fitria, Dina; Zilrahmi, Zilrahmi; Yarman, Yarman
Suluah Bendang: Jurnal Ilmiah Pengabdian Kepada Masyarakat Vol 22, No 3 (2022): Suluah Bendang: Jurnal Ilmiah Pengabdian kepada Masyarakat
Publisher : Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/sb.03280

Abstract

Kegiatan pembangunan hendaknya dilaksanakan dengan menggunakan perencanaan yang tepat. Perencanaan ini memerlukan informasi yang diperoleh dengan adanya data.  Nagari Tanjung Balik Kecamatan X Koto Singkarak merupakan salah satu nagari yang termasuk dalam Kecamatan X Koto Singakarak, Kabupaten Solok. Untuk perencanaan pembangunan, nagari ini memerlukan adanya informasi dan data. Namun, nagari ini tidak memiliki akses ke data sektoral yang terhimpun di BPS. Di sisi lain, nagari ini juga dihadapkan pada keterbatasan sumber daya yang memiliki pengetahuan tentang Statistika. Oleh karena itu, tim pengabdi melaksanakan Kegiatan Pengabdian kepada Masyarakat di Nagari Tanjung Balik untuk membantu mengatasi kedua masalah di atas.  Dari kegiatan pengabdian ini, pemerintah Nagari Tanjung Balik memiliki database yang terbaru, akurat, dan mudah diakses yang bisa digunakan untuk mengetahui informasi yang detail tentang masyarakat nagari ataupun untuk memetakan potensi dan masalah di nagari. Begitu pula, dari kegiatan ini,  pemerintah nagari telah memiliki kader yang bisa melakukan pengumpulan data di waktu yang akan datang dengan menggunakan aplikasi RSN dan mengelola database yang telah dibangun.
Do Prestigious Schools Still Exist in Padang? An Exploratory Study on State Junior High School Admission 2025 in Padang Vionanda, Dodi; Wood, Raihan Attaya; Susrifalah, Amelia
Rangkiang Mathematics Journal Vol. 4 No. 2 (2025): Rangkiang Mathematics Journal
Publisher : Department of Mathematics, Universitas Negeri Padang (UNP)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/rmj.v4i2.106

Abstract

In this study, we perform an exploratory study of New Student Admission datasets for public Junior High School in Padang in 2025. We utilized tables, barplots, and boxplots to present information contained in datasets and we carried out cluster analysis using HDBSCAN algorithm. For this study we made use of admitted students’ datasets for each admission pathway of all state Junior High Schools in Padang in 2025. We carried out this study to investigate the emergence of prestigious schools among public Junior High School in Padang amid the implementation on zoning system. Our study reveals the presence of group of prestigious schools along with group of schools that admitted students mostly live nearby the schools. Hence, it is recommended for Padang Municipal government to improve the quality of schools that are not considered as prestigious schools since there are many schools that admitted students mostly live nearby the school.
Handling Unbalanced Data with SMOTE Algorithm for Unemployment Classification in Lima Puluh Kota Regency Using CART Method Aldwi Riandhoko; Amalita, Nonong; Vionanda, Dodi; Salma, Admi
Indonesian Journal of Statistics and Applications Vol 8 No 2 (2024)
Publisher : Statistics and Data Science Program Study, IPB University, IPB University, in collaboration with the Forum Pendidikan Tinggi Statistika Indonesia (FORSTAT) and the Ikatan Statistisi Indonesia (ISI)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v8i2p166-177

Abstract

Unemployment is a problem that occurs in the labor force, where high unemployment is caused by the low ability of the labor force. A region that is still experiencing unemployment problems in West Sumatera is Lima Puluh Kota Regency. Unemployment in Lima Puluh Kota Regency is caused by the low competence of human resources to fulfill employment market requirements. Based on the results of the Sakernas survey in August 2023, Lima Puluh Kota Regency has more employed labor force than unemployed labor force, so this results in unbalanced data. A method that can overcome unbalanced data is Synthetic Minority Oversampling Technique (SMOTE). SMOTE is a technique with addition of synthetic data in minority class so that the proportion is balanced. Data imbalance conditions need to be handled so as to improve the performance of the classification model. Classification and Regression Trees (CART) is a classification technique with a decision tree method that can obtain the characteristics of a classification. The purpose of this research is to compare the CART model before and after applying SMOTE which can be measured by comparing the highest Area Under Curve (AUC) value. The AUC value in the CART method before SMOTE applied has a value of 62.1% while the AUC value in the CART method after SMOTE applied has a value of 70.2%. Therefore, it can be concluded that the CART classification analysis after SMOTE applied is able to provide better performance compared to the CART classification analysis before SMOTE applied.
Implementation of Fuzzy C-Means Algorithm for Clustering Provinces in Indonesia Based on Micro and Small Industry Ratio in Village Areas Rahmanesta, Frandito; Martha, Zamahsary; Vionanda, Dodi; Zilrahmi, Zilrahmi
Indonesian Journal of Statistics and Applications Vol 8 No 2 (2024)
Publisher : Statistics and Data Science Program Study, IPB University, IPB University, in collaboration with the Forum Pendidikan Tinggi Statistika Indonesia (FORSTAT) and the Ikatan Statistisi Indonesia (ISI)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v8i2p178-190

Abstract

Post-economic crisis, the micro and small industries contribute the most labor compared to other industries. Regional development sourced from small micro industries is a strategic force in developing a country because the development of small micro industries leads to realizing equitable welfare to reduce income inequality. Development in village areas is an important factor for regional development, reducing inequality between regions, and alleviating poverty. However, based on the 2018 PODES survey, there are regional imbalances in Indonesia in the small micro industry which is centralized on Java Island. Therefore, clustering and characteristics of the province were carried out based on the PODES survey of the small micro industry sector. This research uses the Fuzzy C-Means algorithm to cluster 34 provinces in Indonesia based on the ratio of small micro industries in village areas in 2021, to see how the development of small micro industries in village areas in each province in Indonesia. Fuzzy C-Means is one of the data clustering techniques that uses a fuzzy clustering model, where cluster formation is based on a membership degree value that varies between 0 and 1. The Fuzzy C-Means algorithm generates 4 clusters, cluster 1 and 2 represents provinces with high and very high micro and small industry development in village areas and cluster 3 and 4 represents provinces with medium and low micro and small industry development in village areas. The Fuzzy C-Means algorithm produces a good cluster structure with a silhouette coefficient value of 0,6406.