Claim Missing Document
Check
Articles

Pemodelan Tingkat Partisipasi Angkatan Kerja Terhadap Persentase Penduduk Miskin di Jawa Timur Tahun 2023 Menggunakan Metode B-Spline Ibnul farizi, Gilang; Zilrahmi; Dony Permana; Admi Salma
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/215

Abstract

Poverty is a common issue in Indonesia. Data on the Percentage of Poor Population against the Labor Force Participation Rate (LFPR) per district/city, consisting of 38 districts/cities in East Java Province in 2023, indicates that the highest percentage of poverty in East Java Province in 2023 was 21,760. Employment is considered the most effective solution to alleviate poverty. The data in this study shows a distribution pattern that does not form a specific pattern, making it difficult to analyze using parametric methods. Therefore, the appropriate approach is Nonparametric Regression. In this study, the nonparametric regression used is the B-Spline regression model. The suitability of the model is based on the Mean Squared Error (MSE) value of the model. The analysis results indicate that the B-Spline regression model achieves an MSE value of 20.11447. The optimal MSE value is obtained from B-Spline estimation with order 2. This suggests that the B-Spline method provides a good explanation in addressing the issue
Optimization of Sentiment Analysis for MBKM Program using Naïve Bayes with Particle Swarm Optimization Diva Aliyah; Zilrahmi; Yenni Kurniawati; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/220

Abstract

In early 2020, Kemendikbudristek launched the MBKM program with the aim of improving the quality of higher education through a student-focused learning approach. The launch of this program triggered various reactions on social media, especially on Twitter, both positive and negative. This study aims to analyze the sentiment of Twitter users towards the MBKM program using the Naive Bayes algorithm optimized with Particle Swarm Optimization (PSO). The data used are Indonesian tweets containing the keywords "MBKM" and "Merdeka Campus" from the period July to December 2022. The research stages include data collection through crawling, manual labeling of data into positive and negative sentiments, data preprocessing, application of the Naive Bayes algorithm, and feature selection with PSO. The results showed that the group of tweets categorized based on positive and negative sentiments towards the implementation of the MBKM program in Indonesia in 2022, showed that the NB-PSO experiment achieved an accuracy of 90.87%, an increase of 7.12% compared to the Naive Bayes algorithm alone. Thus, the use of Particle Swarm Optimization algorithm in Naive Bayes classification algorithm is proven to improve classification performance, especially in the case of sentiment analysis. Keywords: Sentiment Analysis, Merdeka Belajar Kampus Merdeka, Twitter, Naive Bayes, Particle Swarm Optimization.
PT.Telkom (Tbk) Stock Price Forecasting Using Long Short Term Memory (LSTM) nazhiroh, hanifah; Dina Fitria; Dony Permana; Zilrahmi
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/223

Abstract

The movement of the share price of PT Telkom (Tbk) fluctuates so it is necessary to do a forecasting analysis. Forecasting the share price of PT Telkom (Tbk) can be done using the Long Short Term Memory (LSTM) method. LSTM is a development of the Recurrent Neural Network (RNN) method. In this study using PT.Telkom (Tbk) stock price data for 2018-2023 and PT.Telkom (Tbk) stock price data after Covid-19 (20121-2023). The purpose of this research is to determine the movement of PT.Telkom (Tbk) stock prices in 2024, to find out the difference in forecasting using PT.Telkom (Tbk) 2018-2023 stock price data with PT.Telkom (Tbk) stock price data after covid-19 2021-2023, and to determine the level of accuracy of forecasting PT.Telkom (Tbk) stock prices using the LSTM method. The results showed that both data have a small MAPE value. to forecast the share price of PT.Telkom for 1 year, PT.Telkom (Tbk) share price data for 2018-2023 is used which has more data to analyze long-term forecasting. From the analysis results obtained MAPE of 1.016% with the optimal parameter combination of neuron 4, batch size 64, and epoch 80. The results of forecasting the share price of PT.telkom (Tbk) in 2024 experienced very rapid fluctuations with an average share price of PT.Telkom (Tbk) in 2024 Rp 4,668 / sheet.
Analysis of Factors Influencing the Number of Families at Risk of Stunting in Merangin Regency Using Mixed Geographically Weighted Regression Fadlan Rafly, Muhammad; Zilrahmi; Dony Permana; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/236

Abstract

The number of families at risk of stunting is among the significant concerns that have been a negative impact on developing superior human resources in Merangin Regency. The number of families at risk of stunting is sought to be solved by identifying the contributing components. MGWR is among the methods that may be employed to obtain a specific model that affects each obesrvasion location locally and a comprehensive model that is global. Multiple linear regression and GWR are used to create models MGWR used when data has the influence of spatial heterogeneity. This project aims to develop an MGWR model which will be used to calculate the amount families at risk of stunting in each sub-district in Merangin Regency who are at risk of stunting in 2022. A fixed gaussian kernel weighting matrix is used in MGWR modeling. At the very least CV of 0.6152241, A fixed gaussian kernel is utilized as the weighting function. The results indicate that the model obtained has an accuracy rate of 99.18%, which means that the predictor variables can explain the model by that percentage. Families with insufficient access to drinking water is one factor that significantly affects how many families are at risk of stunting, families with inadequate sanitation, maternal age less than 20 years and families with babies under five years old.
Perbandingan Analisis Diskriminan Kuadratik dengan Analisis Diskriminan Kuadratik Robust martha, Ully Martha; Dodi Vionanda; Dony Permana; Zilrahmi
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/315

Abstract

This study compared the performance of quadratic discrimination analysis and robust quadratic discrimination analysis using the Iris dataset from Kaggle. The robust quadratic discriminant analysis, designed to handle outliers and non-normal distributions, shows better performance with an Apparent Error Rate (APER) of 2.5%. In contrast, the quadratic discriminant analysis, used for data with multivariate normal distribution and different variance-covariance matrices among groups, yields an APER of 3.03%. These results indicate that robust quadratic discriminant analysis is more accurate in classification on this dataset compared to quadratic discriminant analysis. Keywords: Apparent Error Rate, Quadratic Discrimination Analysis, Robust Quadratic Discrimination Analysis
Sentiment Analysis of The Constitutional Court Decision Regarding Changes to The Age Limit for Presidentian and Vice Presidential Candidates Using Support Vector Machine Amanda, Abilya; Nonong Amalita; Dodi Vionanda; Zilrahmi
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/321

Abstract

The Constitutional Court (MK) as a judicial institution granted a judicial review on October 16, 2023 related to the Election Law Article 169 (q) Law No.7 of 2017 number 90/PUU-XXI/2023. The Constitutional Court approved the material test, leading to changes in the age limit for presidential and vice presidential candidates. This change caused controversy because it was considered to benefit one of the candidate pairs. This research aims to see the trend of public opinion towards policy changes by the government. This research uses the Support Vector Machine (SVM) method which divides the data into two classification classes. The application of linear, Radial Bias Function (RBF), and polynomial kernels resulted in the highest accuracy of 84%. The calculation of accuracy, precision, and recall is 84%, 22%, and 90%, respectively. Based on the resulting wordcloud, Positive words indicate backing for presidential and vice presidential candidates. Meanwhile, negative sentiments express disapproval of the Constitutional Court's decision concerning the changes to the age limit requirements for presidential and vice presidential candidates.
Application of Principal Component Analysis in Identifying Factors Affecting the Human Development Index Faisal, Muhammad; Fitri, Fadhilah; Zilrahmi
Mathematical Journal of Modelling and Forecasting Vol. 2 No. 2 (2024): December 2024
Publisher : Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/mjmf.v2i2.26

Abstract

This study examines the Human Development Index (HDI) in West Sumatra Province in 2023. The HDI is an essential indicator for measuring the success of efforts to improve the quality of human life. This research aims to identify the key factors that influence the HDI. The HDI is constructed from three fundamental dimensions that indicate human quality of life: health, education, and economy. The factors within each dimension tend to be strongly correlated, as they mutually influence one another, potentially leading to multicollinearity issues. Therefore, an analysis is conducted to reduce the number of original variables into new orthogonal variables while preserving the total variance of the original variables using Principal Component Analysis (PCA). Based on this background, the study applies PCA to address multicollinearity and to identify new, more representative variables. The study findings indicate that the factors influencing the HDI are the education and economic and health welfare indexes.
Analysis of The Effect of Unemployment, Economic Growth and Inflation on Poverty in West Sumatra Province Ulya Syafitri.J; Zilrahmi; Admi Salma
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/329

Abstract

Poverty remains a major challenge in West Sumatra, although various efforts have been made to improve community welfare. In this context, it is important to understand the factors that influence poverty levels. Unemployment, economic growth and inflation are several important variables that can have a significant effect on poverty levels. Unemployment is one of the problems that is often associated with poverty. On the other hand, strong economic growth has the potential to reduce poverty levels by creating new job opportunities and increasing people's incomes. However, non-inclusive economic growth can increase social inequality and uneven income distribution, which in the end can worsen poverty. Apart from that, inflation can also affect poverty levels by reducing people's purchasing power, especially those with low incomes. This research aims to analyze the effect of unemployment, economic growth and inflation on poverty levels. The multiple linear regression analysis method is used to test the relationship between the independent variables (unemployment, economic growth and inflation) and the dependent variable (poverty). Based on the research findings, it can be concluded that unemployment, economic growth and inflation contribute to poverty in West Sumatra at 49,35% and the remainder 50,65% is explained by other factors outside the model.The analysis indicates a significant linear influence on unemployment and economic growth on poverty in West Sumatra and there is no significant linear impact of inflation  on poverty in West Sumatra.
STUDY ON EMD METHOD FOR PREDICTING THE PRICE OF CURLY RED CHILI IN INDONESIA Zilrahmi Zilrahmi; Hari Wijayanto; Farit M Afendi; Rizal Bakri
Indonesian Journal of Statistics and Applications Vol 4 No 2 (2020)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v4i2.600

Abstract

The fluctuations of curly red chili price affect the inflation rate in Indonesia. So that, the basic characteristics of price movement and correctly prediction for curly red chili price become concern in various studies. Empirical Mode Decomposition (EMD) method helps to examine behavioral characteristics of curly red chili prices in Indonesia easily. Ensemble EMD (EEMD) and modified EEMD are the decomposition method of time series which is development of EMD method. The decomposed data with EMD methods can also used for price forecast. The forecasting with ARIMA and trend polynomial performed to assess the effect of decomposition with EMD methods for forecast stability of curly red chili price in Indonesia under various conditions. The results show the most influence factor for price fluctuation of curly red chili in Indonesia is season and growing season. In this case, the ability of a decomposition method to produce the actual components that describe the pattern of data signals affect the accuracy of the predicted value obtained using the model. The predicted value using the decomposed data by modified EEMD always better than EEMD on the overall condition.
Implementation of Fuzzy C-Means Algorithm for Clustering Provinces in Indonesia Based on Micro and Small Industry Ratio in Village Areas Frandito Rahmanesta; Zamahsary Martha; Dodi Vionanda; Zilrahmi Zilrahmi
Indonesian Journal of Statistics and Applications Vol 8 No 2 (2024)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v8i2p178-190

Abstract

Post-economic crisis, the micro and small industries contribute the most labor compared to other industries. Regional development sourced from small micro industries is a strategic force in developing a country because the development of small micro industries leads to realizing equitable welfare to reduce income inequality. Development in village areas is an important factor for regional development, reducing inequality between regions, and alleviating poverty. However, based on the 2018 PODES survey, there are regional imbalances in Indonesia in the small micro industry which is centralized on Java Island. Therefore, clustering and characteristics of the province were carried out based on the PODES survey of the small micro industry sector. This research uses the Fuzzy C-Means algorithm to cluster 34 provinces in Indonesia based on the ratio of small micro industries in village areas in 2021, to see how the development of small micro industries in village areas in each province in Indonesia. Fuzzy C-Means is one of the data clustering techniques that uses a fuzzy clustering model, where cluster formation is based on a membership degree value that varies between 0 and 1. The Fuzzy C-Means algorithm generates 4 clusters, cluster 1 and 2 represents provinces with high and very high micro and small industry development in village areas and cluster 3 and 4 represents provinces with medium and low micro and small industry development in village areas. The Fuzzy C-Means algorithm produces a good cluster structure with a silhouette coefficient value of 0,6406.