Claim Missing Document
Check
Articles

Application of Multivariate Adaptive Regression Splines for Modeling Stunting Toddler on The Island of Java Rahma, Dzakyyah; Nonong Amalita; Yenni Kurniawati; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 2 No. 3 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss3/205

Abstract

Stunting is a chronic nutritional problem experienced by toddlers, characterized by a shorter body height compared to children their age. The aim of this research is to model and determine the factors that influence Stunting on The Island of Java using Multivariate Adaptive Regression Spline (MARS). MARS is a modeling method that can handle high-dimensional data. The results of this study show that the best MARS model is a combination (BF=24, MI=3, and MO=2) with a minimum GCV value of 0.9475. Based on the model, the factors that significantly influence Stunting on the island of Java are babies receiving complete basic immunization (X4), babies getting exclusive breastfeeding (X3), pregnant women getting K4 (X1), and pregnant women getting TTD (X2). The level of importance of each variable is 100%, 81.64%, 60.38%, and 43.90%. Based on research results, babies receiving complete basic immunization is the variable that most influences stunting on The Island of Java in 2021.
Comparison of Linear Discriminant Analysis with Robust Linear Discriminant Analysis Fitri, Fitri Hayati; Dodi Vionanda; Yenni Kurniawati; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 2 No. 3 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss3/206

Abstract

Discriminant analysis is a multivariate method for dividing things into discrete groups and assigning new objects to existing categories. A discriminant function, which is a linear combination of independent variables used to categorize things into two or more groups or categories, is the result of discriminant analysis. The independent variables in a linear discriminant analysis must be multivariate normally distributed, and the covariance matrices for each group must be equal. In linear discriminant analysis, it is also essential to identify outliers because their existence in the data set can undermine the assumptions made by the method and lead to incorrect classification results. Therefore, in discriminant analysis, handling outliers with robust approaches is required. One such robust method in discriminant analysis is the Minimum Covariance Determinant (MCD), which is highly effective in dealing with outliers and relatively easier to apply compared to other robust methods. The aim of this study is to compare the classification results of linear discriminant analysis with robust linear discriminant analysis on the dataset of diabetes patients at RSUD Padangsidimpuan in 2023. The results obtained from this dataset indicate that linear discriminant analysis achieved an accuracy of 85,71%, while robust linear discriminant analysis achieved an accuracy of 80,95%. These findings suggest that the use of liniar discriminant analysis and robustt linear discriminant analysis can yield different results depending on the characteristics of the data and the number of outliers in the dataset.
Implementation of the Fuzzy C-Means Clustering Method in Grouping Provinces in Indonesia based on the Types of Goods Sold in E-commerce Businesses in 2022 Bimbim Oktaviandi; Tessy Octavia Mukhti; Yenni Kurniawati; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 2 No. 3 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss3/210

Abstract

The internet facilitates e-commerce by enabling efficient transactions and building consumer trust. With internet users in Indonesia reaching 204 million in 2022, it is crucial to Cluster provinces based on the types of goods and services sold online to design effective marketing strategies. The Fuzzy C-Means (FCM) method is used for Cluster analysis, allowing objects to have different membership degrees in multiple Clusters and providing accurate Cluster center placement. This study applies Fuzzy C-Means to Cluster 34 provinces in Indonesia based on the sale of goods/services in e-commerce in 2022, aiming to provide insights into market preferences and assist companies in developing more effective strategies. The results show that the method forms two Clusters. By evaluating standard deviation values and ratios, Fuzzy C-Means proves effective in Clustering provinces in Indonesia based on e-commerce sales data. Cluster validation reveals a standard deviation ratio of 0.14, indicating clear and significant Cluster separation.
Optimization of Sentiment Analysis for MBKM Program using Naïve Bayes with Particle Swarm Optimization Diva Aliyah; Zilrahmi; Yenni Kurniawati; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/220

Abstract

In early 2020, Kemendikbudristek launched the MBKM program with the aim of improving the quality of higher education through a student-focused learning approach. The launch of this program triggered various reactions on social media, especially on Twitter, both positive and negative. This study aims to analyze the sentiment of Twitter users towards the MBKM program using the Naive Bayes algorithm optimized with Particle Swarm Optimization (PSO). The data used are Indonesian tweets containing the keywords "MBKM" and "Merdeka Campus" from the period July to December 2022. The research stages include data collection through crawling, manual labeling of data into positive and negative sentiments, data preprocessing, application of the Naive Bayes algorithm, and feature selection with PSO. The results showed that the group of tweets categorized based on positive and negative sentiments towards the implementation of the MBKM program in Indonesia in 2022, showed that the NB-PSO experiment achieved an accuracy of 90.87%, an increase of 7.12% compared to the Naive Bayes algorithm alone. Thus, the use of Particle Swarm Optimization algorithm in Naive Bayes classification algorithm is proven to improve classification performance, especially in the case of sentiment analysis. Keywords: Sentiment Analysis, Merdeka Belajar Kampus Merdeka, Twitter, Naive Bayes, Particle Swarm Optimization.
Penerapan Metode Rating-Based Conjoint Analysis dalam Preferensi E-Wallet Mahasiswa Departemen Statistika Universitas Negeri Padang Putra, Dio Afdal; Dodi Vionanda; Yenni Kurniawati; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/222

Abstract

The rapid development of technology in the era of globalization has influenced the evolution of society's life in terms of economy, social, culture, and education, with the aim of facilitating daily activities, one of which is the ease of transactions using e-wallets. An e-wallet is a payment tool that uses a server-based system. Many factors influence a person's decision to use an e-wallet as a payment method, one of which is the level of security. To identify the factors that affect someone's use of e-wallets, one method is Rating-Based Conjoint Analysis (RBC). Therefore, this study aims to determine what influences a person to use an e-wallet, with the subjects being active students of the Statistics Department at Padang State University. The results of this RBC study indicate that the most influential factor on the e-wallet preferences of statistics students is security level, with a value of 37.70%, followed by transaction speed 23.17%, transfer fees at at 23.07%, features provided at 11.78%, and the least influential factor being promotions at 4.28%.
Early Marriage Factors Indonesian Using Spatial Regression Analysis permana, yazid; Dina Fitria; Yenni Kurniawati; Fadhilah Fitri
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/239

Abstract

Marriage is a sacred union recognized socially and religiously to form a family, as regulated by Law No. 16 of 2019. The percentage of early marriages in Indonesia continues to rise, reaching 21.5% in 2022, placing Indonesia 8th in the world according to UNICEF 2023 data. The increase in early marriages has significant impacts on maternal and child health and often leads to high divorce rates, with 516,334 cases in 2022. The aim of this research is to provide information and knowledge for students about early marriage and spatial regression. The main factors influencing early marriages are low education levels, economic difficulties, and environmental factors. Research shows that early marriages are highest in Kalimantan and Sulawesi, with spatial effects influencing the percentage of early marriages between regions.Spatial regression analysis, such as the Spatial Autoregressive (SAR) model, is used to examine the interactions between regions affecting early marriage. Spatial autocorrelation tests and spatial dependency effects show a spatial dependency effect, making the SAR model with queen contiguity weights the most suitable. The resulting model is considered quite good considering the R-squared value of 40.97%. The best-formed model shows that the Open Unemployment Rate (TPT) of youth is a significant variable that greatly impacts the percentage of early marriages. Therefore, the central and provincial governments are expected to pay more attention to the open youth unemployment factor to control and reduce the rate of early marriages in Indonesia.
Regularized Ordinal Regression with LASSO: Identifying Factors in Students' Public Speaking Anxiety at Universitas Negeri Padang Natasya Dwi Ovalingga, natasyalinggaa; Nonong Amalita; Yenni Kurniawati; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/316

Abstract

Public speaking anxiety is a common issue faced by students, particularly in academic settings. It may arise from a range of factors, including humiliation, physical appearance, preparation, audience interest, personality traits, rigid rules, unfamiliar role, negative result, and mistakes. This research seeks to determine the factors influencing different levels of public speaking anxiety among students at Universitas Negeri Padang through the application of ordinal regression with LASSO regularization. This method allows for automatic selection of significant variables and addressesmulticollinearity issues. The results indicate that eight factors influence low public speaking anxiety levels, while only six factors impact high public speaking anxiety levels. The ordinal regression model with LASSO penalty demonstrates good performance in classifying public speaking anxiety levels, achieving an accuracy of 71.33%. This study is expected to help students and educators better understand and manage public speaking anxiety, thereby enhancing public spekaing competence among students
Perbandingan Metode Naïve Bayes Dan K-Nearest Neighbors Dalam Mengklasifikasikan Indeks Pembangunan Manusia Menurut Kabupaten/ Kota di Indonesia Tahun 2022 Anggara, Rudi; Tessy Octavia Mukhti; Yenni Kurniawati; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/319

Abstract

The Human Development Index (HDI) is an indicator used to measure the success of efforts to improve the quality of human life in a particular region. Indonesia's HDI has increased every year, but the HDI in several districts/cities in Indonesia remains in the low category. The low HDI in these districts/cities is due to unequal development between regions in Indonesia. This disparity in development is influenced by HDI indicators as well as other factors. To address this issue, a decision system is needed to determine HDI categories using the Naive Bayes and KNN methods. Naive Bayes is applied with the assumption of Gaussian distribution, while KNN is implemented with the optimization of the nearest K value. Model performance evaluation is conducted to determine the best accuracy of the two methods using a confusion matrix. The analysis results show that the Naïve Bayes model outperforms the KNN algorithm in classifying the Human Development Index (HDI) by district/city in Indonesia for the year 2022, with Naïve Bayes achieving an accuracy of 93%. Therefore, the Naïve Bayes algorithm show good performance in terms of accuracy.
Mapping Indonesian Provinces Based on Leading Plantation Commodities with Export Potential Using Multidimensional Scaling Analysis Putri Yeni, Dicha; Tessy Octavia Mukhti; Yenni Kurniawati; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/327

Abstract

Indonesia, as an agrarian country, benefits significantly from its plantation subsector, which contributes substantially to the national economy. However, the processing of plantation products in Indonesia remains largely limited to raw or semi-finished goods, resulting in low added value and restricted income for both farmers and the nation. This study aims to map Indonesia's provinces based on the production of key plantation commodities with high export potential, utilizing the Multidimensional Scaling (MDS) analysis method. The research focuses on commodities such as pepper, palm oil, coconut, rubber, coffee, cocoa, clove, and tea. It seeks to group 34 Indonesian provinces based on similarities in plantation production, providing valuable insights for policymakers to enhance production and increase export value. The analysis calculates inter-provincial similarities to determine distances between objects and evaluates the accuracy of the MDS mapping using STRESS and R2 values. The findings indicate that 12 provinces share similarities in cocoa production, while 7 provinces are closely aligned in the production of pepper, rubber, and coffee. Furthermore, 5 provinces exhibit similarities in palm oil production, and 9 provinces demonstrate commonalities in the production of coconut, clove, and tea. The analysis achieved a STRESS value of 0.024 (2.4%) and an R2 value of 0.9994, indicating that the MDS mapping is highly reliable. However, the results do not fully align with field data, suggesting the need for orthogonal transformation through Principal Component Analysis (PCA) to improve accuracy.
Analisis Pemilihan Model Regresi Konversi Metanol Berdasarkan Suhu, Waktu Tinggal, Konsentrasi, Rasio Oksigen, dan Sistem Reaktor Marvero, Andre; Amri, Fahmi; Fadhil Irsyad, Muhammad; Kurniawati, Yenni
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/339

Abstract

This study aims to determine the best regression model that explains the effect of temperature, residence time, methanol concentration, oxygen to methanol ratio, and reactor system on methanol conversion in supercritical water. Preliminary analysis showed a violation of the multicollinearity assumption, which affected the validity of the multiple linear regression model. To overcome this and determine the optimal model, variable selection was performed using the stepwise selection method. This method was evaluated based on predictive power, model accuracy and statistical validity. The results showed that the stepwise method produced an optimal model in predicting conversion. Reactor system and temperature were the most significant variables affecting methanol conversion. The conclusion of this study shows that the variable selection approach with stepwise selection can be effectively used to identify the best regression model, when classical assumptions are met. These findings make an important contribution to the optimization of supercritical water-based chemical processes.
Co-Authors Aditya, Muhammad Fadhil Aditya Admi Salma Afifa Lufti Insani AL Rezki Ivansyah Alya Aufa, Wafiq Amelia Susrifalah Anang Kurnia Anggara, Rudi Anggi Adrian Danis Anita Fadila Anjelisni, Nining Annisa Ramadhani Aprotama, Celsy Ardhi, Sonia Ardiyatul Putri Arnellis Arnellis arrahmi, nailul Atus Amadi Putra Aulia Wanda Aulia, Yuke Aurumnisva Faturrahmi Baehaqi Berliana Nofriadi Bimbim Oktaviandi Celsy Aprotama Chairina Wirdiastuti Cindy Caterine Yolanda Darwas Deska Warita Devi Yopita Sipayung Dewi Murni Dina Fitria Dina Fitria Dina Fitria, Dina Disti Harlin Diva Aliyah Dodi Vionanda Dony Permana Dwi Sulistiowati, Dwi Elfiani Sarian Bur Elfin Innaka Hamidah Elza Vinora Eujenniatul Jannah Fadhil Irsyad, Muhammad Fadhilah Fitri Fahmi Amri, Fahmi Fashihullisan Fayyadh Ghaly Fayza Annisa Febrianti Febiola Putri, Febi Fitri, Fadhilah Fitri, Fitri Hayati fitri, silfia wisa Ghaly, Fayyadh Hadiyanti Riskha harelvi, dhea afrila Harpidna, Riska Harpidna Hary Merdeka Helma Helma Helma Helma Hendrawan, Muhammad Ihsan Dermawan Irwan Irwan Khairani, Putri Rahmatun Kusman Sadik Lutfian Almash M Fathoni Arnas Manja Danova Putri Marvero, Andre Maya Ifra Shobia Meira Parma Dewi Minora Longgom Nasution Muhammad Arief Rivano Mukhti, Tessy Octavia NA Mentacem Natasya Dwi Ovalingga, natasyalinggaa Nofriadi, Berliana Nonong Amalita Oktaviani, Bernadita Permana, Dony permana, yazid Prida Nova Sari Putra, Dio Afdal Putri Yeni, Dicha Putri, Fadhira Vitasha Rahma, Dzakyyah rahmad revi fadillah Revina Rahmadani Rizki Amalia, Annisa Rizkiah, Niswatul Ronald Rinaldo Rosa Salsabila Azarine Salma, Admi Salsabilla Khairani Sasmita, Riza Sepniza Nasywa Septrina Kiki Arisandi Silvia Triana Siskha Maulana Basrul Siti Nurhaliza Sondriva, Wilia SRI RAHAYU Sri Wahyuni Susrifalah, Amelia Syafriandi Syafriandi Syafriandi Syafriandi Tessy Octavia Mukhti Tsani, Nahda Maesya Wimmi Sartika Windi Dwi Saputra Wita, Wita Resfi Ananta Yunistika Ilanda Zahrani Asyati Zulika Zamahsary Martha Zilrahmi, Zilrahmi