cover
Contact Name
Tessy Octavia Mukhti
Contact Email
tessyoctaviam@fmipa.unp.ac.id
Phone
+6282283838641
Journal Mail Official
tessyoctaviam@fmipa.unp.ac.id
Editorial Address
LPPM Universitas Negeri Padang, Jalan Prof. Dr. Hamka, Air Tawar Barat, Kota Padang, Sumatera Barat 25131
Location
Kota padang,
Sumatera barat
INDONESIA
UNP Journal of Statistics and Data Science
ISSN : -     EISSN : 2985475X     DOI : 10.24036/ujsds
UNP Journal of Statistics and Data Science is an open access journal (e-journal) launched in 2022 by Department of Statistics, Faculty of Science and Mathematics, Universitas Negeri Padang. UJSDS publishes scientific articles on various aspects related to Statistics, Data Science, and its application. Articles can be in the form of research results, case studies, or literature reviews. All papers were reviewed by peer reviewers consisting of experts and academicians across universities.
Articles 202 Documents
Penerapan Metode Choice-Based Conjoint Analysis pada Preferensi Pekerjaan Mahasiswa Departemen Statistika Universitas Negeri Padang Putra, M. Farel Rusde; Dodi Vionanda; Dony Permana; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/221

Abstract

In the realm of psychology studies, it is widely assumed that the age range between 18 and 25 represents a critical period during which individuals preferences begin to take shape. This developmental phase encloses college students who despite their academic pursuits, remain relatively unfamiliar with the dynamic job market, particularly in the context of rapid technological advancements. Statistics as a discipline with broad applicability across both social and scientific domains, offers student of statistics significant career prospects. This research would likely estimate the job preferences of statistics students using one of the most common use methods called choice-based conjoint (CBC) analysis. The analysis reveals that work hours were the most substantial influence on statistics students’ job preferences, with a percentage of 40.29%. In addition, other factors that influence the preferences of statistics students are such as first salary (36.87%), correlation with the field of statistics (12.04%), work environment (7.18%), and type of workplace (3.62%).
Penerapan Metode Rating-Based Conjoint Analysis dalam Preferensi E-Wallet Mahasiswa Departemen Statistika Universitas Negeri Padang Putra, Dio Afdal; Dodi Vionanda; Yenni Kurniawati; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/222

Abstract

The rapid development of technology in the era of globalization has influenced the evolution of society's life in terms of economy, social, culture, and education, with the aim of facilitating daily activities, one of which is the ease of transactions using e-wallets. An e-wallet is a payment tool that uses a server-based system. Many factors influence a person's decision to use an e-wallet as a payment method, one of which is the level of security. To identify the factors that affect someone's use of e-wallets, one method is Rating-Based Conjoint Analysis (RBC). Therefore, this study aims to determine what influences a person to use an e-wallet, with the subjects being active students of the Statistics Department at Padang State University. The results of this RBC study indicate that the most influential factor on the e-wallet preferences of statistics students is security level, with a value of 37.70%, followed by transaction speed 23.17%, transfer fees at at 23.07%, features provided at 11.78%, and the least influential factor being promotions at 4.28%.
PT.Telkom (Tbk) Stock Price Forecasting Using Long Short Term Memory (LSTM) nazhiroh, hanifah; Dina Fitria; Dony Permana; Zilrahmi
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/223

Abstract

The movement of the share price of PT Telkom (Tbk) fluctuates so it is necessary to do a forecasting analysis. Forecasting the share price of PT Telkom (Tbk) can be done using the Long Short Term Memory (LSTM) method. LSTM is a development of the Recurrent Neural Network (RNN) method. In this study using PT.Telkom (Tbk) stock price data for 2018-2023 and PT.Telkom (Tbk) stock price data after Covid-19 (20121-2023). The purpose of this research is to determine the movement of PT.Telkom (Tbk) stock prices in 2024, to find out the difference in forecasting using PT.Telkom (Tbk) 2018-2023 stock price data with PT.Telkom (Tbk) stock price data after covid-19 2021-2023, and to determine the level of accuracy of forecasting PT.Telkom (Tbk) stock prices using the LSTM method. The results showed that both data have a small MAPE value. to forecast the share price of PT.Telkom for 1 year, PT.Telkom (Tbk) share price data for 2018-2023 is used which has more data to analyze long-term forecasting. From the analysis results obtained MAPE of 1.016% with the optimal parameter combination of neuron 4, batch size 64, and epoch 80. The results of forecasting the share price of PT.telkom (Tbk) in 2024 experienced very rapid fluctuations with an average share price of PT.Telkom (Tbk) in 2024 Rp 4,668 / sheet.
Implementation of CART Method with SMOTE for Household Poverty Classification in Mentawai Islands 2023 Dewi Adiningtiyas, Rheizma; Admi Salma; Syafriandi Syafriandi; Fadhilah Fitri
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/232

Abstract

Poverty is a condition in which individuals or groups are unable to fulfill their basic needs due to economic pressure or limited resources. The Classification and Regression Trees (CART) method is a classification technique in the form of a classification tree, which describes the relationship between independent and dependent variables. Data imbalance can lead to low sensitivity values and area under curve (AUC) values. One method that can overcome unbalanced data is to perform Synthetic Minority Oversampling Technique (SMOTE). SMOTE is a technique with the addition of artificial data in the minority class at a stage before analyzing the data. The purpose of this research is to compare the model without and with SMOTE in CART method. The use of SMOTE is applied to balance the amount of data on each poor household. The accuracy value of the method without SMOTE is 89% while with the SMOTE method is 79%. However, the sensitivity value has increased by 80%. Meanwhile, the AUC value in the CART method with SMOTE increased by 31%. So in this study it can be concluded that CART classification analysis with SMOTE is able to provide better performance compared to CART classification analysis without SMOTE.
Analysis of Factors Influencing the Number of Families at Risk of Stunting in Merangin Regency Using Mixed Geographically Weighted Regression Fadlan Rafly, Muhammad; Zilrahmi; Dony Permana; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/236

Abstract

The number of families at risk of stunting is among the significant concerns that have been a negative impact on developing superior human resources in Merangin Regency. The number of families at risk of stunting is sought to be solved by identifying the contributing components. MGWR is among the methods that may be employed to obtain a specific model that affects each obesrvasion location locally and a comprehensive model that is global. Multiple linear regression and GWR are used to create models MGWR used when data has the influence of spatial heterogeneity. This project aims to develop an MGWR model which will be used to calculate the amount families at risk of stunting in each sub-district in Merangin Regency who are at risk of stunting in 2022. A fixed gaussian kernel weighting matrix is used in MGWR modeling. At the very least CV of 0.6152241, A fixed gaussian kernel is utilized as the weighting function. The results indicate that the model obtained has an accuracy rate of 99.18%, which means that the predictor variables can explain the model by that percentage. Families with insufficient access to drinking water is one factor that significantly affects how many families are at risk of stunting, families with inadequate sanitation, maternal age less than 20 years and families with babies under five years old.
Early Marriage Factors Indonesian Using Spatial Regression Analysis permana, yazid; Dina Fitria; Yenni Kurniawati; Fadhilah Fitri
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/239

Abstract

Marriage is a sacred union recognized socially and religiously to form a family, as regulated by Law No. 16 of 2019. The percentage of early marriages in Indonesia continues to rise, reaching 21.5% in 2022, placing Indonesia 8th in the world according to UNICEF 2023 data. The increase in early marriages has significant impacts on maternal and child health and often leads to high divorce rates, with 516,334 cases in 2022. The aim of this research is to provide information and knowledge for students about early marriage and spatial regression. The main factors influencing early marriages are low education levels, economic difficulties, and environmental factors. Research shows that early marriages are highest in Kalimantan and Sulawesi, with spatial effects influencing the percentage of early marriages between regions.Spatial regression analysis, such as the Spatial Autoregressive (SAR) model, is used to examine the interactions between regions affecting early marriage. Spatial autocorrelation tests and spatial dependency effects show a spatial dependency effect, making the SAR model with queen contiguity weights the most suitable. The resulting model is considered quite good considering the R-squared value of 40.97%. The best-formed model shows that the Open Unemployment Rate (TPT) of youth is a significant variable that greatly impacts the percentage of early marriages. Therefore, the central and provincial governments are expected to pay more attention to the open youth unemployment factor to control and reduce the rate of early marriages in Indonesia.
Classification of Poor Households in Padang City Using the Naïve Bayes Algorithm with Synthetic Minority Oversampling Technique kartika, anice; Dina Fitria; Syafriandi Syafriandi; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/241

Abstract

Poverty is a condition where a person is unable to meet minimum basic needs or a condition caused by the influence of development policies that have not been able to reach all levels of society. In Indonesia, the government has designed various programs to overcome poverty, but these programs are often not on target. One method to improve the effectiveness of the program is through proper classification of poor and non-poor households. This study uses the Naïve Bayes classification method which is popular in data mining to predict data categories based on the probability distribution of its features. However, challenges arise when the data is unbalanced between different classes. To overcome this, the Synthetic Minority Oversampling Technique (SMOTE) method is used to balance the data. Based on the analysis that has been carried out To determine the performance of Naïve Bayes using SMOTE and without SMOTE in classifying poor households in Padang City in 2023, classification using the Naïve Bayes method without SMOTE produced an accuracy value of 98%, precision of 0%, and recall of 0%. Meanwhile, the classification using the Naïve Bayes method with SMOTE produces an accuracy value of 90%, precision of 87%, and recall of 92% and the results of the criteria for poor households in Padang City in 2023 using Naïve Bayes can be seen from the results that the probability of poor households is much greater than that of non-poor households, therefore the data is classified as  group of households that are classified as poor.
Library Book Lending Recommendation Using Association Rules with Frequent Pattern Growth (FP-Growth) Algorithm Kamil, Fakhri; Dony Permana; Dodi Vionanda; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/284

Abstract

College libraries are libraries managed by higher education institutions such as university libraries. The library functions as an information center management forum for students which includes learning resource functions, access functions, librarian functions, ethical functions, and evaluation functions.  Students prefer to read through e-books rather than reading books or library collections. Limited knowledge of literature is the cause of students choosing to look for books on search engines rather than in the library. Managed book loan circulation history data will be able to improve library services that can assist in finding library collections. Book recommendation services using association rules, can find patterns of borrowing behavior of book titles that have the highest association as the most recommended titles to be borrowed together. The FP-Growth or Frequent Pattern Growth is an algorithm of associations rule that is able to generate association rules as personalized book borrowing recommendations. The results of book recommendations found as many as 50 rules that meet the chi-square assumption test where the recommendation items are independent. The results of 50 rules for book title choices that can be used by students as suggestions for determining books that have a relationship to be borrowed together to enrich references. For students who wish to borrow the books 'Professional Teacher: Mastering Teaching Methods and Skills' is recommended to also borrow the book 'Participatory Learning Methods and Techniques'. With the book recommendation service, the library provides advice to students in choosing related book titles to borrow at the library.
Prediksi Harga Emas Dunia Menggunakan Metode k-Nearest Neighbor Nanda P, Muhamad Rayhan; Zamahsary Martha; Dodi Vionanda; Admi Salma
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/314

Abstract

This research aims to predict world gold prices using the k-nearest neighbor (KNN) method with secondary data from the London Bullion Market Association (LBMA) in the form of monthly time series data from January 2019 to December 2023. In the analysis process, the data is divided into two parts: 80% for training data (January 2019 - December 2022) and 20% for testing data (January - December 2023). The analysis results show that the Mean Absolute Percentage Error (MAPE) value of the KNN method is 4.5%, which indicates a very good level of accuracy. With a MAPE below 10%, the KNN model is proven to be able to accurately predict world gold prices. Gold price predictions for the period January to December 2024 show a consistent upward trend, which is influenced by factors such as global economic fluctuations, increased gold demand, and geopolitical uncertainty. These results show that the KNN model is reliable as a tool for forecasting future world gold prices.
Perbandingan Analisis Diskriminan Kuadratik dengan Analisis Diskriminan Kuadratik Robust martha, Ully Martha; Dodi Vionanda; Dony Permana; Zilrahmi
UNP Journal of Statistics and Data Science Vol. 2 No. 4 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss4/315

Abstract

This study compared the performance of quadratic discrimination analysis and robust quadratic discrimination analysis using the Iris dataset from Kaggle. The robust quadratic discriminant analysis, designed to handle outliers and non-normal distributions, shows better performance with an Apparent Error Rate (APER) of 2.5%. In contrast, the quadratic discriminant analysis, used for data with multivariate normal distribution and different variance-covariance matrices among groups, yields an APER of 3.03%. These results indicate that robust quadratic discriminant analysis is more accurate in classification on this dataset compared to quadratic discriminant analysis. Keywords: Apparent Error Rate, Quadratic Discrimination Analysis, Robust Quadratic Discrimination Analysis