cover
Contact Name
Tessy Octavia Mukhti
Contact Email
tessyoctaviam@fmipa.unp.ac.id
Phone
+6282283838641
Journal Mail Official
tessyoctaviam@fmipa.unp.ac.id
Editorial Address
LPPM Universitas Negeri Padang, Jalan Prof. Dr. Hamka, Air Tawar Barat, Kota Padang, Sumatera Barat 25131
Location
Kota padang,
Sumatera barat
INDONESIA
UNP Journal of Statistics and Data Science
ISSN : -     EISSN : 2985475X     DOI : 10.24036/ujsds
UNP Journal of Statistics and Data Science is an open access journal (e-journal) launched in 2022 by Department of Statistics, Faculty of Science and Mathematics, Universitas Negeri Padang. UJSDS publishes scientific articles on various aspects related to Statistics, Data Science, and its application. Articles can be in the form of research results, case studies, or literature reviews. All papers were reviewed by peer reviewers consisting of experts and academicians across universities.
Articles 202 Documents
Diagnosis of the type of delivery of pregnant women at Semen Padang Hospital Using the C4.5 Method rama novialdi; Dony Permana; Dodi Vionanda; Fadhilah Fitri
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/130

Abstract

The health of the mother and fetus is very important, but there are many challenges and risks associated with pregnancy and childbirth. According to WHO, in 2020 there were 287,000 cases of women dying during pregnancy and childbirth. Causative factors that affect the type of delivery include the age of pregnant women, MGG, systole, diastole, and pulse. One method that can be used to group the types of childbirth of pregnant women is classification. C4.5 is one of the methods used in forming decision trees to produce decisions. The purpose of C4.5 is to obtain attributes that will be the main criteria in the classification. Based on optimal tree results, the attribute that is the main criterion in classifying the type of delivery of pregnant women who give birth by caesar section and normal delivery at Semen Padang Hospital is MGG. Determination of classification results using confusion matrix resulted in an accuracy value of 74%, sensitivity of 80% to classify the type of delivery of pregnant women who gave birth caesar, and specificity of 66.67% to classify the type of delivery of pregnant women who gave birth normally.
Structural Equation Modeling Partial Least Square (SEM-PLS) Untuk Membandingkan Kondisi Public Speaking Anxiety Mahasiswa Soshum dan Saintek Sabina Chairun Najwa; Natasya Dwi Ovalingga; Hanifah Nazhiroh; Rizki Akbar; Fadhilah Fitri
UNP Journal of Statistics and Data Science Vol. 1 No. 5 (2023): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol1-iss5/132

Abstract

Public speaking is a communication skill to deliver opinion or massage to the audience. Public speaking anxiety, caused by various factors. Social and science students have differences in culture and learning systems. Therefore, students in both educational clusters have their own ways of overcoming communication barriers. This study aimed to identify factors that influence public speaking anxiety in social and science students at Padang State University. The method used is the Structural Equation Model Partial Least Square (SEM-PLS) to understand the influential factors in more detail and minimize analysis errors caused by missing values and multicollinearity due to diverse samples. The results of the analysis are path diagrams for structural models and outer loading tables. If the < value is 0.7, then recalculation is carried out so that a new model is formed. The feasibility of the social science family model was obtained 35% and the scientific science family was 36.5%. The effect of latent or exogenous variables in this study is weak. Social students have higher levels of speech anxiety than science students. This is influenced by humiliation, unfamiliar role, and negative result factors. In science students, the influencing factors are humiliation, preparation, and unfamiliar Role.
Classification of Stroke Disease at Dr. Drs. M. Hatta Brain Hospital Bukittinggi With Decision Tree Algorithm C4.5 Futiah Salsabila; Zamahsary Martha; Atus Amadi Putra; Admi Salma
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/135

Abstract

Stroke is a health condition that has vascular disorders where brain  function is related to problems with blood vessels that carry blood to the brain. Several factors that can influence stroke include unhealthy eating habits, lack of physical activity, smoking behavior, alcohol consumption, and obesity. The symptoms experienced are headache, nausea, vomiting, blurred vision and difficulty swallowing. The researcher’s aim is to determine the risk faktors that affect the incidence of stroke hospitalization based on stroke diagnoses at Rumah Sakit Otak Dr. Drs. M. Hatta Bukittinggi city by classifying each variable using a decision tree. A decision tree is a flowchart that resembles a branching tree. The C4.5 algorithm is used in this research, which can process numerical and categorical data, can handle missing attribute values, and produces rules that are easy to interpret. The results of the analysis show that the attribute that is a risk factor for stroke is the heart. The model created using the C4.5 algorithm was tested using a counfusion matrix resulting in an accuracy of 64.54%, a precision of 53.34% for classifying ischemic stroke patients correctly, and a recall of 72.73% for classifying hemorrhagic patients correctly.  
Prediction of Palm Oil Production Results PT.KSI South Solok Using Ensemble k-Nearest Neighbor Nilda Yanti; Atus Amadi Putra; Dony Permana; Zilrahmi
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/136

Abstract

PT. KSI experienced production decrease that the cause of replanting that happened in 2022. In managing palm oil production PT. KSI has problems with palm oil production results not reaching out the targets so it can affect the Company's Work Plan and Budget, therefore it is very necessary to predict palm oil production results so that all palm oil production and processing activities can run according to plan. The ensemble technique is a method that is capable of making accurate predictions and is used very effectively in the kNN method, therefore there is no need to search for the best k value.Based on the results of the analysis that has been carried out, it can be seen that by using an ensemble the level of accuracy is 9.36%, which is considered high accuracy compared to just using a single kNN with k = 1 of 10.84%. So it can be concluded that the model has worked well with the data.
Implementation of an Artificial Neural Network Based on the Backpropagation Algorithm in Forecasting the Closing Price of the Jakarta Composite Index (IHSG) Aditya, Muhammad Fadhil Aditya; Zilrahmi; Yenni Kurniawati; Tessy Octavia Mukhti
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/137

Abstract

Investing is highly common in Indonesia. Continuous investment activities carried out by the community will increase economic activity and employment opportunities, increase national income, and increase the level of prosperity of the community. In carrying out share buying and selling transactions, there is a means for companies to obtain funds from official financiers or investors, which is called the capital market. One of the indices issued by the IDX is the Jakarta Composite Index (IHSG). Statistics can be used to help investors, the government, or related institutions to predict the value of the IHSG. One method that can be used to predict data is an Artificial Neural Network (ANN). Backpropagation method is a multi-layer ANN method that works in a supervised learning. The idea of the Backpropagation algorithm is that the input of the neural network is evaluated against the desired output results. The purpose of this research is to give forecasting values with high accuracy to describe the movement of IHSG close price values using the ANN method based on the Backpropagation algorithm. The research showed that the BP (4,6,1) model produced an RMSE value of 28,24024 and a MAPE value of 0.00342%. Based on the results of this research, an Artificial Neural Network model based on the Backpropagation Algorithm can be applied to predict the IHSG Closing Price value.
Comparison of the C5.0 Algorithm and the CART Algorithm in Stroke Classification Indah Lestari; Dina Fitria; Syafriandi Syafriandi; Admi Salma
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/144

Abstract

The C5.0 and CART algorithms are similar in terms of velocity and handling of categorical and numeric type data. However, these two algorithms are differences in terms the CART algorithm is binary and classifies categorical, numerical and continuous response variables resulting in classification and regression decision trees. Meanwhile, the C5.0 algorithm is non-binary and classifies categorical response variables resulting in a classification tree. This research aims to classify the Kaggle’s Stroke Prediction Dataset to find out the variables that most influence the risk of stroke, as well as to compare the results of the classification accuracy of the both algorithms. The results of the study showed that CART algorithm has a higher value of accuracy and precision, but its recall value is lower than C5.0. The accuracy value of each algorithm is 77.9% and 77.5%, presision is 89.5% and 83.2%, recall is 67% and 71.4%. Overrall, it can be concluded that there is no difference in classification between the two algorithm. Beside that, in the CART there were 3 variables that most influence on stroke risk, they are age, BMI, and average blood glucose levels. Meanwhile, in C5.0 only 2 variable that most influence, there are age and average blood glucose levels.
Forecasting Gold Prices in Indonesia using Support Vector Regression with the Grid Search Algorithm Syahfitrri, Nindi; Nonong Amalita; Dodi Vionanda; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/145

Abstract

Investment is an effort to increase economic growth in Indonesia.  A popular investment in the community is gold investment.  The value of gold investments tends to increase but is not immune from price fluctuations, therefore it is important to forecast the price of gold in Indonesia. The method that can be used to make this forecast is Support Vector Regression (SVR).  SVR is a method that looks for a function that has a deviation of no more than ε to get the target value from all training data. The best SVR model with a linear kernel was obtained from a combination of parameters C=0,0625 and ε=0,001 with a RMSE value of 0,19734 and a value of 0,974112.  So, the SVR method is appropriate to use for forecasting gold prices in Indonesia.
Sentiment Analysis about Anti-LGBT Campaign using the Naïve Bayes Classifier rios; Syafriandi Syafriandi; Dony Permana; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/146

Abstract

Social media is growing so that the news that is discussed is also very fast to be known by everyone. The news or topic that is being discussed on social media is the anti-LGBT campaign. The conversation about the anti-LGBT campaign is expressed in the form of opinions that contain positive and negative feelings. The opinion is conveyed through Twitter. Twitter is a microblogging social media site that allows users to create short messages and share them easily and quickly. Opinions on Twitter are used to see whether the opinion rejects or supports the anti-LGBT campaign. The use of sentiment analysis helps to see the opinion supports or rejects the anti-LGBT campaign. The algorithm used to perform sentiment analysis is the Naïve Bayes Classifier. The purpose of this study is to determine the sentiment analysis of anti-LGBT campaign tweets on Twitter. This study using Phython as the tools. The dataset used is 3103 tweets with 80% training data and 20% test data. The sentiment analysis results obtained in this study show that Twitter users in Indonesia have more positive opinions. The use of the Naïve Bayes Classifier algorithm produces an accuracy of 68,75%, precision of 99,6%, and recall of 92,8%.
Sentiment Analysis of DANA Application Reviews on Google Play Store Using Naïve Bayes Classifier Algorithm Based on Information Gain Cindy Caterine Yolanda; Syafriandi Syafriandi; Yenni Kurniawati; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/147

Abstract

DANA is a digital payment platform that provides various features to make it easier for users to make payments, transfers, and balance replenishment online. DANA application users provide a variety of reviews that include both constructive and critical opinions, which can be valuable input for DANA application developers. The purpose of this research is to evaluate the results of sentiment classification of DANA application user reviews on the Google Play Store service using the Naïve Bayes Classifier method and Information Gain feature selection. In addition, this study aims to assess the effect of applying IG feature selection on the performance of the resulting model. In this study, reviews are divided into two categories, namely positive and negative based on lexicon-based labeling. Furthermore, data weighting, feature selection, and data division are carried out with a proportion of 80% train data and 20% test data before model building. There are two models, namely a model without feature selection (NBC model) and a model with feature selection (NBC-IG model). The evaluation results showed that the NBC model with 1106 features performed well, with 82.91% accuracy, 83.96% precision, and 90.23% recall. Meanwhile, the NBC-IG model with 536 features showed higher performance, with 85.09% accuracy, 85.79% precision, and 92.09% recall. The application of IG feature selection with the IG value limit parameter > 0.01 in the NBC model successfully reduced the number of features by 570, and improved model performance with an increase in accuracy by 2.18%, precision by 1.83%, and recall by 1.86%.
Twitter Data Sentimen Analysis 2024 Presidential Candidate Using Algorithm Naïve Bayes Classifier By Methods K-Fold Cross Validation Aldi Prajela; Syafriandi Syafriandi; Dony Permana; Dina Fitria
UNP Journal of Statistics and Data Science Vol. 2 No. 1 (2024): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol2-iss1/149

Abstract

Indonesia implements a democratic system by involving the public in General Elections (Pemilu) for specific political positions. The active community expresses opinions on social media, especially regarding the 2024 Presidential Election (Pilpres) and respective presidential candidates, which have become trending topics on Twitter. The analysis used to absorb these tweets into information is sentimen analysis using the Naïve Bayes Classifier algorithm with the K-fold Cross-Validation method. Through the stages of pre-processing, weighting, labeling, classification using NBC, and testing using a Confusion Matrix, The results of the classification from NBC showed that Anies got 80% positive tweets and 20% negative tweets from 1186 tweets, Prabowo Subianto got 78% positive tweets and 22% negative tweets from 1149 tweets, and Ganjar Pranowo got 77% positive tweets and 23% negative tweets from 1075 tweets. Testing process was carried out using the NBC algorithm with the K-Fold Cross Validation method using values k=1 to k=10. The function of K-Fold Cross Validation is to maximize the confusion matrix result. It can be conclude that Anies Baswedan has the highest score in iteration 4, namely a precision value of 90%, a recall value of 99%, and the accurary value of 91%. Furthemore, Ganjar Pranowo had the highest score in iteration 9, namely a precision value of 95%,a recall value of 97%, and an accuracy value of 92%. Meanwhile, Prabowo Subianto had the highest score in iteration 9, namely a precision value of 97%, a recall value of 99%, and an accuracy value of 94%.

Page 8 of 21 | Total Record : 202