cover
Contact Name
Tessy Octavia Mukhti
Contact Email
tessyoctaviam@fmipa.unp.ac.id
Phone
+6282283838641
Journal Mail Official
tessyoctaviam@fmipa.unp.ac.id
Editorial Address
LPPM Universitas Negeri Padang, Jalan Prof. Dr. Hamka, Air Tawar Barat, Kota Padang, Sumatera Barat 25131
Location
Kota padang,
Sumatera barat
INDONESIA
UNP Journal of Statistics and Data Science
ISSN : -     EISSN : 2985475X     DOI : 10.24036/ujsds
UNP Journal of Statistics and Data Science is an open access journal (e-journal) launched in 2022 by Department of Statistics, Faculty of Science and Mathematics, Universitas Negeri Padang. UJSDS publishes scientific articles on various aspects related to Statistics, Data Science, and its application. Articles can be in the form of research results, case studies, or literature reviews. All papers were reviewed by peer reviewers consisting of experts and academicians across universities.
Articles 202 Documents
Implementation of Association Rule on Agricultural Commodity Exports in Indonesia Using Apriori Algorithm Dinul Haq, Asra; Fitria, Dina; Dony Permana; Zamahsary Martha
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/336

Abstract

Exports of agricultural commodities in Indonesia have the smallest contribution to state revenues and the movement of export values ​​in the last decade has not shown a significant increase compared to other export sectors. This shows that there are weaknesses in the export of agricultural commodities so that an analysis is needed to optimize export results to other countries. These weaknesses can be seen in terms of quality, price, infrastructure and technology. This study uses association rule analysis with the apriori algorithm with the aim of finding out what agricultural commodities are exported simultaneously and the resulting association rules. The apriori algorithm is an algorithm used to find association rules between items in a database by considering two main parameters, namely Support and Confidence. The data used is agricultural commodity export data obtained from the publication of the Central Statistics Agency in Indonesia in 2023. Based on the analysis carried out, there are 32 association rules generated with a minimum Support of 25% and a minimum Confidence of 80%. Then after the Lift Ratio test was carried out, all the rules generated met the Lift Ratio test with a value of more than 1. The association rules produced must have at least 2 to 4 agricultural export commodities in each rule. By knowing the association rules for agricultural commodity exports, it is hoped that export distribution in the agricultural sector can be further optimized for trading abroad so that it can cover existing weaknesses.
Analisis Pemilihan Model Regresi Konversi Metanol Berdasarkan Suhu, Waktu Tinggal, Konsentrasi, Rasio Oksigen, dan Sistem Reaktor Marvero, Andre; Amri, Fahmi; Fadhil Irsyad, Muhammad; Kurniawati, Yenni
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/339

Abstract

This study aims to determine the best regression model that explains the effect of temperature, residence time, methanol concentration, oxygen to methanol ratio, and reactor system on methanol conversion in supercritical water. Preliminary analysis showed a violation of the multicollinearity assumption, which affected the validity of the multiple linear regression model. To overcome this and determine the optimal model, variable selection was performed using the stepwise selection method. This method was evaluated based on predictive power, model accuracy and statistical validity. The results showed that the stepwise method produced an optimal model in predicting conversion. Reactor system and temperature were the most significant variables affecting methanol conversion. The conclusion of this study shows that the variable selection approach with stepwise selection can be effectively used to identify the best regression model, when classical assumptions are met. These findings make an important contribution to the optimization of supercritical water-based chemical processes.
Classification of Factors Affecting Preeclampsia in Pregnant Women at RSUP. Dr. M. Djamil Padang using the CART Algorithm YUSWITA, AULIA; Dina Fitria; Dony Permana; Admi Salma
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/341

Abstract

Preeclampsia is a pregnancy-specific disease characterized by hypertension and proteinuria that occurs after 20 weeks of gestation. Preeclampsia itself is caused by various factors that can influence the occurrence of preeclampsia in pregnant women, including age, parity, history of hypertension, obesity, and kidney disorders. This study aims to determine the risk factors influencing preeclampsia based on preeclampsia diagnosis at RSUP Dr. M. Djamil Padang by classifying each variable using a decision tree. This research employs the CART (Classification and Regression Tree) algorithm. The CART algorithm has a binary nature and can analyze response variables that are either categorical or continuous, handle data with missing values, and produce an interpretable tree structure. The study results indicate that the primary risk factor for preeclampsia is parity. The model developed using the CART algorithm was tested using a confusion matrix, yielding an accuracy of 54%, a precision of 33.3% in correctly classifying patients with mild preeclampsia (PER), and a recall of 23.8% in classifying patients with severe preeclampsia (PEB).
Analisis Sentimen Penggunaan Aplikasi YouTube Menggunakan Metode Naïve Bayes Putri, Triana; Siti Nurhaliza; Dodi Vionanda
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/343

Abstract

This study aims to analyze user sentiment towards the YouTube application using the Naive Bayes method. With the rapid growth of YouTube users worldwide, understanding user preferences and experiences is crucial. Sentiment analysis, a process of processing or extracting textual data to obtain information by categorizing positive or negative sentiment The Naive Bayes algorithm, a statistical approach commonly used in natural language processing and sentiment analysis, was applied due to its simplicity and efficiency. The research involved data collection through web scraping, followed by preprocessing steps such as cleaning, case folding, tokenization, stopword removal, and stemming. Feature selection was performed using TF-IDF (Term Frequency-Inverse Document Frequency) to assign weights to words based on their importance. The Naive Bayes classifier was then trained on the preprocessed data, and its performance was evaluated using accuracy, precision, recall, and F1-score metrics. The results showed an accuracy of 82%, precision of 83%, recall of 98%, and an F1-score of 89%, indicating the effectiveness of the Naive Bayes method in sentiment analysis for the YouTube application. This study provides valuable insights into user sentiment towards YouTube, enabling developers and content creators to enhance user experiences and marketing strategies.
Analisis Klaster K-means dalam mengelompokan Kabupaten/Kota di Provinsi Sumatera Barat Berdasarkan Jenis Kekerasan Terhadap Perempuan Tahun 2023 Febiola, Latifah Jayatri; Fadhilah Fitri; Fenni Kunia Mutiya
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/344

Abstract

Violence against women is a serious social issue and a violation of human rights. Women are often vulnerable to violence, whether physical, psychological, or sexual, which negatively impacts their physical and mental health. To understand the distribution of violence cases against women in West Sumatra Province, an analytical method is needed to classify regions based on the number of reported cases. K-Means Clustering is one of the clustering analysis methods used to group districts/cities based on similarities in the number of violence cases. This study aims to classify districts/cities in West Sumatra based on the number of female violence victims using the K-Means Clustering algorithm. The optimal number of clusters was determined using the silhouette method, resulting in three clusters. Cluster 3 has the highest average number of physical and sexual violence cases, consisting of four districts/cities: Solok Regency, Lima Puluh Kota, Solok City, and Payakumbuh City. Cluster 2 represents areas with a moderate level of violence, dominated by psychological abuse, and consists of five districts/cities. Meanwhile, Cluster 1 comprises ten districts/cities with the lowest recorded violence cases. This classification provides insight into the regional distribution of violence against women in West Sumatra, identifying areas that require more attention. The findings suggest that the government should prioritize regions with high levels of violence through stricter law enforcement, the provision of support services for victims, gender equality campaigns, and increased awareness of women's rights
Analisis Sentimen Masyarakat Terhadap Korupsi Berdasarkan Tweet Menggunakan Klasifikasi Naive Bayes Zulzila, Alivia; Latifah Jayatri Febiola; Dodi Vionanda
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/345

Abstract

Corruption is one of the big problems faced in Indonesia. The still high rate of corruption can damage the integrity of government, hamper economic growth, and reduce public trust in public institutions. Even though the government has made efforts to eradicate corruption, such as the formation of the Corruption Eradication Commission (KPK), these big challenges remain. Social media, especially Twitter, has become an important platform for people to voice opinions and criticize corruption issues. Sentiment analysis is used to detect opinions in the form of judgments, evaluations, attitudes and emotions of a person. The textual classification algorithm used in this research is Naive Bayes. This research aims to determine public sentiment towards corruption in Indonesia in positive, negative and neutral categories. This is done by data preprocessing, data labeling, and classification. The results of sentiment classification using the Naïve Bayes method obtained positive sentiment of 11, negative sentiment of 14, and neutral sentiment of 1485. So it can be concluded that Indonesian society tends to have neutral sentiments towards corruption that occurs in Indonesia
Analysis on Scopus Articles Padang State University Based on SINTA Website Aidillah, Kerin Hagia; Dodi Vionanda; Dony Permana
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/346

Abstract

Universities have the responsibility to carry out education, research, and community service as mandated by Law Number 20 of 2003 on the National Education System in Article 20. The flagship research theme set by Universitas Negeri Padang (UNP) for the period 2020-2024 is "Development of Digital Learning Services and Development of Minangkabau Cuisine based on Local Potential." The focus of the flagship research activities at Padang State University encompasses two main research areas: 1) Digital Learning Services; and 2) Minangkabau Cuisine. The objective of this research is to compare the flagship research theme with the Scopus articles from Universitas Negeri Padang on the SINTA website. By analyzing the trends of Scopus article topics on the SINTA website using web scraping techniques and wordcloud visualization, it is concluded that there is a match between the trending topics of UNP's Scopus articles and UNP's flagship research theme, particularly in the field of Digital Learning Services. From the wordcloud results, which show keywords such as Learning, Development, Student, and Model. This research allows us to easily observe from the wordcloud visualization the trend of research topics in Scopus articles on SINTA at Universitas Negeri Padang, reflecting the realization of Universitas Negeri Padang flagship research theme for the period 2020-2024
Analisis Sentimen Review Aplikasi Chatting di Google Play Store Menggunakan Alghoritma Naïve Bayes Classifer Alfathan, Muhammad Luthfi; Dodi Vionanda; Nufhika Fishuri
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/347

Abstract

Chatting application is a medium used to connect two or more people through social media platforms. Based on the results of the survey report, there are 5 chat applications that are often used as a medium of communication, including WhatsApp, Facebook, Telegram, Instagram and Line applications. This research aims to see the sentiment of chat application users, and see how naive bayes performs in analyzing the sentiment of chat application users. The purpose of sentiment analysis in this research is to assess whether a comment related to an issue is negative or positive, as well as a guide in improving the quality or service of a product. From the analysis results obtained, the Naïve Bayes model showed mixed performance depending on the type of application and sentiment. The model generally showed better performance in identifying positive reviews, especially on Facebook, Telegram, and Instagram apps, where recall reached 100%. However, the model performed very poorly in identifying neutral reviews across all apps. To increase accuracy and more balanced sentiment detection capabilities, improvements in data preprocessing, handling data imbalance, or the use of more complex classification methods are needed.
Implementation of Text Mining for Emotion Detection Using The Lexicon Method (Case Study: Tweets About Pemilu 2024) Afifah Salsabilah Putri; Eujeniatul Jannah; Dodi Vionanda; Syafriandi
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/348

Abstract

The presidential election is a five-year event that is an important and crucial moment in the realisation of democracy in the Unitary State of the Republic of Indonesia (NKRI). In the modern political era, the development of information technology has had a significant impact in changing the way people interact and express their views on political issues, including in the Presidential election.  One of the social media platforms that is often used to debate political and social issues is Twitter. The analysis method used in this research is sentiment and emotion analysis with a lexicon-based approach. The research stages consist of twitter data collection, data preprocessing, and emotion feature extraction. The first word to be highlighted in the 2024 election series on twitter social media is Anies. Trust is the most dominant emotion towards the three candidate pairs, namely Anies Muhaimin, Prabowo Gibran, and Ganjar Mahfud, showing high public trust.
Analisis Sentimen Pengguna Twitter Terhadap Serangan Moskow oleh ISIS dengan Algoritma Naive Bayes Pratiwi, Cindy; Dodi Vionanda; Fayyadh Ghaly
UNP Journal of Statistics and Data Science Vol. 3 No. 1 (2025): UNP Journal of Statistics and Data Science
Publisher : Departemen Statistika Universitas Negeri Padang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.24036/ujsds/vol3-iss1/349

Abstract

This study aims to analyze public sentiment towards the ISIS attack in Moscow, Russia on March 22, 2024 through twitter data using the Naive Bayes classification method. The attack had a significant impact on people's perceptions and reactions as reflected in the tweets of twitter social media users. To analyze this, 3005 English tweets from 22 March 2024 to 30 April 2024 relating to the event were collected using the crawling method with the phyton programming language. Preprocessing was done on the data to clean the data, then data labeling was done using phyton TextBlob. Naive Bayes algorithm is used to classify the sentiment of tweets into positive, and negative classes. The results of the research using Naive Bayes show that public sentiment tends to be negative towards the attacks that occurred. Naive Bayes classification results are quite good with an accuracy value of 70%, but there is an imbalance of data that tends to be biased towards negative sentiment. This research provides insight into how public opinion responds to events that occur and the performance of the Naive Bayes model in classification.