Claim Missing Document
Check
Articles

A COMPARISON OF LOGISTIC REGRESSION, MIXED LOGISTIC REGRESSION, AND GEOGRAPHICALLY WEIGHTED LOGISTIC REGRESSION ON PUBLIC HEALTH DEVELOPMENT IN JAVA Setiawan, Erwan; Suprayogi, Muhammad Azis; Kurnia, Anang
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 19 No 1 (2025): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol19iss1pp129-140

Abstract

The Public Health Development Index (Indeks Pembangunan Kesehatan Masyarakat - IPKM) is a combined parameter that reflects progress in health development and is useful for determining areas that need assistance in improving health development. Through IPKM modeling, factors that significantly influence regional public health development can be discovered. This research aims to find an appropriate model for modeling IPKM and determine the factors that significantly influence public health development. The data used is the 2018 IPKM data collected from 119 cities/regencies in Java. We propose three models namely logistic regression (LR), mixed logistic regression (MLR), and geographically weighted logistic regression (GWLR). The research results show that the MLR is the best model for modeling IPKM in Java based on the AIC value criteria. Based on the MLR model, the factors that have a significant influence on public health development are the egg and milk consumption level and the percentage of the number of doctors per thousand population.
FACTORS AFFECTING INDONESIAN PADDY HARVEST FAILURE: A COMPARISON OF BETA REGRESSION, QUASI-BINOMIAL REGRESSION, AND BETA MIXED MODELS Kusumaningrum, Dian; Hidayat, Agus Sofian Eka; Notodiputro, Khairil Anwar; Kurnia, Anang; Sartono, Bagus; Sumertajaya, I Made
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 18 No 4 (2024): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol18iss4pp2611-2622

Abstract

The Paddy harvest failure rate is one of the key aspects in determining the total number of claims in a crop insurance policy. It is also an important factor indicating the fulfillment of targeted total production. Therefore, we proposed Beta Regression, Quasi Binomial Regression, and Beta Mixed Models which can be used to analyze significant variables affecting paddy harvest failure rates. Model selection and evaluations indicated that the Nested Beta Mixed Model is the best. Previous research has shown four significant fixed effect variables: drought, flood, pests, and disease risks. Pests and other types of risks also affect the variability of loss rate. All variables have positive effects, indicating higher values cause a higher possibility of a higher average harvest failure rate. High variability was shown for province, municipality, and farmers' random effects. Hence, to prevent a more significant loss rate, MoA should consider more intensive and innovative participatory activities in farmer groups to enhance good farming practices, especially for farmers who suffer from certain risks. These activities should also consider the local characteristics of each province or municipality. As for AUTP development and improvement, farmers with lower failure risks could be given a discounted premium to make it more appealing.
Sentiment Classification on the 2024 Indonesian Presidential Candidate Dataset Using Deep Learning Approaches Suhaeni, Cici; Wijayanto, Hari; Kurnia, Anang
Indonesian Journal of Statistics and Applications Vol 8 No 2 (2024)
Publisher : Statistics and Data Science Program Study, IPB University, IPB University, in collaboration with the Forum Pendidikan Tinggi Statistika Indonesia (FORSTAT) and the Ikatan Statistisi Indonesia (ISI)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v8i2p83-94

Abstract

This study aims to compare the performance of three deep learning models (LSTM, BiLSTM, and GRU) in the task of sentiment classification for the 2024 Indonesian Presidential Candidate dataset, focusing specifically on the case of Prabowo Subianto. The dataset comprises social media X posts sourced from kaggle, and the analysis investigates the effectiveness of different variants of recurrent neural network architectures in identifying public sentiment. The models were evaluated on accuracy and F1 score. The results demonstrate that BiLSTM outperformed both LSTM and GRU models in all metrics, achieving a testing accuracy of 80.70% and an F1 score of 86.86%, compared to LSTM and GRU which both achieved a testing accuracy of 72.56% and an F1 score of approximately 84%. The higher performance of BiLSTM is attributed to its ability to capture bidirectional context within the text, thereby understanding complex sentiment patterns more effectively. LSTM and GRU models displayed similar performance, therefore BiLSTM is the best model for this dataset. These results indicate that BiLSTM is especially well-suited for analyzing public sentiment towards political figures like Prabowo Subianto, offering significant insights into public discussions surrounding the 2024 Indonesian Presidential Election. This study recommends exploring transformer-based models like BERT or GPT variants to enhance sentiment classification accuracy in this domain.
STRATEGY FOR ELIMINATING NEGLECTED TROPICAL DISEASES THROUGH INDIVIDUAL AND AREA ASPECTS USING THE HIERARCHICAL LOGISTIC REGRESSION METHOD Oktora, Siskarossa Ika; Matualage, Dariani; Amalia Pasaribu, Asysta; Fitriyani Sahamony, Nur; Kurnia, Anang
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 19 No 4 (2025): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol19iss4pp2495-2506

Abstract

Filariasis is one of the Neglected Tropical Diseases (NTDs) that is often associated with poverty and marginalized community groups. Papua is the province with the highest number of chronic filariasis cases and has the largest number of endemic districts/municipalities compared to other provinces in Indonesia. Papua is also the province with the highest poverty rate in Indonesia. To support the government's filariasis elimination program, this study aims to determine variables that influence the incidence of filariasis in Papua at the individual and area levels. This study uses 2018 Indonesia Basic Health Research data from the Ministry of Health and regional data from BPS-Statistics Indonesia. The results using Hierarchical Binary Logistic Regression concluded that defecation behavior in latrines, prevention behavior against mosquito bites, participation in mass preventive drug administration, number of poor people, and number of health workers have a significant effect on the incidence of filariasis. In contrast, the variables age, gender, type of work, and level of education do not have a significant effect.
A Machine Learning Approach to Spatial Analysis of Paddy Field Conversion Using Multispectral Sentinel-2A Imagery Fauzan, Achmad; Kurnia, Anang
JOIV : International Journal on Informatics Visualization Vol 9, No 5 (2025)
Publisher : Society of Visual Informatics

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.62527/joiv.9.5.3617

Abstract

The expanse of rice fields is a critical metric as it is intimately linked to agricultural productivity in a given locale. This study investigates the application of satellite imagery to quantify trice fields' acreage and temporal variations. The data utilized was acquired by the Sentinel-2A multispectral satellite. The variables employed are the image's baseband and spectral index. The research area encompasses the Sukamakmur sub-district in Bogor Regency, Indonesia. The types of machine learning models include Extreme Gradient Boosting (XGBoost), Multi-Layer Perceptron (MLP), and k-Nearest Neighbor (kNN). The simulation of class numbers is conducted to achieve the most stable and precise evaluation metric values. The XGBoost algorithm is used for the overall classification process of the region based on the optimal metric score. The model's accuracy, precision, recall, and F1-score are 92.37%, 92.3%, 92.38%, and 92.33%, respectively, indicating a very good performance. The model successfully captures a decline in rice field area between 2020 and 2023. Using the Modified Moran’s Index (MMI), the study reveals a positive spatial autocorrelation, indicating a clustered pattern in land-use change. Regions that experience either substantial or minor changes in land use are commonly situated near areas exhibiting similar characteristics. This study presents a spatially aware machine learning framework that enables the effective monitoring of agricultural land-use dynamics. In the future, this framework can be enhanced by integrating time-series forecasting and socio-economic data, supporting more informed decision-making in food security planning and agricultural policy development.
Perbandingan Metode GARCH, LSTM, GRU, dan CNN pada Peramalan Volatilitas Kurs Septiani, Adeline Vinda; Afendi, Farit Mochamad; Kurnia, Anang
Limits: Journal of Mathematics and Its Applications Vol. 22 No. 1 (2025): Limits: Journal of Mathematics and Its Applications Volume 22 Nomor 1 Edisi Ma
Publisher : Pusat Publikasi Ilmiah LPPM Institut Teknologi Sepuluh Nopember

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.12962/limits.v22i1.3384

Abstract

Currency volatility is an important aspect of time series data analysis in economics and finance. This study aims to compare the performance of four methods: Generalized Autoregressive Conditional Heteroscedasticity (GARCH), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Convolutional Neural Network (CNN), in predicting the volatility of the Rupiah against the US Dollar. The data used is daily exchange rates from January 2015 to March 2024. The evaluation is conducted by calculating the Root Mean Square Error (RMSE) and the percentage of actual values within a 95% confidence interval on training and testing data. The results indicate that LSTM achieves the lowest RMSE, with values of 5.30E-05 on training data and 2.50E-05 on testing data, demonstrating high accuracy in capturing non-linear patterns and long-term fluctuations. GRU records the highest percentage of actual values within the confidence interval, at 90.32% for training data and 91.72% for testing data, reflecting superior consistency compared to other methods. Meanwhile, GARCH shows competitive performance but lacks robustness on testing data. CNN exhibits the lowest performance, with high RMSE and a low percentage of data within the confidence interval. Overall, GRU emerges as the best method, offering an optimal balance between predictive accuracy and consistency, making it a reliable tool for modeling exchange rate volatility in high-volatility scenarios. Consequently, GRU is utilized for forecasting exchange rate volatility for the next 30 days. These findings contribute to the selection of appropriate methods for modeling exchange rate volatility, particularly amidst global market uncertainty.
Evaluating Ordinal Multivariate Models under Multicollinearity via Pairwise Likelihood: A Simulation Perspective Achmad Fauzan; Kusman Sadik; Anang Kurnia
Advance Sustainable Science Engineering and Technology Vol. 7 No. 4 (2025): August-October
Publisher : Science and Technology Research Centre Universitas PGRI Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26877/asset.v7i4.2282

Abstract

This study examines the effect of multicollinearity on ordinal regression through a two-stage Monte Carlo simulation. A synthetic population of 2,000,000 observations was generated with predictors drawn from a normal distribution, and responses simulated using an ordinal probit model. A Monte Carlo procedure was employed with 10 repetitions, each consisting of 100 random samples of 1,000 observations. Parameter estimation employed Maximum Likelihood Estimation (MLE) for univariate models and Pairwise Likelihood (PL) for multivariate models, with performance assessed using mean squared error (MSE), bias, and computation time. Results show that multicollinearity had negligible impact on estimator bias and MSE, confirming the robustness of both MLE and PL to correlated predictors. However, severe multicollinearity substantially increased computation time, indicating a trade-off between estimator stability and efficiency. These findings highlight PL as a viable approach for analyzing complex ordinal data, particularly in applications such as socio-economic surveys and health metrics where predictor correlation is unavoidable.
Determinants of Environmental Quality Index (EQI) in Indonesia in 2018-2022 Sihombing, Pardomuan Robinson; Erfiani, Erfiani; Notodiputro, Khairil Anwar; Kurnia, Anang
KEUNIS Vol. 13 No. 2 (2025): JULY 2025
Publisher : Finance and Banking Program, Accounting Department, Politeknik Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32497/keunis.v13i2.6559

Abstract

The environment is a critical issue in sustainable development in Indonesia, with significant variations in environmental quality between regions. This study seeks to examine the influence of the Regional Government Budget, COVID-19 (as a dummy variable), Gross Regional Domestic Product (GRDP), and the Human Development Index (HDI) on the Environmental Quality Index (EQI) in Indonesia. The data for this study were obtained from BPS–Statistics Indonesia and the Ministry of Environment and Forestry, covering the period from 2018 to 2022. The analysis employs multiple linear regression using panel data. Panel model testing indicates that the fixed effects model with cross-sectional lag provides the best fit. The results show that, collectively, all variables have a significant influence on Indonesia's Environmental Quality Index (EQI). Individually, the Regional Government Budget for environmental purposes, the COVID-19 dummy variable, and the Human Development Index (HDI) have a significant positive impact on EQI. In contrast, Gross Regional Domestic Product (GRDP) has a significant negative effect. These findings highlight the need for comprehensive macro-socioeconomic policies to sustain and enhance environmental quality in Indonesia.
The Empirical Best Linear Unbiased Prediction and The Emperical Best Predictor Unit-Level Approaches in Estimating Per Capita Expenditure at the Subdistrict Level Fauziah, Ghina; Kurnia, Anang; Djuraidah, Anik
Scientific Journal of Informatics Vol. 12 No. 2: May 2025
Publisher : Universitas Negeri Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.15294/sji.v12i2.25037

Abstract

Purpose: This study aims to estimate and evaluate per capita expenditure at the subdistrict level in Garut Regency by employing unit-level Small Area Estimation (SAE) techniques, specifically utilizing the Empirical Best Linear Unbiased Predictor (EBLUP) and the Empirical Best Predictor (EBP) methods. Methods: The data used in this study are socio-economic data, specifically per capita household expenditure in Garut Regency. Socio-economic data generally skew positively rather than the normal distribution, so a method that can approximate or come close to the normal distribution is needed, for example, log-normal transformation. To improve the performance of EBLUP, which may lead to inefficient estimators because of violation of the assumption of normality, this study proposes the Empirical Best Predictor (EBP) method. It handles positively skewed data by applying log-normal transformation to sample data so that it more closely conforms to the desired distribution. Result: The EBP results are more stable than EBLUP since EBLUP is highly sensitive to outliers, and in cases where the normality assumption is violated, it produces a significant mean square error and inefficient estimators. Evaluating the estimates with both EBLUP and EBP shows Relative Root Mean Squared Error (RRMSE) values above 25%, especially in the subdistricts of Pamulihan, Sukaresmi, and Kersamanah. This is probably due to the household samples being taken in these three subdistricts being comparatively small compared to the other. Novelty: In this research, we use EBP to improve the performance of EBLUP, which produces inefficient estimators when the normality assumption is violated.
Evaluasi Performa Rmixmod dan KAMILA dalam Pengelompokan Perguruan Tinggi di Indonesia Berdasarkan Data Capaian Kinerja Bertipe Campuran Santoso, Andrianto; Kurnia, Anang; Hamim Wigena, Aji
Building of Informatics, Technology and Science (BITS) Vol 7 No 1 (2025): June (2025)
Publisher : Forum Kerjasama Pendidikan Tinggi

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.47065/bits.v7i1.7376

Abstract

Clustering is a technique for grouping objects based on their similarities within clusters and their differences across clusters. In real-world, objects often have characteristics represented by a combination of numerical and categorical variables, requiring clustering techniques that can process mixed-type data. Model-based clustering is one of the approaches that can be utilized for such data. This study evaluates and compares two model-based clustering algorithms for mixed data type, Rmixmod, which employs a mixture model with maximum likelihood estimation and expectation-maximization, and KAMILA, which utilizes a semi-parametric approach. Both algorithms are implemented to cluster Indonesian higher education institutions based on their performance. The optimal number of clusters is determined using the Bayesian Information Criterion and the Silhouette Coefficient. Algorithms performance is evaluated using the Silhouette Coeeficient, the Calinski-Harabasz Index, and the Davies-Bouldin Index. The research results showed that the Rmixmod algorithm outperformed KAMILA in clustering Indonesian higher education institutions, with a Silhouette Coeeficient of 0.2878, a Calinski-Harabasz Index of 253.9433, and a Davies-Bouldin Index of 1.5321. The optimal number of clusters formed was five. Cluster interpretation is conducted by analyzing the mean values of PC and the distribution of categorical variables within each cluster. The clustering results are expected to serve as a foundation for the government in formulating strategic policies that are both effective and differentiated according to the characteristics of each group of higher education institutions.
Co-Authors . Hanniva . Marzuki . Sutriyati Abdullah Ilman Fahmi Achmad Fauzan Achmad Fauzan, Achmad Agus Buono Agus M Soleh Agus Mohamad Soleh Ahmad Ansori Mattjik Ajeng Bita Alfira Aji Hamim Wigena Alkahfi, Cahya Amalia Pasaribu, Asysta Amin, Yudi Fathul Anik Djuraidah Ardiansyah, Muhlis Arie Anggreyani Arief Gusnanto ASEP SAEFUDDIN Astri Fatimah Azka Ubaidillah Bagus Sartono Bambang Sumantri Beny Trianjaya Budi Susetyo Budi Waryanto Cici Suhaeni Citra Jaya Dede Dirgahayu Dede Dirgahayu Deiby T Salaki Dewi Juliah Ratnaningsih Dhea Dewanti Dian Handayani Dian Kusumaningrum Dian Kusumaningrum Dian Kusumaningrum, Dwi Agustin Nuriani Sirodj Dwi Wahyu Triscowati Efriwati Efriwati Erfiani Erfiani Erfiani Erwan Setiawan, Erwan Farit Mochamad Afendi Farit Mohamad Afendi Fauzi, Fatkhurokhman Fauziah, Ghina Febryna Sembiring Fitri Dewi Shyntia Fitrianto, Anwar Fitriyani Sahamony, Nur Gerry Alfa Dito Hamid, Assyifa Lala Pratiwi Hamim Wigena, Aji Haq, Irvanal Hari Wijayanto Hari Wijayanto Hari Wijayanto Hestiani Wulandari Hidayat, Agus Sofian Eka Hidayat, Muhammad I Made Sumertajaya I Wayan Mangku Ikhlasul Amalia Rahmi Ina Widayanty Indah Herlawati Indahwati Indonesian Journal of Statistics and Its Applications IJSA Ita Wulandari Iwan Kurniawan Khairani, Fitri Khairil Anwar Notodiputro Kristuisno Martsuyanto Kapiluka Kusman Sadik Loly, Joao Ferreira Rendes Bean Matualage, Dariani Maulana Achiar, Anshari Luthfi Muhammad Nur Aidi Mulianto Raharjo Nashir, Husnun Newton Newton Nurul Hidayati Pardomuan Robinson Sihombing Pasaribu, Asysta Amalia Pingkan Awalia Pramana, Setia Purba, Widyo Pura Purwanto, Arie Putri, Christiana Anggraeni Rahardiantoro, Septian Rahma Anisa Rahma Anisa Rahman, Gusti Arviana Retsi Firda Maulina Ristiyanti Ristiyanti Rysda Rysda Ryska Putri Madyasari Sahamony, Nur Fitriyani Santoso, Andrianto Santoso, Zein Rizky Sari Agustini Hafman Septiani, Adeline Vinda Setyowati, Indah Rini Siregar, Jodi jhouranda Siskarossa Ika Oktora Siti Muchlisoh Suhaeni, Cici Suprayogi, Muhammad Azis Suprayogi, Muhammad Aziz Teguh Prasetyo Thooriq Ghaith Topan . Ruspayandi Triscowati, Dwi Wahyu Tyas, Maulida Fajrining Utami Dyah Syafitri Viarti Eminita Widiyanto, Rhendy K. P. Widoretno, Widoretno Yani Nurhadryani Yenni Angraini Yenni Kurniawati Yudistira Yudistira Yully Sofyah Waode