Claim Missing Document
Check
Articles

Comparative Performance of GLMM and GEE for Longitudinal Beta Regression in Economic Inequality Modelling Sihombing, Pardomuan Robinson; Erfiani; Khairil Anwar Notodiputro; Anang Kurnia
Advance Sustainable Science Engineering and Technology Vol. 7 No. 3 (2025): May - July
Publisher : Science and Technology Research Centre Universitas PGRI Semarang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.26877/asset.v7i3.2057

Abstract

Due to the shortcomings of conventional Gaussian methods, specialized models are frequently needed for longitudinal data analysis with bounded outcomes, such as the Gini ratio. In order to model economic inequality in Indonesia, this study compares the effectiveness of Generalized Linear Mixed Models (GLMM) and Generalized Estimating Equations (GEE) for beta-distributed longitudinal data. Root Mean Square Error (RMSE) and pseudo R-squared values are used to assess model performance using panel data from 10 provinces between 2018 and 2024 as well as important socioeconomic indicators. With lower RMSE and higher explanatory power across all provincial subsets, the results consistently demonstrate that GLMM performs better than both GEE and generalized linear models (GLM). ANOVA tests verify that modeling methodologies, not data heterogeneity in GRDP or Gini values, are responsible for the differences in model performance. These results demonstrate how well GLMM handles complex data structures and within-subject correlations, providing more accurate and effective estimates in longitudinal beta regression scenarios. The study encourages the use of GLMM for more precise longitudinal analysis in economic and social research and offers insightful information for researchers modeling inequality indices.
Bibliometric Mapping and Trend Analysis of Beta Regression Modeling: A Decade of Development (2015–2024) Sihombing, Pardomuan Robinson; Erfiani, Erfiani; Notodiputro, Khairil Anwar; Kurnia, Anang
Sinkron : jurnal dan penelitian teknik informatika Vol. 9 No. 3 (2025): Article Research July 2025
Publisher : Politeknik Ganesha Medan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33395/sinkron.v9i3.14949

Abstract

Beta regression is a statistical model designed to handle dependent variables that assume values within the open interval (0, 1), such as rates, proportions, or percentages. The study aimed to determine the development of beta regression over the last 10 years with a bibliometric approach. The source of the article database used comes from the Scopus website. The tool used for analysis is R software with a bibliometrix package. The results of this study show that there are 293 articles published in the Scopus Journal. Research develops in various research fields. The author with the most articles is Cribari-Neto, F., with the most significant number of documents, i.e., 12. According to the author's country of origin related to the beta regression method, Brazil has the most countries, while Indonesia is in 12th place. Therefore, research on beta regression still has excellent potential to continue to be developed.
Mendeteksi Unsur Depresi pada Unggahan Media Sosial Menggunakan Metode Machine Learning dengan Optimasi Berbasis Inspirasi Alam Santoso, Zein Rizky; Wigena, Aji Hamim; Kurnia, Anang
ESTIMASI: Journal of Statistics and Its Application Vol. 6, No. 2, Juli, 2025 : Estimasi
Publisher : Hasanuddin University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20956/ejsa.v6i2.45516

Abstract

Social media has now become an inseparable part of everyday life, including in expressing emotions and mental states. One popular platform is X (formerly Twitter), where many users indirectly share signs of depression. This study develops a classification model to detect indications of depression in social media posts, using machine learning algorithms and feature selection techniques based on nature-inspired algorithms. The classification algorithms used include Naïve Bayes, k-Nearest Neighbors (k-NN), Decision Tree, Random Forest, and XGBoost. Each algorithm is combined with feature selection techniques using Particle Swarm Optimization (PSO), Bat Algorithm (BA), and Flamingo Search Algorithm (FSA). The models are evaluated based on accuracy, precision, recall, F1-score, and the number of features used. The results show that the combination of the Random Forest method with FSA-based feature selection (RF-FSA) delivers the best performance, with an accuracy of 82.2%, balanced precision and recall, and efficient feature usage. Another strong alternative is XGBoost with FSA (XGB-FSA), although it requires more features and longer computational time. This study demonstrates that selecting the right feature selection algorithm, particularly FSA, can significantly improve both the accuracy and efficiency of depression text classification models. The resulting model is expected to serve as a useful tool for early detection of depression symptoms from social media posts, allowing for quicker and more targeted interventions.
TEXT CLUSTERING ONLINE LEARNING OPINION DURING COVID-19 PANDEMIC IN INDONESIA USING TWEETS Tyas, Maulida Fajrining; Kurnia, Anang; Soleh, Agus Mohamad
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 16 No 3 (2022): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (1033.901 KB) | DOI: 10.30598/barekengvol16iss3pp939-948

Abstract

To prevent the spread of corona virus, restriction of social activities are implemented including school activities which reaps the pros and cons in community. Opinions about online learning are widely conveyed mainly on Twitter. Tweets obtained can be used to extract information using text clustering to group topics about online learning during pandemic in Indonesia. K-Means is often used and has good performance in text clustering area. However, the problem of high dimensionality in textual data can result in difficult computations so that a sampling method is proposed. This paper aims to examine whether a sampling method to cluster tweets can result to an efficient clustering than using the whole dataset. After pre-processing, five sample sizes are selected from 28300 tweets which are 250, 500, 2500, 10000 and 20000 to conduct K-Means clustering. Results showed that from 10 iterations, three main cluster topics appeared 90%-100% in sample size of 2500, 10000 and 20000. Meanwhile sample size of 250 and 500 tend to produced 20%-60% appearance of the three main cluster topics. This means that around 8% to 35% of tweets used can yield representative clusters and efficient computation which is four times faster than using entire dataset.
SMALL AREA ESTIMATION OF MEAN YEARS SCHOOL IN KABUPATEN BOGOR USING SEMIPARAMETRIC P-SPLINE Putri, Christiana Anggraeni; Indahwati, Indahwati; Kurnia, Anang
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 16 No 4 (2022): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (823.151 KB) | DOI: 10.30598/barekengvol16iss4pp1541-1550

Abstract

The Fay-Herriot model, generally uses the EBLUP (Empirical Best Linear Unbiased Prediction) method, is less flexible due to the assumption of linearity. The P-Spline semiparametric model is a modification of the Fay-Herriot model which can accommodate the presence of two components, linear and nonlinear predictors. This paper also deals spatial dependence among the random area effects so that a model with spatially autocorrelated errors will be implemented, known as the SEBLUP (Spatial Empirical Best Linear Unbiased Prediction) method. Using data from SUSENAS, PODES, and some publication from BPS, the main objective of this study is to estimate the mean years school at kecamatan level in Kabupaten Bogor using EBLUP, Semiparametric P-Spline approach and SEBLUP method. The results show that based on the RRMSE value, the cubic P-Spline model with three knots predicts the mean years school better than EBLUP. Meanwhile, the addition of spatial effects into the small area estimation has not been able to improve the estimated value of the P-Spline semiparametric approach.
SUBDISTRICT CLUSTERING IN WEST JAVA PROVINCE BASED ON DISEASE INCIDENCE OF JKN PARTICIPANTS PRIMARY SERVICES Nashir, Husnun; Kurnia, Anang; Fitrianto, Anwar
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 17 No 1 (2023): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (484.01 KB) | DOI: 10.30598/barekengvol17iss1pp0295-0304

Abstract

One of the efforts that can be done to optimize health services and the distribution of facilities and infrastructure efficiently in a wide scope is by profiling and clustering areas in the province of West Java to the scope of sub-districts that have similar characteristics of disease category. The methods that will be compared to get the best clustering are hierarchical clustering and ensemble clustering. The data used as the object of research is the BPJS Kesehatan capitation primary service sample data for the 2017-2018 period. Some of the important variables used include: primary disease diagnosis data (ICD-10) of patients at the puskesmas, service time, type of visit, and location of service sub-district. This study uses several evaluation metrics Silhouette coefficient, Dunn index, Davies-Bouldin index, and C-index to determine the optimal number of clusters formed. In addition, descriptive analysis and visualization of the clustering results are also used as considerations in selecting the optimal cluster. Based on the evaluation results, the optimal method is hierarchical clustering with complete linkage. This method produces three clusters: cluster 1 consists of 5 sub-districts that have a high/dominant mean value in almost all disease categories, cluster 2 consists of 26 sub-districts that have a medium mean value, and cluster 3 consists of 589 sub-districts that have a low mean value. Most of the members of clusters 1 and 2 are sub-districts located in the districts/cities around the national capital (DKI Jakarta) and the provincial capital (Bandung) while the members of cluster 3 are mostly sub-districts located in suburban districts/cities or far from the central government.
A COMPARISON OF LOGISTIC REGRESSION AND GEOGRAPHICALLY WEIGHTED LOGISTIC REGRESSION (GWLR) ON COVID-19 DATA IN WEST SUMATRA Haq, Irvanal; Aidi, Muhammad Nur; Kurnia, Anang; Efriwati, Efriwati
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 17 No 3 (2023): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol17iss3pp1749-1760

Abstract

An understanding of factors that affect the recovery time from a disease is important for the community, medical staff, and also the government. This research analyzed factors that affect the recovery time of Covid-19 sufferers in West Sumatra. In addition, the consumption of a herbal made from Sungkai leaves, which is believed by some people in West Sumatra to accelerate the healing from Covid-19, was also included in the analysis. The recovery time here was categorized into two classes (binary): 1 for within 2 weeks, and 0 for more than 2 weeks. The methods used were logistic regression and geographically weighted logistic regression (GWLR). GWLR provides estimates of parameters for each location. The data used in this study is Covid-19 data of 2021 taken from the Regional Research and Development Agency (Litbangda) of West Sumatra with a total of 764 observations collected from 19 regencies/cities in West Sumatra. The results showed that there was no difference between the logistic regression model and the GWLR models based on the values of AIC and the ratio of deviance and degrees of freedom (df). The addition of spatial factors through GWLR models did not provide additional information regarding the recovery of Covid-19 sufferers within 2 weeks or more than 2 weeks. The logistic regression model gives the result that, at significance level α = 10%, residence, vaccination status, and symptoms significantly affect the recovery time within 2 weeks or more for Covid-19 sufferers, while other variables, namely sex, age, Sungkai leaves consumption status, and ginger consumption status have no significant effects.
MODELING THE INCIDENCE OF MALNUTRITION IN BOGOR REGENCY USING ZERO-INFLATED NEGATIVE BINOMIAL MIXED EFFECT MODEL Sirodj, Dwi Agustin Nuriani; Sadik, Kusman; Kurnia, Anang
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 18 No 2 (2024): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol18iss2pp0961-0972

Abstract

Modeling response variables in the form of count data generally uses a model based on the Poisson distribution. However, some conditions, such as the presence of excess zero, can be found in the data that result in overdispersion, which will have an impact on the resulting variance in the model. In this paper, three approaches, namely the Poisson Mixed Model, the Negative Binomial (NB) Mixed Model, and the Zero-Inflated Negative Binomial (ZINB) Mixed Model, are used to model the incidence of malnutrition in Bogor Regency. The data used in this study are secondary data sourced from the West Java open data website. Based on the results of data analysis, it appears that the ZINB Mixed Model method is a method capable of accommodating random effects, overdispersion, and excess zero in modeling malnutrition in Bogor Regency. Variables that significantly affect the occurrence of malnutrition cases in villages in Bogor Regency include the Number of Children Weighed Routinely Every Month, Number of Children Measured for Length and Height Twice a Year, Number of Children under 12 Months Old Who Received Complete Basic Immunization, Number of Posyandu (Integrated Health Post), and Number of Parents/Caregivers Participating in Monthly Parenting (PAUD).
TWOFOLD SUBAREA MODEL FOR ESTIMATING COMMUTER PROPORTION IN 10 METROPOLITAN AREAS Amin, Yudi Fathul; Indahwati, Indahwati; Kurnia, Anang
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 18 No 2 (2024): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol18iss2pp1009-1022

Abstract

The metropolitan area is a major contributor to national GDP. The metropolitan area is a center of attraction for many people who come to earn income as commuters. Commuters are people who carry out work activities in the center of the metropolitan area, which are carried out by residents who live in suburban areas around the center of the metropolitan area and commute regularly every day. The availability of commuter statistics from surveys for presentation level down to the smallest administrative level, such as regencies/municipalities, is unreliable. This happens because this level of presentation has poor precision due to insufficient samples due to the Statistics Indonesia survey design for making estimates at the national and provincial levels. It can be done using small area estimation (SAE) to meet increasing data needs, but existing SAE models can often estimate only at one level. To meet data requests more effectively, a model is needed that can estimate several small areas simultaneously. In SAE, one of the SAE models that can do this is the twofold subarea model. The twofold subarea model produces estimates of the proportion of commuters with good precision at the subarea level (regencies/municipalities) and area level (metropolitan area), with the RRMSE percentage value of the estimated proportion of commuters being below 25% for all regions. The results of this research can be used to present commuter data at the regencies/municipalities level and metropolitan area level where there is a lack of samples and become a new opportunity for Statistics Indonesia to increase statistical production in small areas, which is more effective compared to other SAE methods which have so far been used only to estimate one area level.
MULTINOMIAL LOGISTIC REGRESSION MODEL USING MAXIMUM LIKELIHOOD APPROACH AND BAYES METHOD ON INDONESIA'S ECONOMIC GROWTH PRE TO POST COVID-19 PANDEMIC Purwanto, Arie; Suprayogi, Muhammad Aziz; Setiawan, Erwan; Loly, Joao Ferreira Rendes Bean; Rahman, Gusti Arviana; Kurnia, Anang
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 19 No 1 (2025): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol19iss1pp51-62

Abstract

Economic growth in Indonesia has become a major concern in the global context, especially before and after the Covid-19 pandemic. Key sectors such as tourism, manufacturing, trade and transportation have been seriously affected by restrictions on travel and economic activity imposed to control the spread of the virus. Therefore, it is considered necessary to carry out modeling to describe existing conditions. In this research, two approaches were used, namely the Maximum Likelihood approach and the Bayes approach. The use of methods in general as research material for researchers to study these two methods further. So far the algorithm used for the Bayes concept method is Markov Chain Monte Carlo with Hasting's Metropolis method. The parameter estimation results obtained from both methods are considered quite identical. However, it is necessary to pay attention to the iteration procedure that will be carried out. The selection of factors used in the iteration process is very determining in obtaining estimated parameter values. Furthermore, the results obtained so far do not contain any fundamental differences regarding economic growth in Indonesia. In general, Indonesia can be said to be stable in terms of economic growth.
Co-Authors . Hanniva . Marzuki . Sutriyati Abdullah Ilman Fahmi Achmad Fauzan Achmad Fauzan, Achmad Agus Buono Agus M Soleh Agus Mohamad Soleh Ahmad Ansori Mattjik Ajeng Bita Alfira Aji Hamim Wigena Alkahfi, Cahya Amalia Pasaribu, Asysta Amin, Yudi Fathul Anik Djuraidah Ardiansyah, Muhlis Arie Anggreyani Arief Gusnanto ASEP SAEFUDDIN Astri Fatimah Azka Ubaidillah Bagus Sartono Bambang Sumantri Beny Trianjaya Budi Susetyo Budi Waryanto Cici Suhaeni Citra Jaya Dede Dirgahayu Dede Dirgahayu Deiby T Salaki Dewi Juliah Ratnaningsih Dhea Dewanti Dian Handayani Dian Kusumaningrum Dian Kusumaningrum Dian Kusumaningrum, Dwi Agustin Nuriani Sirodj Dwi Wahyu Triscowati Efriwati Efriwati Erfiani Erfiani Erfiani Erwan Setiawan, Erwan Farit Mochamad Afendi Farit Mohamad Afendi Fauzi, Fatkhurokhman Fauziah, Ghina Febryna Sembiring Fitri Dewi Shyntia Fitrianto, Anwar Fitriyani Sahamony, Nur Gerry Alfa Dito Hamid, Assyifa Lala Pratiwi Hamim Wigena, Aji Haq, Irvanal Hari Wijayanto Hari Wijayanto Hari Wijayanto Hestiani Wulandari Hidayat, Agus Sofian Eka Hidayat, Muhammad I Made Sumertajaya I Wayan Mangku Ikhlasul Amalia Rahmi Ina Widayanty Indah Herlawati Indahwati Indonesian Journal of Statistics and Its Applications IJSA Ita Wulandari Iwan Kurniawan Khairani, Fitri Khairil Anwar Notodiputro Kristuisno Martsuyanto Kapiluka Kusman Sadik Loly, Joao Ferreira Rendes Bean Matualage, Dariani Maulana Achiar, Anshari Luthfi Muhammad Nur Aidi Mulianto Raharjo Nashir, Husnun Newton Newton Nurul Hidayati Pardomuan Robinson Sihombing Pasaribu, Asysta Amalia Pingkan Awalia Pramana, Setia Purba, Widyo Pura Purwanto, Arie Putri, Christiana Anggraeni Rahardiantoro, Septian Rahma Anisa Rahma Anisa Rahman, Gusti Arviana Retsi Firda Maulina Ristiyanti Ristiyanti Rysda Rysda Ryska Putri Madyasari Sahamony, Nur Fitriyani Santoso, Andrianto Santoso, Zein Rizky Sari Agustini Hafman Septiani, Adeline Vinda Setyowati, Indah Rini Siregar, Jodi jhouranda Siskarossa Ika Oktora Siti Muchlisoh Suhaeni, Cici Suprayogi, Muhammad Azis Suprayogi, Muhammad Aziz Teguh Prasetyo Thooriq Ghaith Topan . Ruspayandi Triscowati, Dwi Wahyu Tyas, Maulida Fajrining Utami Dyah Syafitri Viarti Eminita Widiyanto, Rhendy K. P. Widoretno, Widoretno Yani Nurhadryani Yenni Angraini Yenni Kurniawati Yudistira Yudistira Yully Sofyah Waode