p-Index From 2021 - 2026
7.537
P-Index
This Author published in this journals
All Journal FORUM STATISTIKA DAN KOMPUTASI Media Statistika Statistika JURNAL MATEMATIKA STATISTIKA DAN KOMPUTASI IPTEK The Journal for Technology and Science CAUCHY: Jurnal Matematika Murni dan Aplikasi Sosioinforma JUITA : Jurnal Informatika Jurnal Pengelolaan Sumberdaya Alam dan Lingkungan (Journal of Natural Resources and Environmental Management) International Journal of Advances in Intelligent Informatics Scientific Journal of Informatics JOIN (Jurnal Online Informatika) Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Indonesian Journal of Applied Statistics Jurnal Penelitian Pertanian Tanaman Pangan BAREKENG: Jurnal Ilmu Matematika dan Terapan JOURNAL OF APPLIED INFORMATICS AND COMPUTING SINTECH (Science and Information Technology) Journal MIND (Multimedia Artificial Intelligent Networking Database) Journal JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika) Jurnal Aplikasi Statistika & Komputasi Statistik FIBONACCI: Jurnal Pendidikan Matematika dan Matematika Inferensi International Journal of Advances in Data and Information Systems InPrime: Indonesian Journal Of Pure And Applied Mathematics ESTIMASI: Journal of Statistics and Its Application Majalah Ilmiah Matematika dan Statistika (MIMS) Jurnal Lebesgue : Jurnal Ilmiah Pendidikan Matematika, Matematika dan Statistika Journal of Applied Data Sciences Enthusiastic : International Journal of Applied Statistics and Data Science Prosiding Seminar Nasional Official Statistics Jurnal Natural Eduvest - Journal of Universal Studies Xplore: Journal of Statistics PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND OFFICIAL STATISTICS Parameter: Jurnal Matematika, Statistika dan Terapannya Scientific Journal of Informatics Journal of Mathematics, Computation and Statistics (JMATHCOS) Advance Sustainable Science, Engineering and Technology (ASSET) Indonesian Journal of Statistics and Its Applications Journal on Mathematics Education
Claim Missing Document
Check
Articles

Optimizing Machine Learning for Daily Rainfall Prediction in Bogor: A Statistical Downscaling Approach Intan Arassah, Fradha; Sadik, Kusman; Sartono, Bagus; Sofan, Parwati
Eduvest - Journal of Universal Studies Vol. 5 No. 6 (2025): Eduvest - Journal of Universal Studies
Publisher : Green Publisher Indonesia

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.59188/eduvest.v5i6.51307

Abstract

This study explores the use of machine learning models as a statistical downscaling technique to predict daily rainfall in Bogor, Indonesia. The general circulation model (GCM) is a leading tool for climate prediction, and this research applied a two-stage machine learning model to improve its predictions. The main objectives were to evaluate different GCM domains and handle missing data using two imputation approaches. The first stage involved constructing datasets with varying methods for addressing missing values, followed by the application of a support vector classification (SVC) model to classify rainy and non-rainy days. In the second stage, a recurrent neural network (RNN) model was developed to predict daily rainfall amounts. The results revealed that using random forest imputation for missing data enhanced model accuracy and reduced the root mean square error (RMSE). Among the different GCM domains, the 5 km resolution GCM data was the most accurate when compared to local station climatology. The SVC model, using a radial basis function kernel, achieved an impressive classification accuracy of 98.5%, while the RNN model achieved an RMSE of 16.19. These findings are valuable for improving rainfall predictions and can provide effective data-driven recommendations for disaster mitigation efforts in the region.
THE PROMINENCE OF VECTOR AUTOREGRESSIVE MODEL IN MULTIVARIATE TIME SERIES FORECASTING MODELS WITH STATIONARY PROBLEMS Rohaeti, Embay; Sumertajaya, I Made; Wigena, Aji Hamim; Sadik, Kusman
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 16 No 4 (2022): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (688.398 KB) | DOI: 10.30598/barekengvol16iss4pp1313-1324

Abstract

One of the problems in modelling multivariate time series is stationary. Stationary test results do not always produce all stationary variables; mixed stationary and non-stationary variables are possible. When stationary problems are found in multivariate time series modelling, it is necessary to evaluate the model's performance in various stationary conditions to obtain the best forecasting model. This study aims to get a superior multivariate time series forecasting model based on the goodness of the model in various stationary conditions. In this study, the evaluation of the model's performance through simulation data modelling is then applied to the actual data with a stationary problem, namely Bogor City inflation data. The best model in simulation modelling is based on the stability of RMSE and MAD in 100 replications. The results are that the VAR model is the best in various stationary conditions. Meanwhile, the best model on actual data modelling is based on evaluation in 4 folds for model fitting power and model forecasting power. The Bogor City inflation data modelling with the mixed stationary problem resulted in the best model, namely the VAR(1) model. This means the VAR model is good enough to be used as a forecasting model in mixed stationary conditions. Thus, in this study, based on the goodness of the model in two modelling scenarios in various stationary conditions, overall, it was found that the VAR model was superior to the VARD and VECM models.
A COMPARISON OF COX PROPORTIONAL HAZARD AND RANDOM SURVIVAL FOREST MODELS IN PREDICTING CHURN OF THE TELECOMMUNICATION INDUSTRY CUSTOMER Nurhaliza, Sitti; Sadik, Kusman; Saefuddin, Asep
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 16 No 4 (2022): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (398.623 KB) | DOI: 10.30598/barekengvol16iss4pp1433-1440

Abstract

The Cox Proportional hazard model is a popular method to analyze right-censored survival data. This method is efficient to use if the proportional hazard assumption is fulfilled. This method does not provide an accurate conclusion if these assumptions are not fulfilled. The new innovative method with a non-parametric approach is now developing to predict the time until an event occurs based on machine learning techniques that can solve the limitation of CPH. The method is Random Survival Forest, which analyzes right-censored survival data without regard to any assumptions. This paper aims to compare the predictive quality of the two methods using the C-index value in predicting right-censored survival data on churn data of the telecommunication industry customers with 2P packages consisting of Internet and TV, which are taken from all customer databases in the Jabodetabek area. The results show that the median value of the C-index of the RSF model is 0.769, greater than the median C-index value of the CPH model of 0.689. So the prediction quality of the RSF model is better than the CPH model in predicting the churn of the telecommunications industry customer.
TRANSFER FUNCTION AND ARIMA MODEL FOR FORECASTING BI RATE IN INDONESIA Khikmah, Khusnia Nurul; Sadik, Kusman; Indahwati, Indahwati
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 17 No 3 (2023): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol17iss3pp1359-1366

Abstract

Fluctuating gold prices can have an impact on various sectors of the economy. Some of the impacts of rising and falling gold prices are inflation, currency exchange rates, and the value of the Bank Indonesia benchmark interest rate (BI Rate). The data was taken from the Indonesian Central Statistics Agency's official website (BPS) for the Bank Indonesia benchmark interest rate (BI Rate) value. Therefore, research on the value of the Bank Indonesia benchmark interest rate (BI Rate) is essential with the gold price as a control. The purpose of this study is to forecast the value of the Bank Indonesia reference interest rate (BI Rate) with a transfer function model where the input variable used is the price of gold and forecast the value of the Bank Indonesia benchmark interest rate (BI Rate) with the ARIMA model. The analysis results show that the best model for forecasting the Bank Indonesia reference interest rate (BI Rate) is a transfer function model with a value of , , , and a noise series model with the MAPE value is
A PRELIMINARY STUDY OF SENTIMENT ANALYSIS ON COVID-19 NEWS: LESSON LEARNED FROM DATA ACQUISITION, PRE-PROCESSING, AND DESCRIPTIVE ANALYTICS Amalia, Rahmatin Nur; Sadik, Kusman; Notodiputro, Khairil Anwar
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 17 No 4 (2023): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol17iss4pp1901-1914

Abstract

Sentiment analysis is a method used to analyze opinions and feelings. The goal of sentiment analysis is to determine whether a document contains a positive or negative emotion. Along with the spread of Covid-19 cases, news related to Covid-19 has often become a trending topic in the mass media. Conducting sentiment analysis using all news becomes more challenging because it might take time and cost. Therefore, the sampling method is needed to obtain representative news for the analysis. Web scraping was employed to obtain the news article about Covid-19 in Indonesia. In order to select the representative news, two-step sampling was employed by using stratified and systematic random sampling. According to the topic modelling results using lambda 0.6, news articles are grouped into three topics: updating Covid-19 cases, vaccination, and government policy. In addition, based on the number of positive and negative words, news articles are grouped into news dominated by positive words, news dominated by negative words, and news with the same number of positive and negative words. Methods for representing text in numerical form have been developed. Some of them use tf-idf weighting and word embedding. It does not pay attention to word order or meaning, only based on the frequency of words both locally and globally. Furthermore, this method will form a vector size as large as the number of unique words in the document, so it is less effective when many documents are used. Meanwhile, the vector size generated from the word2vec method is not as much as the number of unique words in the corpus. In addition, word2vec considers the context of the words in the corpus.
SIMULATION STUDY OF HIERARCHICAL BAYESIAN APPROACH FOR SMALL AREA ESTIMATION WITH MEASUREMENT ERROR Latifah, Leli; Sadik, Kusman; Indahwati, Indahwati
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 17 No 4 (2023): BAREKENG: Journal of Mathematics and Its Applications
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol17iss4pp2059-2070

Abstract

In small area estimation (SAE), the auxiliary variables used are commonly derived from registration data such as census and administrative data. It is assumed that the auxiliary variables are available for all areas. The limited availability of auxiliary variables can be an obstacle in SAE. The additional information from the survey can be alternative data, but it is assumed that the auxiliary variables will contain measurement errors. This study conducted a simulation of data that aims to handle when auxiliary variables are measured with errors. Two simulations were studied with some scenarios to the percentage area where the auxiliary variable is measured with error and scenarios to the generated auxiliary variables. Compare four methods: direct estimation, Fay-Herriot Empirical Best Linear Unbiased Prediction (EBLUP-FH), Ybarra-Lohr SAE with measurement error (SaeME), and Hierarchical Bayesian SaeME. The results show that, in both the simulation study, the Hierarchical Bayesian SaeME method gives a smaller the EMSE value than the other two methods when auxiliary information is measured with error.
MODELING THE INCIDENCE OF MALNUTRITION IN BOGOR REGENCY USING ZERO-INFLATED NEGATIVE BINOMIAL MIXED EFFECT MODEL Sirodj, Dwi Agustin Nuriani; Sadik, Kusman; Kurnia, Anang
BAREKENG: Jurnal Ilmu Matematika dan Terapan Vol 18 No 2 (2024): BAREKENG: Journal of Mathematics and Its Application
Publisher : PATTIMURA UNIVERSITY

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30598/barekengvol18iss2pp0961-0972

Abstract

Modeling response variables in the form of count data generally uses a model based on the Poisson distribution. However, some conditions, such as the presence of excess zero, can be found in the data that result in overdispersion, which will have an impact on the resulting variance in the model. In this paper, three approaches, namely the Poisson Mixed Model, the Negative Binomial (NB) Mixed Model, and the Zero-Inflated Negative Binomial (ZINB) Mixed Model, are used to model the incidence of malnutrition in Bogor Regency. The data used in this study are secondary data sourced from the West Java open data website. Based on the results of data analysis, it appears that the ZINB Mixed Model method is a method capable of accommodating random effects, overdispersion, and excess zero in modeling malnutrition in Bogor Regency. Variables that significantly affect the occurrence of malnutrition cases in villages in Bogor Regency include the Number of Children Weighed Routinely Every Month, Number of Children Measured for Length and Height Twice a Year, Number of Children under 12 Months Old Who Received Complete Basic Immunization, Number of Posyandu (Integrated Health Post), and Number of Parents/Caregivers Participating in Monthly Parenting (PAUD).
Performance Comparison of Random Forest and XGBoost Optimized with Cuckoo Search Algorithm for Coconut Milk Adulteration Detection Using FTIR Spectroscopy I Gusti Ngurah, Sentana Putra; Kusman Sadik; Agus Mohamad Soleh; Cici Suhaeni
Journal of Mathematics, Computations and Statistics Vol. 8 No. 2 (2025): Volume 08 Nomor 02 (Oktober 2025)
Publisher : Jurusan Matematika FMIPA UNM

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35580/jmathcos.v8i2.7817

Abstract

Coconut milk has emerged as a strategic food commodity in the global tropical region, with market demand growing at 7.2% per annum since 2021. This increasing demand has led to sophisticated adulteration practices, including dilution with water. Such adulteration not only reduces the nutritional value but also poses serious health risks, including food poisoning and allergic reactions. This study developed an innovative detection method combining Fourier Transform Infrared (FTIR) spectroscopy with a sophisticated machine learning algorithm. We analyzed 719 coconut milk samples (wavelength range 2500-4000 nm) consisting of traditional market products and instant commercial products. This study aims to develop an FTIR-based coconut milk adulteration detection model by optimizing RF and XGBoost parameters using CSA and evaluating the comparative performance of the two models in identifying different types of adulterants. The spectral data underwent rigorous preprocessing using a combination of Standard Normal Variate (SNV) and Savitzky-Golay (SG) techniques to overcome the effects of noise and light scattering, which significantly improved feature extraction. The results show that CSA-optimized XGBoost achieves superior performance with 92% accuracy and 91% F1 score, outperforming Random Forest in all evaluation metrics. The model shows particular strength in precision (98%), indicating its outstanding ability to minimize false positives in adulteration detection. Stability tests through 30 experimental repetitions reveal that the combination of XGBoost+CSA maintains consistent performance with minimal variance, confirming its reliability for industrial applications. Comparative analysis shows that the combination of SNV+SG preprocessing improves the accuracy of the baseline model by 9-12%, while CSA optimization provides an additional performance improvement of 10-15%. This research makes significant contributions to food science and safety. This study demonstrates the effectiveness of CSA in optimizing spectroscopic models, achieving 19.5% higher precision. The combination of SNV+SG preprocessing improves the baseline accuracy by 9-12%, while CSA optimization provides an additional performance improvement of 10-15%. This study not only provides a rapid and non-destructive adulteration detection solution but also proves the effectiveness of the CSA approach in optimizing the spectroscopic model. These findings have important implications for strengthening food safety regulations and developing real-time quality control systems in the coconut milk industry.
Effect of Feature Normalization and Distance Metrics on K-Nearest Neighbors Performance for Diabetes Disease Classification Yusran, Muhammad; Sadik, Kusman; Soleh, Agus M; Suhaeni, Cici
Journal of Mathematics, Computations and Statistics Vol. 8 No. 2 (2025): Volume 08 Nomor 02 (Oktober 2025)
Publisher : Jurusan Matematika FMIPA UNM

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35580/jmathcos.v8i2.8012

Abstract

Diabetes is a global health issue with a steadily increasing prevalence each year. Early detection of the disease is an important step in preventing severe complications. The K-Nearest Neighbors (KNN) algorithm is often used in disease classification, but its performance is highly influenced by the choice of normalization method and distance metric used. This study aims to evaluate the effect of various normalization methods and distance metrics on the performance of the KNN algorithm in diabetes disease classification. The three normalization methods were employed: z-score normalization, min-max scaling, and median absolute deviation (MAD). In addition, the seven distance metrics were assessed: Euclidean, Manhattan, Chebyshev, Canberra, Hassanat, Lorentzian, and Clark. The dataset used is Pima Indians Diabetes which consists of 768 observations and 8 features. The data were split into 80% training data and 20% test data, and using 5-fold cross-validation to determine the optimal k value. The results show that the MAD-Canberra combination produces the highest overall accuracy, recall, and F1-score of 87.32%, 82.33%, and 81.94%, respectively. The highest precision was obtained from the Baseline-Hassanat combination at 86.96%, while the lowest performance was observed for the Z-Score-Chebyshev combination with F1-Score 58.02%. These results highlight that no single combination universally outperforms others, underscoring the need for empirical evaluation. Nonetheless, combining MAD normalization with metrics such as Canberra or Hassanat can serve as a strong starting point for developing KNN-based classification systems, especially in medical contexts that are sensitive to misclassification.
Analysis and Optimization of Rainfall Prediction in Makassar City Using Artificial Neural Networks Based on Data Augmentation, Regularization, and Bayesian Optimization Abdullah, Adib Roisilmi; Sadik, Kusman; Suhaeni, Cici; Saleh, Agus Muhammad
Journal of Mathematics, Computations and Statistics Vol. 8 No. 2 (2025): Volume 08 Nomor 02 (Oktober 2025)
Publisher : Jurusan Matematika FMIPA UNM

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35580/jmathcos.v8i2.8304

Abstract

This study develops a robust and efficient rainfall prediction model using an Artificial Neural Network (ANN), significantly enhanced through integrated data augmentation, regularization, and Bayesian optimization techniques. We utilized a dataset of 118 monthly rainfall records from Makassar City, spanning 2014–2022, sourced from the Meteorological, Climatological, and Geophysical Agency (BMKG). To effectively capture inherent temporal patterns, lag features (specifically lag-1, lag-3, and lag-6 rainfall values) were meticulously constructed as input variables. Subsequently, Min-Max normalization was applied across all features, ensuring input consistency and optimizing the ANN's learning process. An initial manual grid search identified the most effective baseline ANN architecture, featuring four hidden layers ([128, 32, 16, 64] neurons), a tanh activation function, and a learning rate of 0.01. While the baseline ANN model achieved a commendable initial performance with an RMSE of 0.1608, comprehensive experiments revealed the superior benefits of a fully integrated approach. This advanced model, which synergistically combined data augmentation (to address data limitations and enhance generalization), regularization (to mitigate overfitting), and Bayesian optimization (for efficient hyperparameter tuning), demonstrated significantly improved generalization capabilities and enhanced model stability. This integrated model yielded an RMSE of 0.1861, an MSE of 0.0346, and an MAE of 0.1359. These compelling findings unequivocally underscore that integrated optimization strategies are crucial for developing more robust and reliable ANN-based rainfall prediction models, particularly for critical applications in climate-based time series forecasting.
Co-Authors . Erfiani . Indahwati A.Tuti Rumiati Aam Alamudi Abdullah, Adib Roisilmi Achmad Fauzan Agus Mohamad Soleh Ahmad Rifai Nasution Aji Hamim Wigena Akbar Rizki Akbar Rizki Akmala Firdausi Alfiryal, Naufalia Amalia, Rahmatin Nur Anadra, Rahmi Ananda Shafira Anang Kurnia Andespa, Reyuli Andi Okta Fengki ASEP SAEFUDDIN Astari, Reka Agustia Astari, Reka Agustia Aulya Permatasari Azka Ubaidillah Bagus Sartono Budi Susetyo Budi Susetyo Cici Suhaeni Cici Suhaeni Dian Handayani Dito, Gerry Alfa Dwi Agustin Nuriani Sirodj Efriwati Efriwati Embay Rohaeti Eminita, Viarti Evita Purnaningrum Fahira, Fani FARDILLA RAHMAWATI Farit Mochamad Afendi Fitrianto, Anwar Freya, Wa Ode Rona Gerry Alfa Dito Haikal, Husnul Aris Hari Wijayanto Hasnataeni, Yunia Hazan Azhari Zainuddin Hermawati, Neni I Gusti Ngurah, Sentana Putra I Made Sumertajaya I Wayan Mangku Indahwati Indahwati Indahwati Intan Arassah, Fradha Iqbal, Teuku Achmad Isnanda, Eriski Kamila, Sabrina Adnin Khairi A N Khairil Anwar Notodiputro Khikmah, Khusnia Nurul khusnul khotimah Khusnul Khotimah Kusni Rohani Rumahorbo Latifah, Leli Lili Puspita Rahayu Logananta Puja Kusuma M Soleh, Agus Mochamad Ridwan Mochamad Ridwan, Mochamad Mohammad Masjkur Muh Nur Fiqri Adham Muhammad Yusran Mulianto Raharjo Naima Rakhsyanda Nisrina Az-Zahra, Putri Nur Khamidah NURADILLA, SITI Nusar Hajarisman Pangestika, Dhita Elsha Parwati Sofan, Parwati Purnama Sari Rakhsyanda, Naima Rifqi Aulya Rahman Rita Rahmawati Rizaldi Boer Rizki, Akbar Rizqi, Tasya Anisah ROCHYATI ROCHYATI Rumahorbo, Kusni Rohani Sahamony, Nur Fitriyani Saleh, Agus Muhammad Satriyo Wibowo Sentana Putra, I Gusti Ngurah Siregar, Jodi jhouranda Siti Aisyah Siti Raudlah Sitti Nurhaliza Soleh, Agus M Suhaeni, Cici Sundari, Marta Supriatin, Febriyani Eka Tendi Ferdian Diputra Titin Suhartini Titin Suhartini, Titin Tri Wahyuni Uswatun Hasanah Utami Dyah Syafitri Viarti Eminita Widhiyanti Nugraheni Yenni Angraini Yenni Kurniawati Yuli Eka Putri Zafira Fakhriyah