cover
Contact Name
Sachnaz Desta Oktarina
Contact Email
sachnazdes@apps.ipb.ac.id
Phone
-
Journal Mail Official
ijsa@apps.ipb.ac.id
Editorial Address
sachnazdes@apps.ipb.ac.id
Location
Kota bogor,
Jawa barat
INDONESIA
Indonesian Journal of Statistics and Its Applications
ISSN : 25990802     EISSN : 25990802     DOI : -
Core Subject : Science, Education,
Indonesian Journal of Statistics and Its Applications (eISSN:2599-0802) (formerly named Forum Statistika dan Komputasi), established since 2017, publishes scientific papers in the area of statistical science and the applications. The published papers should be research papers with, but not limited to, the following topics: experimental design and analysis, survey methods and analysis, operation research, data mining, statistical modeling, computational statistics, time series and econometrics, and statistics education. All papers were reviewed by peer reviewers consisting of experts and academicians across universities and agencies
Articles 192 Documents
Performance Evaluation of ARDL Model Stacked with Boosted Ridge Regression on Time Series Data with Multicollinearity: Evaluasi Kinerja Estimasi Model ARDL stacked with Boosted Ridge Regression pada Data Deret Waktu dengan Multikolinearitas Dalimunthe, Amir Abduljabbar; Soleh, Agus Mohamad; Afendi, Farit Mochamad
Indonesian Journal of Statistics and Applications Vol 9 No 1 (2025)
Publisher : Statistics and Data Science Program Study, IPB University, IPB University, in collaboration with the Forum Pendidikan Tinggi Statistika Indonesia (FORSTAT) and the Ikatan Statistisi Indonesia (ISI)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v9i1p136-144

Abstract

Time series data plays a vital role in financial and economic study. Two commonly applied models for such data are Vector Autoregression (VAR) and Autoregressive Distributed Lags (ARDL). Nonetheless, interdependence among explanatory variables often leads to multicollinearity, posing challenges for model reliability. This study investigates the effectiveness of the ARDL model integrated with boosted ridge regression as a method to mitigate multicollinearity. Due to limitations in available empirical data, simulation data will be generated to support the analysis. The research consists of two stages: synthetic data generation and analysis on simulated data. Results suggest that ARDL performs well under various multicollinearity conditions, particularly when the training set is sufficiently large and model structure is correctly specified. For smaller training sets, the ARDL Ridge variant demonstrates improved predictive performance.
Exploring a Large Language Model on the ChatGPT Platform for Indonesian Text Preprocessing Tasks Suhaeni, Cici; Kamila, Sabrina Adnin; Fahira, Fani; Yusran, Muhammad; Alfa Dito, Gerry
Indonesian Journal of Statistics and Applications Vol 9 No 1 (2025)
Publisher : Statistics and Data Science Program Study, IPB University, IPB University, in collaboration with the Forum Pendidikan Tinggi Statistika Indonesia (FORSTAT) and the Ikatan Statistisi Indonesia (ISI)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v9i1p100-116

Abstract

Preprocessing is a crucial step in Natural Language Processing, especially for informal languages like Indonesian, which contain complex morphology, slang, abbreviations, and non-standard expressions. Traditional rule-based tools such as regex, IndoNLP, and Sastrawi are commonly used but often fall short in handling noisy, user-generated text. This study explores the capability of Large Language Model, particularly ChatGPT-o3, in performing Indonesian text preprocessing tasks, namely text cleaning, normalization, stopword removal, and stemming/lemmatization, and compares it to conventional rule-based approaches. Using two types of datasets, consisting of a small example dataset of five manually constructed sentences and a real-world dataset of 100 tweets about the Indonesian “Makan Bergizi Gratis” program, both preprocessing methods were applied and evaluated. Results show that ChatGPT-o3 performs equally well in text cleaning and significantly better in normalization. However, rule-based methods like IndoNLP and Sastrawi still outperform ChatGPT-o3 in stopword removal and stemming. These findings indicate that while ChatGPT-o3 demonstrates strong contextual understanding and linguistic flexibility, they may underperform in rigid, token-based operations without fine-tuning. This study provides initial insights into using Large Language Models as an alternative preprocessing engine for Indonesian text and highlights the need for hybrid approaches or improved prompt design in future applications.