cover
Contact Name
Sachnaz Desta Oktarina
Contact Email
sachnazdes@apps.ipb.ac.id
Phone
-
Journal Mail Official
ijsa@apps.ipb.ac.id
Editorial Address
sachnazdes@apps.ipb.ac.id
Location
Kota bogor,
Jawa barat
INDONESIA
Indonesian Journal of Statistics and Its Applications
ISSN : 25990802     EISSN : 25990802     DOI : -
Core Subject : Science, Education,
Indonesian Journal of Statistics and Its Applications (eISSN:2599-0802) (formerly named Forum Statistika dan Komputasi), established since 2017, publishes scientific papers in the area of statistical science and the applications. The published papers should be research papers with, but not limited to, the following topics: experimental design and analysis, survey methods and analysis, operation research, data mining, statistical modeling, computational statistics, time series and econometrics, and statistics education. All papers were reviewed by peer reviewers consisting of experts and academicians across universities and agencies
Articles 192 Documents
ANALISIS INFLASI MENGGUNAKAN DATA GOOGLE TRENDS DENGAN MODEL ARIMAX DI DKI JAKARTA Newton Newton; Anang Kurnia; I Made Sumertajaya
Indonesian Journal of Statistics and Applications Vol 4 No 3 (2020)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v4i3.694

Abstract

Inflation is an important economic indicator in showing the economic symptoms of a region's price level. DKI Jakarta is the capital of Indonesia chosen as the center of the economic barometer because it can provide the greatest contribution and influence on the Indonesian economy. The ARIMAX model was used for forecasting by adding independent variables in the Google trends data. Google trends data were explored based on seven expenditure groups published by IHK. The purpose of this study was to determine the effect of forecast Google trends using BPS inflation data in DKI Jakarta. The result of the exploration of Google Trends data was forecasted to get the best forecast model results. The result of data analysis indicates that the forecast results approached the original BPS data with the best forecast model is ARIMAX (2.0.3) all variables X. Google Trends data can be used as forecasting but cannot be used as a reference policy decision.
Comparison of Functional Regression and Functional Principal Component Regression for Estimating Non-Invasive Blood Glucose Level: Perbandingan Metode Regresi Fungsional dan Regresi Komponen Utama Fungsional untuk Menduga Kadar Glukosa Darah pada Alat Non-Invasif Nurul Fadhilah; Erfiani Erfiani; Indahwati Indahwati
Indonesian Journal of Statistics and Applications Vol 5 No 1 (2021)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v5i1p14-25

Abstract

The calibration method is an alternative method that can be used to analyze the relationship between invasive and non-invasive blood glucose levels. Calibration modeling generally has a large dimension and contains multicolinearities because usually in functional data the number of independent variables (p) is greater than the number of observations (p>n). Both problems can be overcome using Functional Regression (FR) and Functional Principal Component Regression (FPCR). FPCR is based on Principal Component Analysis (PCA). In FPCR, the data is transformed using a polynomial basis before data reduction. This research tried to model the equations of spectral calibration of voltage value excreted by non-invasive blood glucose level monitoring devices to predict blood glucose using FR and FPCR. This study aimed to determine the best calibration model for measuring non-invasive blood glucose levels with the FR and FPCR. The results of this research showed that the FR model had a bigger coefficient determination (R2) value and lower Root Mean Square Error (RMSE) and Root Mean Square Error Prediction (RMSEP) value than the FPCR model, which was 12.9%, 5.417, and 5.727 respectively. Overall, the calibration modeling with the FR model is the best model for estimate blood glucose level compared to the FPCR model.
Improving Classification Model Performances using an Active Learning Method to Detect Hate Speech in Twitter: Peningkatan Kinerja Model Klasifikasi dengan Pembelajaran Aktif dalam Mendeteksi Ujaran Kebencian di Twitter Muhammad Ilham Abidin; Khairil Anwar Notodiputro; Bagus Sartono
Indonesian Journal of Statistics and Applications Vol 5 No 1 (2021)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v5i1p26-38

Abstract

Efforts from the police to address hate speech on social media such as Twitter will not be sufficient to rely solely on manual checks. Therefore, it is necessary to use statistical modelling like the classification model to detect hate speech automatically. Classification is a type of predictive modelling to produce accurate predictions based on labelled data. Generally, the available data are usually unlabelled implying that the labelling process needs to be done beforehand. Data labelling is time consuming, high cost, and often fails to produce correct labels. This research aims to improve the performances of classification models by adding a small amount of data through the so called active learning method. The results showed that there was no significant difference in the performances of logistic regression and naïve bayes classification models in detecting hate speech. However, the results also showed that adding data through the active learning method substantially improved the logistics regression performance in detecting hate speech when compared to data addition based on a simple random sampling method. Therefore, the performances of classification models in detecting hate speech on Twitter could be improved by using an active learning method.
ARFIMA Modelling for Tectonic Earthquakes in The Maluku Region: Pemodelan ARFIMA untuk Kejadian Gempa Bumi Tektonik di Wilayah Maluku Ferry Kondo Lembang; Lexy Janzen Sinay; Asrul Irfanullah
Indonesian Journal of Statistics and Applications Vol 5 No 1 (2021)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v5i1p39-49

Abstract

Maluku Province is one of the regions in Indonesia with a very active and very prone earthquake intensity because it is a meeting place for 3 (three) plates, namely the Eurasian, Pacific and Australian plates. In the last 100 years, the history of tectonic earthquakes with tsunamis that occurred in Indonesia was 25-30% occurring in the Maluku Sea and Banda Sea. Based on this fact, this study aims to analyze the incidence of tectonic earthquakes that occurred in the Maluku region and its surroundings using the Autoregressive Fractionally Integrated Moving Averages (ARFIMA) model which has the ability to explain long-term time series data (long memory). The results of the research data analysis show that the best model for predicting the number of tectonic earthquakes that occur in Maluku and its surroundings is ARFIMA (0; 0.712; 1) with an MSE value of 0.1156. Meanwhile, the best model for predicting the average magnitude of the number of tectonic earthquakes that occurred in Maluku and its surroundings is ARFIMA (0; -3,224 x 10-9; 1) with an MSE value of 0.01237. Based on the two best models, the prediction results obtained from the number of tectonic earthquakes and the average magnitude of the number of tectonic earthquakes that occurred in Maluku and its surroundings for the next three periods, namely the first period there were 31 tectonic earthquakes with an average magnitude of 4.38481 SR. the second period there were 32 tectonic earthquakes with an average magnitude of 4.38407, and the third period there were 32 tectonic earthquakes with an average magnitude of 4.38333.
Simulation Study of Robust Geographically Weighted Empirical Best Linear Unbiased Predictor on Small Area Estimation: Simulasi Metode Prediksi Tak Bias Linier Terbaik Empiris Terboboti Geografis Kekar pada Pendugaan Area Kecil Naima Rakhsyanda; Kusman Sadik; Indahwati Indahwati
Indonesian Journal of Statistics and Applications Vol 5 No 1 (2021)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v5i1p50-60

Abstract

Small area estimation can be used to predict the population parameter with small sample sizes. For some cases, the population units that are close spatially may be more related than units that are further apart. The use of spatial information like geographic coordinates are studied in this research. Outlier contaminations can affect small area estimations. This study was conducted using simulation methods on generated data with six scenarios. The scenarios are the combination of spatial effects (spatial stationary and spatial non-stationary) with outlier contamination (no outlier, symmetric outliers, and non-symmetric outliers). The purpose of this study was to compare the geographically weighted empirical best linear unbiased predictor (GWEBLUP) and robust GWEBLUP (RGWEBLUP) with direct estimator, EBLUP, and REBLUP using simulation data. The performance of the predictors is evaluated using relative root mean squared error (RRMSE). The simulation results showed that geographically weighted predictors have the smallest RRMSE values for scenarios with spatial non-stationary, therefore offer a better prediction. For scenarios with outliers, robust predictors with smaller RRMSE values offer more efficiency than non-robust predictors.
PERBANDINGAN MODEL GARCH SIMETRIS DAN ASIMETRIS PADA DATA KURS HARIAN Isna Shofia Mubarokah; Anwar Fitrianto; Farit M Affendi
Indonesian Journal of Statistics and Applications Vol 4 No 4 (2020)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v4i4.709

Abstract

ARCH and GARCH models are widely used in financial data to describe its volatility pattern. The models assume the positive and negative return residual gives the same or symmetric influence on its volatility. However, in reality, this assumption is frequently violated, which is called heteroscedasticity. Therefore, to deal with heteroscedasticity and asymmetric data, the asymmetric GARCH models, which are EGARCH and GJR-GARCH models are used. This research aims to compare the models between symmetric and asymmetric GARCH to make financial data modeling. It uses daily data on three foreign exchange rates for IDR including IDR/CNY, IDR/JPY, and IDR/USD. The data series to be used here are from January 4, 2016, to January 20, 2020. This research method is started by selecting the best mean model for each data. Based on the best mean model, then modeling the mean and variance function are simultaneously conducted using the GARCH model. To test whether there was an asymmetric effect on the data, a Lagrange multiplier test was applied on the residuals of the GARCH model. The results show that the asymmetric effect was found in the IDR/CNY and IDR/JPY exchange rates. To overcome this asymmetric effect, EGARCH and GJR-GARCH model were applied to the two exchange rates. Then the two models are compared to find out which volatility model is better. Using AIC and BIC we find EGARCH as the best model for IDR/CNY exchange rates daily return and GJR-GARCH as the best model for IDR/JPY exchange rates daily return.
The Model of Per-Capita Expenditure Figures in Sumatera Selatan uses a Geographically Weighted Panel Regression: Model Angka Pengeluaran Per-Kapita di Sumatera Selatan menggunakan Geographically Weighted Panel Regression Dia Cahya Wati; Dea Alvionita Azka; Herni Utami
Indonesian Journal of Statistics and Applications Vol 5 No 1 (2021)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v5i1p61-74

Abstract

The Geographically Weighted Panel Regression (GWPR) is a development of a global regression model where the basic idea is taken from a combination of panel data and GWR. The GWPR model is built from the point approach method, which is based on the position of the coordinates of latitude and longitude. The parameters for the regression model at each location will produce different values. GWPR can accommodate spatial effects, so that it can better explain the relationship between response variables and predictors. The purpose of this study is to compare the GWPR model with the Fixed Gaussian and Adaptive Bisquare weighting functions based on the AIC value. The data used in this study is secondary data taken from the website of the Central Statistics Agency (BPS) in the form of Per-Capita Expenditure Figures in South Sumatra in 2013-2019. This research results that in the case of the Per-Capita Expenditure Rate (AP), it is better to use the GWPR method with a fixed gaussian weighting function in the modeling, where the resulting coefficient of determination is 95.81% rather than adaptive bisquare with a determination coefficient of 93.3%. The factors that influence the Per-Capita Expenditure Rate (AP) in South Sumatra on the fixed gaussian weighting are divided into 6 groups, while the adaptive bisquare is divided into 2 groups.
ANALISIS JUMLAH KASUS MALARIA DI WILAYAH SUMATERA MENGGUNAKAN GEOGRAPHICALLY WEIGHTED ZERO-INFLATED POISSON REGRESSION (GWZIPR) Rahmat Kevin Praditia; Dian Agustina; Dyah Setyo Rini
Indonesian Journal of Statistics and Applications Vol 4 No 4 (2020)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v4i4.716

Abstract

A method that can be used if there is a spatial factor and if overdispersion happens in a count data is Geographically Weighted Zero-Inflated Poisson Regression (GWZIPR). This research aimed to analyze the number of malaria cases in every regency/city of Sumatra Land using the GWZIPR method and distribution mapping of factors affecting the number of malaria cases in Sumatra Land. Data involved in this research was the number of malaria cases as the response variable and the predictor variable as a percentage of households that have access to proper sanitation, a percentage of households that have access to proper water resources, and a percentage of the number of public health centers. The results were for each area which had distinctive models based on significant variables. The distribution mapping of factors affecting the number of malaria cases in every regency/city was commonly divided into three groups based on significant variables on ln and logit models. The mapping did not shape a spreading pattern or each regency/city in that group because the geographical locations were close to each other. GWZIPR method in this research was better than the ZIP Regression method because it produced the least AIC value.
PEMODELAN STATISTICAL DOWNSCALING DENGAN LASSO DAN GROUP LASSO UNTUK PENDUGAAN CURAH HUJAN M. Yunus; Asep Saefuddin; Agus M Soleh
Indonesian Journal of Statistics and Applications Vol 4 No 4 (2020)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v4i4.724

Abstract

One of the rainfall prediction techniques is the Statistical Downscaling Modeling (SDS). SDS modeling is one of the applications of modeling with covariates conditions that are generally large and not independent. The problems that will be encountered is the problem of ill-conditional data i.e multicollinearity and the high correlation between variables. The case of highly correlated data causes a linear regression coefficient estimators obtained to have a large variance. This research was conducted to make the statistical downscaling modeling using the lasso and group lasso for the prediction of rainfall. Group of the covariate scenario is applied based on the adjacent area, the high correlation between covariates and correlation between covariates and responses, and also the addition of dummy variables. Scenario six (grouping which is done by considering the covariates that have a positive correlation to the response is divided into 3 groups, 1 individual and the covariates that are negatively correlated with the response are divided into 2 groups, 1 individual) is better than the other scenarios in linear modeling without a dummy. Then, linear modeling with a dummy is better than without a dummy for both techniques. In linear modeling with a dummy, the Group lasso technique can be considered more in SDs modeling, because the difference in the RMSEP statistical value and the correlation coefficient value is significant.
Exploration of Obesity Status of Indonesia Basic Health Research 2013 With Synthetic Minority Over-Sampling Techniques: Eksplorasi Status Obesitas Riset Kesehatan Dasar 2013 Indonesia dengan Teknik Synthetic Minority Over-Sampling Sri Astuti Thamrin; Dian Sidik; Hedi Kuswanto; Armin Lawi; Ansariadi Ansariadi
Indonesian Journal of Statistics and Applications Vol 5 No 1 (2021)
Publisher : Departemen Statistika, IPB University dengan Forum Perguruan Tinggi Statistika (FORSTAT)

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29244/ijsa.v5i1p75-91

Abstract

The accuracy of the data class is very important in classification with a machine learning approach. The more accurate the existing data sets and classes, the better the output generated by machine learning. In fact, classification can experience imbalance class data in which each class does not have the same portion of the data set it has. The existence of data imbalance will affect the classification accuracy. One of the easiest ways to correct imbalanced data classes is to balance it. This study aims to explore the problem of data class imbalance in the medium case dataset and to address the imbalance of data classes as well. The Synthetic Minority Over-Sampling Technique (SMOTE) method is used to overcome the problem of class imbalance in obesity status in Indonesia 2013 Basic Health Research (RISKESDAS). The results show that the number of obese class (13.9%) and non-obese class (84.6%). This means that there is an imbalance in the data class with moderate criteria. Moreover, SMOTE with over-sampling 600% can improve the level of minor classes (obesity). As consequence, the classes of obesity status balanced. Therefore, SMOTE technique was better compared to without SMOTE in exploring the obesity status of Indonesia RISKESDAS 2013.

Page 10 of 20 | Total Record : 192