Claim Missing Document
Check
Articles

Found 20 Documents
Search

ESTIMASI REGRESI ROBUST M PADA FAKTORIAL RANCANGAN ACAK LENGKAP YANG MENGANDUNG OUTLIER Siswanto Siswanto; Raupong Raupong; Anisa Anisa
Jurnal Matematika, Statistika dan Komputasi Vol. 13 No. 2 (2017): January 2017
Publisher : Department of Mathematics, Hasanuddin University

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (490.729 KB) | DOI: 10.20956/jmsk.v13i2.3505

Abstract

Dalam statistik, melakukan suatu percobaan adalah salah satu cara untuk mendapatkan suatu data. Pada perancangan  percobaan ada beberapa hal yang membuat suatu data menjadi outlier, yaitu  tidak berhasilnya suatu  pengamatan  pada salah satu unit percobaan atau  kesalahan dalam  pengambilan  pengamatan. Pada  perancangan percobaan  tidak berhasilnya suatu pengamatan pada unit percobaan biasanya disebut data hilang. Dengan kata lain data hilang tersebut biasanya disebut outlier. Jika terjadi outlier atau  suatu  pengamatan  gagal maka nilai datanya harus ditaksir atau melakukan percobaan ulang. Pada penelitian ini, untuk mengatasi outlier  pada faktorial Rancangan Acak Lengkap (RAL) digunakan estimasi regresi robust M sehingga diperoleh  nilai penduga baru yang resistant terhadap outlier.
Determination of Fractional Chromatic Numbers in the Operation of Adding Two Different Graphs: Penentuan Bilangan Kromatik Fraksional pada Operasi Penjumlahan Dua Graf berbeda Junianto Sesa; Siswanto Siswanto
Jurnal Matematika, Statistika dan Komputasi Vol. 18 No. 2 (2022): JANUARY 2022
Publisher : Department of Mathematics, Hasanuddin University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20956/j.v18i2.14501

Abstract

The development of graph theory has provided many new pieces of knowledge, one of them is graph color. Where the application is spread in various fields such as the coding index theory. Fractional coloring is multiple coloring at points with different colors where the adjoining point has a different color. The operation in the graph is known as the sum operation. Point coloring can be applied to graphs where the result of operations is from several special graphs. In this case, the graph summation results of the path graph and the cycle graph will produce the same fractional chromatic number as the sum of the fractional chromatic numbers of each graph before it is operated.
The Sentiment Analysis Using Naïve Bayes with Lexicon-Based Feature on TikTok Application Siswanto Siswanto; Zakiyah Mar'ah; Alfiyah Salsa Dila Sabir; Taufik Hidayat; Fadilah Amirul Adhel; Waode Sitti Amni
Jurnal Varian Vol 6 No 1 (2022)
Publisher : Universitas Bumigora

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30812/varian.v6i1.2205

Abstract

On TikTok application, there are several types of content in the form of education, cooking recipes, comedy, various tips, beauty, business, etc. However, some non-educational contents sometimes appear on TikTok homepage even though minors can access the app. As a result, TikTok application can influence the behavior of minors to be disgraceful, therefore, an assessment of the application can be one of the objects for conducting sentiment analysis. The purpose of this study is to compare the results of sentiment analysis on TikTok application using Naïve Bayes with Lexicon-Based and without Lexicon-Based features. We used the TikTok reviews on Google Play Store as our data. According to the analysis, without Lexicon-Based feature, we obtained the accuracy rate, precision rate, and recall rate of 83%, 78%, and 69%, respectively. Meanwhile, the accuracy, precision, and recall rates using the Lexicon-Based feature were 85%, 91%, and 93%, respectively. Therefore, we concluded that sentiment analysis using Naïve Bayes with Lexicon-Based feature was better than without Lexicon-Based feature on TikTok reviews.
ANALISIS SENTIMEN MASYARAKAT PADA KEBIJAKAN VAKSINASI COVID-19 DI TWITTER MENGGUNAKAN METODE MESIN VEKTOR PENDUKUNG DENGAN KERNEL RADIAL BASIS FUNCTION BERBASIS FITUR LEKSIKON Sri Mulyani; Sri Astuti Thamrin; Siswanto Siswanto
Jambura Journal of Probability and Statistics Vol 3, No 2 (2022): Jambura Journal Of Probability and Statistics
Publisher : Department of Mathematics, Universitas Negeri Gorontalo

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34312/jjps.v3i2.16663

Abstract

Twitter is one of the popular social media used to get news quickly and briefly. After the outbreak of the COVID-19 virus and the government's policy to vaccinate against COVID-19 in Indonesia, more and more public opinion has been expressed through tweets. This makes the topic of COVID-19 vaccination interesting for sentiment analysis. Through sentiment analysis, information in the form of text data can be extracted to classify information related to positive or negative opinions. In this study, the classification of public opinion on COVID-19 vaccination was carried out using the supporting vector machine method without and with lexicon-based features. The manual labeling data used were 2981 tweets. The results of the classification of public opinion on COVID-19 vaccination in Indonesia with a supporting vector machine without the lexicon feature obtained accuracy, g-mean and AUC of 83%, 50% and 61.35%, respectively. Meanwhile, with lexicon-based features, the performance of the supporting vector machine method for classifying public opinion on COVID-19 vaccination in Indonesia obtained accuracy, g-mean and AUC of 90%, 86.63% and 87%, respectively. Based on these results, the performance of the supporting vector machine method with lexicon-based features provides better results for the performance of classifying of public opinion on COVID-19 vaccination compared to supporting vector machines without lexicon-based features.
Comparison of Variance Covariance and Historical Simulation Methods to Calculate Value At Risk on Banking Stock Portfolio Maria Yus Trinity Irsan; Evelyn Priscilla; Siswanto Siswanto
Jurnal Matematika, Statistika dan Komputasi Vol. 19 No. 1 (2022): SEPTEMBER, 2022
Publisher : Department of Mathematics, Hasanuddin University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20956/j.v19i1.21436

Abstract

In investing, all investors must be faced with risk that must be borne. Therefore, to determine the best strategy in investing, every investor must calculate the risk. One statistical approach that can be used to measure the risk is Value at Risk (VaR). VaR is defined as a tolerable loss with a certain level of confidence. The purpose of this research is to estimate VaR using Variance Covariance and Historical Simulation methods on banking stock portfolio consisting of three stocks for the period 11 September 2020-30 September 2021. Both methods will then be evaluated using backtesting to determine the accuracy of VaR and to obtain the best method. From the research results, if the holding period is 1 day, then the VaR calculation for banking stock portfolio using both methods can be used to estimate the risk at 99% and 95% confidence levels, except for the VaR value using the Variance Covariance method for banking stock portfolio at 95% confidence level. The results show that Variance Covariance method is the best method for 99% confidence level. As for the 95% confidence level, Historical Simulation method is the best method.
Identification of Factors that Influence Stunting Cases in South Sulawesi using Geographically Weighted Regression Modeling Siswanto Siswanto; Mirna Mirna; Muhammad Yusran; Ummul Auliyah Syam; Alya Safira Irtiqa Miolo
Jurnal Matematika, Statistika dan Komputasi Vol. 19 No. 1 (2022): SEPTEMBER, 2022
Publisher : Department of Mathematics, Hasanuddin University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20956/j.v19i1.21617

Abstract

In Indonesia, nearly seven million children under five are stunted and throughout the world, Indonesia is the country with the fifth-highest stunting prevalence. South Sulawesi ranks fourth with a high stunting potential in Indonesia. Stunting is caused by multi-dimensional factors and not only due to malnutrition experienced by pregnant women and children under five. In more detail, several factors that cause stunting are the effects of poor care, the lack of household/family access to nutritious food, and the lack of access to clean water and sanitation. In addition to maternal characteristics and parenting, the problem of stunting is also influenced by environmental factors and geographical conditions (population density, climatic conditions, and inadequate sanitation) so the spatial analysis is important to do in overcoming this problem. In spatial data, often observations at a location (space) depend on observations at other locations that are nearby (neighboring). By using Geographically Weighted Regression (GWR) obtained variables that affect the prevalence of stunting in South Sulawesi Province, including the percentage of babies receiving vitamin A intake, the percentage of babies receiving exclusive breastfeeding, the percentage of babies receiving health care, the percentage of malnourished children under five, the percentage short toddlers, the percentage of infants receiving DPT-HB-Hib, Measles and BCG immunizations.  for the GWR model is 81.32% and based on variables that are significant to the prevalence of stunting in South Sulawesi Province, three clusters are formed.
Statistical Downscaling Menggunakan Pengelompokan Expectation-Maximization pada Data CFSRv2 Rizka Pitri; Ayu Sofia; Siswanto
Jurnal Statistika dan Aplikasinya Vol 6 No 2 (2022): Jurnal Statistika dan Aplikasinya
Publisher : Program Studi Statistika FMIPA UNJ

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.21009/JSA.06203

Abstract

General Circulation Model (GCM) is a numerical model that produce a number of data from various climate parameters that are used to estimate climate, one of which is precipitation. GCM is a global scale data and has high resolution. So, the GCM cannot consider the local-scale areas with a higher resolution than the GCM. Therefore, to be able to use the GCM for estimating the local rainfall, namely Statistical Downscaling (SD). SD is a technique used to get the relationship between global-scale (GCM) and local-scale data. SD use the GCM which consist of dependent variables that have multicollinearity. So in this research, the principal component regression (PCR) and partial least square regression (PLSR) will be used to reduce the multicollinearity. In addition, to reduce the RMSEP and increase the correlation value, a clustering technique will be applied before modeling, namely Expectation-Maximization (EM) clustering. This research use CFSRv2 data as GCM and local rainfall data at four rainfall stations in West Java (January 2011 to December 2017). Based on this research, PCR is a good modeling than PLSR and EM clustering get the lower RMSEP and higher correlation value than without clustering before modeling. The conclusion is PCR with EM clustering is a good method for estimating local rainfall using the SD technique especially rainfall in West Java and CFSRv2 data.
Aplikasi Model Autoregressive Conditional Heteroscedastic-Generalized Auto Autoregressive Conditional Heteroscedastic pada Data Return Saham Bank Syariah Indonesia Zulfanita Dien R; Siswanto
ESTIMASI: Journal of Statistics and Its Application Vol. 4, No. 1, Januari, 2023 : Estimasi
Publisher : Hasanuddin University

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.20956/ejsa.vi.24799

Abstract

The increase of the financial sector, financial information is used in the economy to model and predict the movement of capital market stocks, so investors can easily understand investment risks. Financial sector data is in the form of time series data. Financial data  is found that does not fit the assumption of heteroscedasticity, so a model is needed that can maintain heteroscedasticity. Model Autoregressive Conditional Heteroscedasticity-Generalized Autoregressive Conditional Heteroscedastic is one of the econometric models used to model heteroscedasticity data in time series. The data in this study is BSI's daily closing price data taken from 4 January 2021 to 31 August 2022 with 406 data. Based on the selection of a time series model on Bank Syariah Indonesia (BSI), the best models are ARMA (11.0) and ARCH models (1). So that the ARMA (11.0)-ARCH (1) model can be the best model for modeling and predicting BSI stock return prices.
Robust Spatial-Temporal Analysis of Toddler Pneumonia Cases and its Influencing Factors Musdalifah Musdalifah; Siswanto Siswanto; Nirwan Ilyas
Jurnal Varian Vol 6 No 2 (2023)
Publisher : Universitas Bumigora

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.30812/varian.v6i2.2599

Abstract

Pneumonia is a disease that causes inflammation of the lungs and is one of the most common diseases infecting toddlers. As a directly infectious disease, there is a possibility of the influence of location diversity on the number of pneumonia sufferers. Robust Geographically and Temporally Weighted Regression (RGTWR) is a method used to model data by considering the heterogeneity of location and time and to overcome outliers in the data. The data used is the number of pneumonia sufferers aged under five and the factors that are thought to influence it, namely the number of health centers, population density, percentage of children under five with complete basic immunizations, percentage of children under five who are exclusively breastfed 0-6 months, and percentage of poor people. This study was conducted to model pneumonia sufferers under five and to find out the factors that significantly affect the number of sufferers in each observation. RGTWR produces an optimal model with an R2 value of 99.9997%, a Mean Absolute Deviation of 21.6852, and a Median Absolute Deviation of 6.9661 compared to the Geographically and Temporally Weighted Regression model. Variables number of puskesmas, percentage of infants with complete basic immunization, and percentage of poor population are factors that influence the number of pneumonia sufferers under five in most locations in 34 provinces and 5 years of observation.
Application of Resampling and Boosting Methods Using the C5.0 Algorithm : Case Study Indonesia Family Survey Data Hedi Kuswanto; Nurtiti Sunusi; Siswanto Siswanto; Nirwan Nirwan
Proceedings of The International Conference on Data Science and Official Statistics Vol. 2021 No. 1 (2021): Proceedings of 2021 International Conference on Data Science and Official St
Publisher : Politeknik Statistika STIS

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34123/icdsos.v2021i1.198

Abstract

Hypertension is a non-communicable disease that is characterized by an increase in systolic and diastolic blood pressure of more than 140 mmHg and or 90 mmHg. Hypertension needs to get more attention the condition is because hypertension will cause complications in the target organs and this disease does not appear to show significant symptoms at the beginning of the disease because it is called "silent disease". The study discusses the integration method of resampling and boosting in predicting hypertension status using the C5.0 algorithm. Classification of the C5.0 Algorithm by applying to resample increases performance specificity and AUC. Random oversampling (ROS) increased the specificity by 95.67% and AUC increased by 91.11%. Random over-under sampling (ROUS) increased specificity by 88.84% and AUC increased by 87.13%. In addition, applying boosting to the C5.0 algorithm that has been reapplied increases the accuracy performance. Random oversampling (ROS) increased accuracy by 93.86% and random over-under sampling (ROUS) increased accuracy by 89.98%. The response variables that contributed the most were high cholesterol and heart problems. The application of resampling and boosting to the contribution of high cholesterol and heart problems always topped the list.