p-Index From 2021 - 2026
7.537
P-Index
This Author published in this journals
All Journal FORUM STATISTIKA DAN KOMPUTASI Media Statistika Statistika JURNAL MATEMATIKA STATISTIKA DAN KOMPUTASI IPTEK The Journal for Technology and Science CAUCHY: Jurnal Matematika Murni dan Aplikasi Sosioinforma JUITA : Jurnal Informatika Jurnal Pengelolaan Sumberdaya Alam dan Lingkungan (Journal of Natural Resources and Environmental Management) International Journal of Advances in Intelligent Informatics Scientific Journal of Informatics JOIN (Jurnal Online Informatika) Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) Indonesian Journal of Applied Statistics Jurnal Penelitian Pertanian Tanaman Pangan BAREKENG: Jurnal Ilmu Matematika dan Terapan JOURNAL OF APPLIED INFORMATICS AND COMPUTING SINTECH (Science and Information Technology) Journal MIND (Multimedia Artificial Intelligent Networking Database) Journal JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika) Jurnal Aplikasi Statistika & Komputasi Statistik FIBONACCI: Jurnal Pendidikan Matematika dan Matematika Inferensi International Journal of Advances in Data and Information Systems InPrime: Indonesian Journal Of Pure And Applied Mathematics ESTIMASI: Journal of Statistics and Its Application Majalah Ilmiah Matematika dan Statistika (MIMS) Jurnal Lebesgue : Jurnal Ilmiah Pendidikan Matematika, Matematika dan Statistika Journal of Applied Data Sciences Enthusiastic : International Journal of Applied Statistics and Data Science Prosiding Seminar Nasional Official Statistics Jurnal Natural Eduvest - Journal of Universal Studies Xplore: Journal of Statistics PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND OFFICIAL STATISTICS Parameter: Jurnal Matematika, Statistika dan Terapannya Scientific Journal of Informatics Journal of Mathematics, Computation and Statistics (JMATHCOS) Advance Sustainable Science, Engineering and Technology (ASSET) Indonesian Journal of Statistics and Its Applications Journal on Mathematics Education
Claim Missing Document
Check
Articles

Evaluation of Imputation Methods for Clustering Categorical Time Series on Financial Sector Stock Data Rita Rahmawati; I Made Sumertajaya; Asep Saefuddin; Kusman Sadik
Journal of Mathematics, Computations and Statistics Vol. 9 No. 2 (2026): Volume 09 Issue 02 (June 2026)
Publisher : Jurusan Matematika FMIPA UNM

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.35580/t5pe8v78

Abstract

Missing values in financial time series data can affect the information structure of the data and impact the clustering results obtained. This research aims to evaluate the performance of several time series data imputation methods on the quality of categorical time series clustering on financial sector stock data on the Indonesia Stock Exchange. The imputation methods compared include linear interpolation, spline interpolation, and Kalman smoothing. The research data is in the form of daily closing prices of 92 financial sector stocks for the period 2 January 2023 to 31 October 2025. Numerical clustering was carried out using K-Means Time Series based on Dynamic Time Warping (DTW), while categorical clustering was carried out using K-Medoids with the Gower distance measure in two categorization schemes, namely five and seven categories. Evaluation of suitability between numerical and categorical clustering was carried out using the Rand Index (RI), Fowlkes–Mallows Index (FMI), and Jaccard Index. The research results show that the imputation method produces different clustering qualities. Linear interpolation provides the best and most consistent performance compared to other methods, especially in the seven-category scheme with an RI value of 0.6417, FMI of 0.4338, and Jaccard Index of 0.3256. These results show that linear interpolation is better able to maintain the information structure of the data in the categorical time series clustering process compared to spline interpolation or Kalman smoothing.
Bayesian Neural Network untuk Prediksi Diabetes: Uncertainty Quantification dalam Machine Learning Kamila, Sabrina Adnin; Sadik, Kusman; Suhaeni, Cici; Soleh, Agus Mohamad
Indonesian Journal of Applied Statistics Vol 9, No 1 (2026)
Publisher : Universitas Sebelas Maret

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.13057/ijas.v9i1.103994

Abstract

Penelitian ini bertujuan mengevaluasi dan membandingkan kinerja tiga model machine learning, yaitu random forest (RF), feedforward neural network (FNN), dan bayesian neural network (BNN), dalam klasifikasi diabetes menggunakan Diabetes Health Indicators Dataset dari UCI Machine Learning Repository yang memiliki ketidakseimbangan kelas. Prapemrosesan data meliputi normalisasi fitur menggunakan StandardScaler dan penanganan ketidakseimbangan kelas dengan synthetic minority over-sampling technique (SMOTE). Evaluasi model dilakukan menggunakan metrik akurasi dan skor F1, yang didukung oleh classification report dan confusion matrix. Hasil evaluasi menunjukkan bahwa RF menghasilkan akurasi tinggi (0,8493) namun skor F1 yang rendah (0,3386), yang mengindikasikan rendahnya sensitivitas model terhadap kasus positif diabetes. FNN memberikan performa yang lebih seimbang dengan skor F1 sebesar 0,4490 setelah penyesuaian threshold optimal. Sementara itu, BNN mencapai akurasi 0,8498 dan skor F1 sebesar 0,4043, serta memiliki keunggulan tambahan berupa kemampuan mengukur ketidakpastian prediksi melalui pendekatan Monte Carlo Dropout. Dengan demikian, FNN lebih unggul dalam keseimbangan klasifikasi, sementara BNN lebih relevan untuk aplikasi medis yang membutuhkan informasi tingkat kepercayaan prediksi guna mendukung pengambilan keputusan klinis yang lebih andal.This study aims to evaluate and compare the performance of three machine learning models, namely random forest (RF), feedforward neural network (FNN), and bayesian neural network (BNN), for diabetes classification using the Diabetes Health Indicators Dataset from the UCI Machine Learning Repository, which exhibits significant class imbalance. Data preprocessing includes feature normalization using StandardScaler and class imbalance handling through synthetic minority over-sampling technique (SMOTE). Model performance is evaluated using accuracy and F1-score metrics, supported by classification report and confusion matrix analysis. The results show that RF achieves high accuracy (0.8493) but a low F1-score (0.3386), indicating poor sensitivity to positive diabetes cases. FNN provides more balanced performance with an F1-score of 0.4490 after optimal threshold adjustment. Meanwhile, BNN achieves an accuracy of 0.8498 and F1-score of 0.4043, while offering the additional advantage of uncertainty quantification through Monte Carlo Dropout. Therefore, FNN is more effective for balanced classification performance, while BNN is more suitable for medical applications that require prediction confidence information to support more reliable and informed clinical decision-making.Kata Kunci: Prediksi diabetes, kuantifikasi ketidakpastian, bayesian neural network, classification imbalance, machine learning.Keywords: Diabetes prediction, uncertainty quantification, bayesian neural network, classification imbalance, machine learning.
Deteksi Polycystic Ovary Syndrome (PCOS) Berbasis Machine Learning: Kombinasi SMOTE, Random Forest, Gradient Boosting, dan Bayesian Optimization Alfiryal, Naufalia; Sadik, Kusman; Suhaeni, Cici; Soleh, Agus Mohamad
Indonesian Journal of Applied Statistics Vol 8, No 2 (2025)
Publisher : Universitas Sebelas Maret

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.13057/ijas.v8i2.109931

Abstract

Polycystic ovary syndrome (PCOS) merupakan gangguan endokrin yang umum terjadi pada wanita usia reproduktif. Kondisi ini dapat menyebabkan gangguan ovulasi, ketidakseimbangan hormon, resistensi insulin, serta meningkatkan risiko penyakit kardiovaskular, obesitas, dan gangguan psikologis. Meskipun prevalensinya cukup tinggi, sekitar 75% kasus PCOS masih belum terdiagnosis dalam praktik klinis akibat kompleksitas gejala dan keterbatasan metode diagnosis yang digunakan saat ini. Untuk mengatasi permasalahan tersebut, penelitian ini mengusulkan pendekatan berbasis machine learning guna meningkatkan akurasi dan efisiensi deteksi PCOS. Penelitian ini membandingkan performa dua algoritma pembelajaran terawasi, yaitu random forest dan gradient boosting, dalam melakukan prediksi PCOS. Dataset yang digunakan diperoleh dari repositori publik dan memuat berbagai fitur klinis yang berkaitan dengan PCOS. Untuk menangani permasalahan ketidakseimbangan kelas, metode synthetic minority over-sampling technique (SMOTE) diterapkan pada data pelatihan. Selain itu, bayesian optimization digunakan untuk melakukan penyetelan hiperparameter pada masing-masing model agar diperoleh performa yang optimal. Evaluasi performa model dilakukan menggunakan beberapa metrik, dengan area under the curve–receiver operating characteristic (AUC-ROC) sebagai metrik utama. Hasil penelitian menunjukkan bahwa model Gradient Boosting memberikan performa terbaik dengan nilai AUC sebesar 0,8983 dan nilai recall sebesar 0,95, yang mengindikasikan sensitivitas tinggi dalam mengidentifikasi kasus PCOS. Temuan ini menunjukkan bahwa kombinasi SMOTE dan bayesian optimization efektif dalam meningkatkan akurasi prediksi, khususnya pada dataset medis yang tidak seimbang. Pendekatan yang diusulkan memiliki potensi untuk diintegrasikan ke dalam sistem pendukung keputusan klinis guna mendukung proses skrining PCOS yang lebih dini dan andal.Polycystic ovary syndrome (PCOS) is a common endocrine disorder among reproductive-aged women. This condition can lead to ovulatory dysfunction, hormonal imbalance, insulin resistance, and an increased risk of cardiovascular disease, obesity, and psychological disorders. Despite its high prevalence, approximately 75% of PCOS cases remain undiagnosed in clinical settings due to the complexity of symptoms and limitations of current diagnostic methods. To address this issue, a machine learning-based approach is proposed to improve the accuracy and efficiency of PCOS detection. This study compares the performance of two supervised learning algorithms random forest and gradient boosting for PCOS prediction. The dataset used was obtained from a public repository and contains various clinical features associated with PCOS. To address the class imbalance problem, the synthetic minority over-sampling technique (SMOTE) was applied to the training data. Additionally, bayesian optimization was employed to fine-tune the hyperparameters of each model for optimal performance. Model performance was evaluated using several metrics, with the area under the curve–receiver operating characteristic (AUC-ROC) as the primary measure. The Gradient Boosting model achieved the best results, with an AUC of 0.8983 and a recall of 0.95, indicating high sensitivity in identifying positive PCOS cases. These findings demonstrate that the combination of SMOTE and Bayesian Optimization is effective in enhancing predictive accuracy, especially in imbalanced medical datasets. The proposed approach shows promise for integration into clinical decision-support systems to facilitate earlier and more reliable PCOS screening.Kata Kunci: Bayesian optimization; gradient boosting; PCOS; random forest; SMOTE.Keywords : Bayesian optimization; gradient boosting; PCOS; random forest; SMOTE.
Co-Authors . Erfiani . Indahwati A.Tuti Rumiati Aam Alamudi Abdullah, Adib Roisilmi Achmad Fauzan Agus Mohamad Soleh Ahmad Rifai Nasution Aji Hamim Wigena Akbar Rizki Akbar Rizki Akmala Firdausi Alfiryal, Naufalia Amalia, Rahmatin Nur Anadra, Rahmi Ananda Shafira Anang Kurnia Andespa, Reyuli Andi Okta Fengki ASEP SAEFUDDIN Astari, Reka Agustia Astari, Reka Agustia Aulya Permatasari Azka Ubaidillah Bagus Sartono Budi Susetyo Budi Susetyo Cici Suhaeni Cici Suhaeni Dian Handayani Dito, Gerry Alfa Dwi Agustin Nuriani Sirodj Efriwati Efriwati Embay Rohaeti Eminita, Viarti EVITA PURNANINGRUM Fahira, Fani FARDILLA RAHMAWATI Farit Mochamad Afendi Fitrianto, Anwar Freya, Wa Ode Rona Gerry Alfa Dito Haikal, Husnul Aris Hari Wijayanto Hasnataeni, Yunia Hazan Azhari Zainuddin Hermawati, Neni I Gusti Ngurah, Sentana Putra I Made Sumertajaya I Wayan Mangku Indahwati Indahwati Indahwati Intan Arassah, Fradha Iqbal, Teuku Achmad Isnanda, Eriski Kamila, Sabrina Adnin Khairi A N Khairil Anwar Notodiputro Khikmah, Khusnia Nurul khusnul khotimah Khusnul Khotimah Kusni Rohani Rumahorbo Latifah, Leli Lili Puspita Rahayu Logananta Puja Kusuma M Soleh, Agus Mochamad Ridwan Mochamad Ridwan, Mochamad Mohammad Masjkur Muh Nur Fiqri Adham Muhammad Yusran Mulianto Raharjo Naima Rakhsyanda Nisrina Az-Zahra, Putri Nur Khamidah NURADILLA, SITI Nusar Hajarisman Pangestika, Dhita Elsha Parwati Sofan, Parwati Purnama Sari Rakhsyanda, Naima Rifqi Aulya Rahman Rita Rahmawati Rizaldi Boer Rizki, Akbar Rizqi, Tasya Anisah ROCHYATI ROCHYATI Rumahorbo, Kusni Rohani Sahamony, Nur Fitriyani Saleh, Agus Muhammad Satriyo Wibowo Sentana Putra, I Gusti Ngurah Siregar, Jodi jhouranda Siti Aisyah Siti Raudlah Sitti Nurhaliza Soleh, Agus M Suhaeni, Cici Sundari, Marta Supriatin, Febriyani Eka Tendi Ferdian Diputra Titin Suhartini Titin Suhartini, Titin Tri Wahyuni Uswatun Hasanah Utami Dyah Syafitri Viarti Eminita Widhiyanti Nugraheni Yenni Angraini Yenni Kurniawati Yuli Eka Putri Zafira Fakhriyah