Articles

Found 2 Documents
Journal: Journal of Mathematics, Computations and Statistics (JMATHCOS)

Effect of Feature Normalization and Distance Metrics on K-Nearest Neighbors Performance for Diabetes Disease Classification
Yusran, Muhammad; Sadik, Kusman; Soleh, Agus M; Suhaeni, Cici
Journal of Mathematics, Computations and Statistics Vol. 8 No. 2 (2025): Volume 08 Number 02 (October 2025)
Publisher: Jurusan Matematika FMIPA UNM

DOI: 10.35580/jmathcos.v8i2.8012

Abstract

Diabetes is a global health issue with a steadily increasing prevalence each year. Early detection of the disease is an important step in preventing severe complications. The K-Nearest Neighbors (KNN) algorithm is often used in disease classification, but its performance is strongly influenced by the choice of normalization method and distance metric. This study evaluates the effect of various normalization methods and distance metrics on the performance of the KNN algorithm in diabetes disease classification. Three normalization methods were employed: z-score normalization, min-max scaling, and median absolute deviation (MAD). In addition, seven distance metrics were assessed: Euclidean, Manhattan, Chebyshev, Canberra, Hassanat, Lorentzian, and Clark. The dataset used is the Pima Indians Diabetes dataset, which consists of 768 observations and 8 features. The data were split into 80% training data and 20% test data, and 5-fold cross-validation was used to determine the optimal k value. The results show that the MAD-Canberra combination produces the highest overall accuracy, recall, and F1-score of 87.32%, 82.33%, and 81.94%, respectively. The highest precision was obtained from the Baseline-Hassanat combination at 86.96%, while the lowest performance was observed for the Z-Score-Chebyshev combination with an F1-score of 58.02%. These results highlight that no single combination universally outperforms the others, underscoring the need for empirical evaluation. Nonetheless, combining MAD normalization with metrics such as Canberra or Hassanat can serve as a strong starting point for developing KNN-based classification systems, especially in medical contexts that are sensitive to misclassification.
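
As a rough illustration of the pipeline this abstract describes, the sketch below combines MAD normalization with a Canberra-distance KNN and uses 5-fold cross-validation to select k, written in Python with scikit-learn. The CSV file name and column layout are assumptions (a standard local copy of the Pima Indians Diabetes dataset), not details taken from the paper.

```python
# Minimal sketch: MAD normalization + Canberra-distance KNN, 5-fold CV over k.
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import classification_report

df = pd.read_csv("pima_indians_diabetes.csv")        # assumed local copy of the dataset
X, y = df.drop(columns="Outcome").values, df["Outcome"].values

# 80/20 train-test split, as in the study
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# MAD normalization: (x - median) / MAD, fitted on the training data only
med = np.median(X_train, axis=0)
mad = np.median(np.abs(X_train - med), axis=0)
mad[mad == 0] = 1.0                                   # guard against zero MAD
X_train_n = (X_train - med) / mad
X_test_n = (X_test - med) / mad

# 5-fold cross-validation to choose k, with the Canberra distance metric
grid = GridSearchCV(
    KNeighborsClassifier(metric="canberra"),
    param_grid={"n_neighbors": range(1, 32, 2)},
    cv=5, scoring="f1",
)
grid.fit(X_train_n, y_train)
print("best k:", grid.best_params_["n_neighbors"])
print(classification_report(y_test, grid.predict(X_test_n)))
```

The same loop can be repeated over the other normalization methods and distance metrics (z-score, min-max; Euclidean, Manhattan, Chebyshev, Hassanat, Lorentzian, Clark) to reproduce the kind of combination-by-combination comparison the study reports.
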
Analysis and Optimization of Rainfall Prediction in Makassar City Using Artificial Neural Networks Based on Data Augmentation, Regularization, and Bayesian Optimization
Abdullah, Adib Roisilmi; Sadik, Kusman; Suhaeni, Cici; Saleh, Agus Muhammad
Journal of Mathematics, Computations and Statistics Vol. 8 No. 2 (2025): Volume 08 Number 02 (October 2025)
Publisher: Jurusan Matematika FMIPA UNM

DOI: 10.35580/jmathcos.v8i2.8304

Abstract

This study develops a robust and efficient rainfall prediction model using an Artificial Neural Network (ANN), enhanced through integrated data augmentation, regularization, and Bayesian optimization techniques. We used a dataset of 118 monthly rainfall records from Makassar City, spanning 2014–2022, sourced from the Meteorological, Climatological, and Geophysical Agency (BMKG). To capture the inherent temporal patterns, lag features (specifically the lag-1, lag-3, and lag-6 rainfall values) were constructed as input variables. Min-Max normalization was then applied to all features, ensuring input consistency and supporting the ANN's learning process. An initial manual grid search identified the most effective baseline ANN architecture, featuring four hidden layers ([128, 32, 16, 64] neurons), a tanh activation function, and a learning rate of 0.01. While the baseline ANN model achieved a strong initial performance with an RMSE of 0.1608, further experiments examined the benefits of a fully integrated approach. This model, which combined data augmentation (to address data limitations and enhance generalization), regularization (to mitigate overfitting), and Bayesian optimization (for efficient hyperparameter tuning), demonstrated improved generalization capabilities and enhanced model stability. The integrated model yielded an RMSE of 0.1861, an MSE of 0.0346, and an MAE of 0.1359. These findings underscore that integrated optimization strategies are important for developing more robust and reliable ANN-based rainfall prediction models, particularly for critical applications in climate-based time-series forecasting.
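
The sketch below illustrates the baseline setup this abstract describes: lag-1/3/6 input features, Min-Max scaling, and a four-hidden-layer tanh network ([128, 32, 16, 64] neurons) with a 0.01 learning rate, here approximated with scikit-learn's MLPRegressor. The file name and column name are assumptions, and the study's data augmentation, regularization, and Bayesian optimization steps are not reproduced.

```python
# Minimal sketch of the baseline ANN: lag features, Min-Max scaling, tanh network.
import numpy as np
import pandas as pd
from sklearn.preprocessing import MinMaxScaler
from sklearn.neural_network import MLPRegressor
from sklearn.metrics import mean_squared_error, mean_absolute_error

rain = pd.read_csv("makassar_monthly_rainfall.csv")["rainfall"]   # assumed monthly series

# Build lag-1, lag-3, and lag-6 rainfall values as input variables
data = pd.DataFrame({
    "lag1": rain.shift(1),
    "lag3": rain.shift(3),
    "lag6": rain.shift(6),
    "target": rain,
}).dropna()

# Chronological split (no shuffling for a time series), then Min-Max scaling
split = int(len(data) * 0.8)
train, test = data.iloc[:split], data.iloc[split:]
x_scaler, y_scaler = MinMaxScaler(), MinMaxScaler()
X_train = x_scaler.fit_transform(train[["lag1", "lag3", "lag6"]])
X_test = x_scaler.transform(test[["lag1", "lag3", "lag6"]])
y_train = y_scaler.fit_transform(train[["target"]]).ravel()
y_test = y_scaler.transform(test[["target"]]).ravel()

# Baseline architecture from the abstract: four hidden layers, tanh, learning rate 0.01
ann = MLPRegressor(hidden_layer_sizes=(128, 32, 16, 64), activation="tanh",
                   learning_rate_init=0.01, max_iter=2000, random_state=42)
ann.fit(X_train, y_train)

pred = ann.predict(X_test)
rmse = np.sqrt(mean_squared_error(y_test, pred))
print(f"RMSE={rmse:.4f}  MSE={mean_squared_error(y_test, pred):.4f}  "
      f"MAE={mean_absolute_error(y_test, pred):.4f}")
```

The error metrics are computed on the Min-Max-scaled target, matching the scale on which the abstract's RMSE, MSE, and MAE values appear to be reported.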