Floods in urban Indonesia pose severe environmental and public health challenges, exacerbating water contamination, vector proliferation, and disease outbreaks. Rapid urbanization, inadequate drainage systems, and climate change have intensified these impacts, emphasizing the need for integrated predictive frameworks. This study aims to develop a Data Mining (DM)-based modeling approach that combines environmental and health indicators to predict flood-related disease risks. Random Forest (RF) and Artificial Neural Network (ANN) algorithms were applied to multi-domain datasets from 30 flood-prone urban sub-districts between 2018 and 2023, encompassing rainfall, drainage density, land use, and water quality variables, integrated with disease incidence data such as diarrhea, dengue, and leptospirosis. The ANN model achieved superior predictive performance (93% accuracy, AUC 0.93) compared to RF (90% accuracy, AUC 0.90), identifying rainfall intensity, drainage density, and coliform contamination as the most influential predictors. These results demonstrate the capability of AI-driven DM techniques to capture complex interdependencies between environmental and health systems. The developed framework contributes to the field of informatics by providing a scalable, data-driven early warning tool for flood-related health risks, supporting evidence-based decision-making in disaster risk management and enhancing public health resilience in rapidly urbanizing regions.
Copyrights © 2025