Claim Missing Document
Check
Articles

Found 1 Documents
Search

Comparative Analysis of Hierarchical Clustering and K-Medoids for Clustering Cases of Childhood Respiratory Diseases in Lamongan Regency Adelia Yuandhika; Nezalfa Sabrina; Cahya Eka Melati; Dwi Arman Prasetya; Prismahardi Aji Riyantoko
Jurnal Aplikasi Sains Data Vol. 2 No. 1 (2026): Journal of Data Science Applications.
Publisher : Program Studi Sains Data UPN "Veteran" Jawa Timur

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33005/jasid.v2i1.37

Abstract

Abstract— Respiratory diseases affecting children remain a significant health issue in Indonesia, including in Lamongan Regency. The region faces challenges related to pediatric respiratory illnesses, particularly Childhood Tuberculosis, Pneumonia in toddlers, and Cough in toddlers, which impact children's quality of life and development. Therefore, understanding the spatial distribution and correlation patterns among these diseases is essential to support more targeted health intervention planning. This study analyzes the distribution patterns of pediatric respiratory diseases in Lamongan Regency and clusters regions based on similarities in the number of cases using an unsupervised learning approach. The method employed is Hierarchical Clustering with four distance calculation techniques: single, complete, average, and ward linkage and K-Medoids with two distance calculation techniques: euclidean and manhattan distance. The data, sourced from the Lamongan District Health Office, include four numerical variables related to respiratory diseases, aggregated by sub-districts. Data normalization was carried out using standardization, and cluster quality was evaluated using three internal metrics: Silhouette Score, Davies-Bouldin Index (DBI), and Calinski-Harabasz Index (CHI). The analysis results indicate that the optimal number of clusters is three. Among all methods tested, the Hierarchical Clustering with ward linkage method yielded the best performance, with a Silhouette Score of 0.5447, a DBI of 0.5884, and a CHI of 20.3018. These results demonstrate that the ward linkage method is the most effective in clustering regions based on the characteristics of pediatric respiratory disease cases and can be used for mapping priority health intervention areas in Lamongan Regency.