Toddlers are a vulnerable age group to various types of diseases due to their immune systems that are still developing. Limited utilization of medical record data and the lack of structured information regarding disease patterns in toddlers based on age and causative factors have resulted in suboptimal prevention and treatment efforts. Therefore, an approach is needed to systematically classify toddler disease data. This study aims to apply data mining techniques using the clustering method with the K-Means algorithm to group types of diseases in toddlers based on age and causative factors. The variables used in this study include toddler age, type of disease, and causative factors. The data were obtained from RSUD Dr. R. M. Djoelham Binjai and processed using MATLAB software with three clusters. The results show that the K-Means algorithm successfully groups toddler disease data into three clusters with different characteristics. The first cluster is dominated by toddlers aged 0–11 months with appendicitis caused by genetic factors. The second cluster is dominated by toddlers aged 1–3 years with diarrhea caused by environmental factors and has the largest number of members. Meanwhile, the third cluster is dominated by toddlers aged 0–11 months with sore throat caused by environmental factors. The clustering results indicate a relationship between toddler age, disease type, and causative factors, which can be used as supporting information for decision-making in the prevention and treatment of toddler diseases.
Copyrights © 2025