Diabetes Mellitus is a complex and progressive chronic metabolic disorder that requires a personalized management strategy tailored to each individual’s clinical, physiological, and lifestyle characteristics. Addressing this challenge, the present study aims to apply the K-Means algorithm to identify clustering patterns among diabetic patients using the Knowledge Discovery in Databases (KDD) framework. The dataset was obtained from the Kaggle repository, consisting of 769 patient medical records with key variables such as glucose levels, body mass index (BMI), blood pressure, age, and other metabolic parameters relevant to the diagnosis of Diabetes Mellitus. The research methodology includes several stages: data selection, preprocessing to handle missing values, duplication, and normalization to ensure the dataset is properly structured for analysis. The implementation of the K-Means algorithm was carried out using Orange Data Mining software to produce optimal clustering patterns. The analysis identified three primary clusters (C1, C2, C3) that demonstrated significant differences, particularly based on glucose levels as the dominant variable in cluster formation. The scatter plot visualization revealed clear separations among clusters, with high intra-cluster homogeneity and strong inter-cluster heterogeneity. These findings confirm the effectiveness of the K-Means algorithm as an unsupervised learning method capable of uncovering hidden patterns within clinical diabetes data. The results are expected to serve as a foundation for developing more adaptive and precise clinical decision support systems, assisting healthcare professionals in designing targeted management and intervention strategies aligned with each patient’s risk profile.
Copyrights © 2025