Heterogeneous synthetic data is artificial data that can include many types of features (demographics, examinations, therapies). Complex patients (many procedures & medications) but fast service process and low complications. All patients are divided into 4 clusters, patient segmentation includes cluster 1 including mild patients, Cluster 2 including complex patients, Cluster 3 including high costs, Cluster 4 including high readmission risk. The highest silhouette score is 0.2187, which is obtained when the number of clusters (k) is 2. Based on previous calculations, the Davies-Bouldin Index result for the current clustering solution is 2.33. The Calinski-Harabasz index for the clustering solution with k=4 is 367.72. Clustering results are simply groups, without labels. Further analysis is needed to assign clinical meaning to each cluster.
Copyrights © 2025