Suhaira, Zatin
Unknown Affiliation

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Analisis Persebaran Penyakit di Wilayah Menggunakan Algoritma K-Means Berbasis Data Kunjungan Fasilitas Kesehatan Suhaira, Zatin; Muliono, Rizki
INCODING: Journal of Informatics and Computer Science Engineering Vol 5, No 2 (2025): INCODING OKTOBER
Publisher : Mahesa Research Institute

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.34007/incoding.v5i2.983

Abstract

This study aims to analyze the distribution of diseases based on patient visit data to various healthcare facilities using the K-Means clustering method. The research data were obtained secondarily from the Kaggle platform, namely the ‘Healthcare Dataset’, which contains patient information, including healthcare facility attributes, medical conditions, and other related data. The determination of the optimal number of clusters was carried out using the Elbow Method, while the quality of clustering was evaluated with two internal metrics, namely the Silhouette Score and the Davies–Bouldin Index (DBI). The clustering results produced three main clusters with distinct characteristics. The first cluster was dominated by patients diagnosed with arthritis in the age group of 55–59 years with blood type O+. The second cluster showed a predominance of obesity in the age group of 35–39 years with blood type AB+, while the third cluster indicated cancer cases in the age group of 65–69 years with blood type O-. The evaluation resulted in a Silhouette Score of 0.5349 and a DBI of 0.5830, indicating that the clustering quality is fairly good, with compact and well-separated clusters. These findings not only highlight variations in disease distribution across healthcare facilities but also provide a foundation for mapping disease patterns and supporting strategic decision-making in public health..