Diabetes mellitus is one of the fastest-growing health problems in the 21st century. One of the causes is the lack of public awareness for regular health check-ups, while the lifestyle being led is quite unhealthy. Hemoglobin A1c (HbA1c) examination is highly recommended to detect diabetes. However, this service is not yet available at Posbindu in Bulupitu Village. Therefore, another approach is needed to detect the risk of diabetes early, namely through data mining. The data mining methods used in this research are the Naïve Bayes and kNN classification methods. The variables to determine the risk of diabetes include gender, age, family history of diabetes, frequent urination, Body Mass Index (BMI), blood sugar levels, and diabetes risk output. The division of testing and training datasets uses cross-validation and ratio (60:40, 70:30, 80:20, and 90:10). The best accuracy of the Naïve Bayes method was obtained by dividing the dataset using k-fold cross-validation with k=2, achieving 96.1%. In the kNN method, the best results were obtained from the 80:20 dataset ratio. Manhattan distance was found to be the best distance calculation in this study compared to Euclidean distance and Chebyshev distance.
Copyrights © 2024