PREDATECS: Public Research Journal of Engineering, Data Technology and Computer Science
Vol. 2 No. 1: PREDATECS July 2024

Classification of Diabetes Mellitus Sufferers Eating Patterns Using K-Nearest Neighbors, Naïve Bayes and Decision Tree

Lubis, Ayuni Fachrunisa (Unknown)
Haq, Hilmi Zalnel (Unknown)
Lestari, Indah (Unknown)
Iltizam, Muhammad (Unknown)
Samae, Nitasnim (Unknown)
Rofiqi, Muhammad Aufi (Unknown)
Abdurrahman, Sakhi Hasan (Unknown)
Tambusai, Balqis Hamasatiy (Unknown)
Salsilah, Puja Khalwa (Unknown)



Article Info

Publish Date
21 Apr 2024

Abstract

This study compares three classification algorithms, K-Nearest Neighbor (K-NN), Naïve Bayes, and Decision Tree, for classifying Diabetes Mellitus using a dataset from Kaggle. K-NN classifies a test record by computing its Euclidean distance to the training records and taking the majority class among the k closest ones; the choice of k, the number of nearest neighbors considered, significantly influences its effectiveness. Naïve Bayes is a probabilistic method that estimates class probabilities from previously observed data and models continuous attributes with a Gaussian distribution. Decision Trees build prediction models in the form of rules that are easy to implement. The dataset has eight attributes. Preprocessing consists of cleaning and normalization to reduce inconsistent and incomplete records. The algorithms are applied using the RapidMiner tool, and the results are compared by accuracy. Naïve Bayes yields 77.34% accuracy, K-NN performance depends on the chosen value of k, and the Decision Tree generates rules for classification. The study provides insights into the strengths and weaknesses of each algorithm for diabetes classification.
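The two numeric ideas in the abstract, Euclidean-distance K-NN and the Gaussian density that Naïve Bayes applies to continuous attributes, can be illustrated with a minimal Python sketch. It is not the study's RapidMiner workflow: the function names, the choice of min-max scaling as the normalization step, and the two-attribute toy records are assumptions made purely for illustration and are not taken from the Kaggle dataset.

```python
import math
from collections import Counter


def fit_min_max(rows):
    """Return per-column minima and maxima (assumed normalization: min-max)."""
    cols = list(zip(*rows))
    return [min(c) for c in cols], [max(c) for c in cols]


def apply_min_max(row, lows, highs):
    """Rescale one record to [0, 1] per column; constant columns map to 0.0."""
    return [
        (v - lo) / (hi - lo) if hi > lo else 0.0
        for v, lo, hi in zip(row, lows, highs)
    ]


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric records."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training records closest to the query."""
    nearest = sorted(
        zip(train_x, train_y), key=lambda pair: euclidean(pair[0], query)
    )[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def gaussian_pdf(x, mean, var):
    """Gaussian density, the per-attribute likelihood in Gaussian Naive Bayes."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


# Hypothetical toy records with two attributes (illustrative values only).
train_x = [[1.0, 1.0], [1.2, 0.8], [0.9, 1.1], [3.0, 3.2], [3.1, 2.9], [2.8, 3.0]]
train_y = [0, 0, 0, 1, 1, 1]

# Fit the scaler on the training records, then reuse it for the query.
lows, highs = fit_min_max(train_x)
scaled_x = [apply_min_max(row, lows, highs) for row in train_x]
query = apply_min_max([1.1, 1.0], lows, highs)

print(knn_predict(scaled_x, train_y, query, k=3))
```

Scaling before computing distances keeps an attribute with a large numeric range from dominating the vote, and rerunning `knn_predict` with different values of `k` is one simple way to observe the sensitivity to k that the abstract mentions.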

Copyrights © 2024






Journal Info

Abbrev

predatecs

Publisher

Subject

Computer Science & IT; Control & Systems Engineering; Decision Sciences, Operations Research & Management; Electrical & Electronics Engineering; Engineering

Description

PREDATECS: Public Research Journal of Engineering, Data Technology and Computer Science is a scientific journal published by the Institute of Research and Publication Indonesian (IRPI) or Institut Riset dan Publikasi Indonesia (IRPI). The main focus of PREDATECS Journal is Engineering, Data ...