Obesity is a global health problem that continues to increase and has serious impacts on physical and mental health. This research aims to predict a person's obesity status based on certain attributes using the K-Nearest Neighbor (KNN), Naive Bayes, and Random Forest algorithms. The dataset used was taken from the Kaggle platform with 2,111 data and 16 attributes, including gender, age, weight, height, frequency of consumption of high-calorie foods, physical activity, and water and vegetable consumption patterns. The research process follows the data mining stages, including business understanding, data understanding, data preparation, modeling, evaluation, and documentation. Experiments were carried out using RapidMiner with a cross-validation technique using 10 folds to measure overall model performance. The research results show that the Random Forest algorithm performs best in predicting obesity status compared to K-NN and Naive Bayes. Model evaluation using accuracy, precision, recall, and F1-score metrics shows significant results in distinguishing obesity categories. It is hoped that this research can contribute to the development of a machine learning-based health prediction system that can be used to support decision-making in the prevention and management of obesity.
Copyrights © 2025