Artificial Intelligence in Educational Decision Sciences
Vol 1 No 2 (2026): Artificial Intelligence in Educational Decision Sciences

Data-Driven Obesity Classification Integrating Genetic and Lifestyle Determinants Using Naive Bayes

Yusion Gandjang (Universitas Negeri Makassar)
Amaliah Safitri K (Universitas Negeri Makassar)
Nabila Dwi Anugra (Universitas Negeri Makassar)
Iyang Yuyung S (Universitas Negeri Makassar)
Akhmad Affandi (Dresden University)



Article Info

Publish Date
07 Feb 2026

Abstract

Purpose – This study aims to develop a data-driven obesity classification framework that integrates genetic predisposition and lifestyle determinants using the Naive Bayes algorithm, while empirically evaluating optimal training–testing data proportions for health decision support systems.Methods – A systematic computational workflow was applied to a public obesity dataset comprising 2,112 records, which was refined to 1,259 valid instances after preprocessing. Genetic indicators and lifestyle-related variables were encoded and classified into four obesity categories: normal weight, obesity type I, obesity type II, and obesity type III. The Naive Bayes model was evaluated using three training–testing data partition ratios (75:25, 80:20, and 85:15). Model performance was assessed using six metrics: Area Under the Curve (AUC), classification accuracy, F1-score, precision, recall, and Matthews Correlation Coefficient.Findings – The results demonstrate that the 80:20 and 85:15 data partitions achieved the highest performance, with an accuracy of 0.878 and an AUC of 0.979. The model showed excellent sensitivity in identifying severe obesity cases, while moderate misclassification occurred between obesity type I and type II due to phenotypic overlap in lifestyle patterns.Research limitations – This study relies on a single public dataset and lacks population-specific genetic calibration, which may limit generalizability to diverse regional contexts.Originality – This study provides empirical validation of a probabilistic obesity classification framework that integrates genetic and lifestyle factors, offering an interpretable and computationally efficient approach to support data-driven health decision making.

Copyrights © 2026






Journal Info

Abbrev

AIEDS

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Education Electrical & Electronics Engineering Engineering Social Sciences

Description

Artificial Intelligence in Educational Decision Sciences (AIEDS) focuses on high-quality empirical, theoretical, and methodological research that examines the role of artificial intelligence in shaping, supporting, and optimizing decision-making processes within educational systems. The journal is ...