Sleep is a fundamental human need that plays a crucial role in maintaining both physical and mental health. Poor sleep quality can trigger a variety of health problems, ranging from decreased concentration to an increased risk of chronic diseases. The complexity of factors influencing sleep quality—such as stress levels, heart rate, blood pressure, physical activity, and lifestyle—makes its assessment difficult through direct observation alone. Therefore, data mining approaches are increasingly utilized to identify relevant patterns in sleep-related data. This study aims to compare the performance of the C4.5 (Decision Tree) algorithm and the Naïve Bayes algorithm in predicting sleep quality using the Sleep Health and Lifestyle dataset, which contains information from 374 respondents. The research method applied is a quantitative comparative approach employing classification techniques with 10-fold cross-validation to ensure robust evaluation. Model performance is assessed using accuracy, precision, and recall metrics to provide a comprehensive understanding of the effectiveness of each algorithm. The findings indicate that the C4.5 algorithm achieves an accuracy of 96.26% and offers advantages in terms of interpretability through its decision tree visualization, enabling easier understanding of variable relationships. In contrast, the Naïve Bayes algorithm demonstrates superior predictive performance, achieving an accuracy of 98.66% along with consistently high precision and recall across nearly all classes. These results suggest that Naïve Bayes is more effective for predictive tasks involving sleep quality, while C4.5 remains highly valuable when the goal is to interpret variable interactions and decision rules. Overall, this research highlights the potential of data mining techniques in health informatics, particularly in improving the understanding and prediction of sleep quality, which in turn can contribute to better prevention and management of sleep-related health issues.
Copyrights © 2025