Garuda - Garba Rujukan Digital

ComTech: Computer, Mathematics and Engineering Applications

Vol. 17 No. 1 (2026): ComTech

Asysta Amalia Pasaribu (Department of Statistics, School of Computer Science, BINUS UNIVERSITY, Jakarta)
Nur Fitriyani Sahamony (Universitas Binawan, IPB University)
Khairil Anwar Notodiputro (IPB University, Department of Statistics and Data Science, Bogor, 16680, Indonesia; Universitas Binawan, Faculty of Business and Social Science, Jl. Dewi Sartika Raya, Jakarta Timur, 13630, Indonesia)
Bagus Sartono (IPB University, Department of Statistics and Data Science, Bogor, 16680, Indonesia; Universitas Binawan, Faculty of Business and Social Science, Jl. Dewi Sartika Raya, Jakarta Timur, 13630, Indonesia)

Publish Date: 29 Jan 2026

Stunting is a form of chronic nutritional deficiency in toddlers and remains a major public health concern due to its impact on child growth and development. Efforts to reduce its prevalence continue to be strengthened in Indonesia, particularly in Sumatra Province. This study evaluates the accuracy of a logistic regression model and three machine learning models, namely decision tree, random forest, and Support Vector Machine (SVM), in classifying stunting prevalence. The response variable is the prevalence of stunting among toddlers, categorized into two classes based on the 2024 national threshold: exceeding the national target and not exceeding it. Although classification models can provide accurate predictions, they often lack interpretability. Therefore, this study applies the Shapley Additive exPlanations (SHAP) method to the best-performing machine learning model to identify the key factors influencing stunting. The use of Shapley values is justified by the uniqueness theorem, which establishes them as the only attribution method that satisfies a set of desirable fairness properties. SHAP values explain the model by referencing both the trained model and the underlying data. The results show that the random forest model achieves the highest accuracy (90.00%) and outperforms the other models. SHAP analysis reveals that Underweight is the most influential predictor contributing to stunting prevalence in Sumatra Province. These findings highlight the importance of machine learning interpretability in supporting policy decisions to reduce stunting.
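
The idea that Shapley values reference both the model and the underlying data can be illustrated with a small sketch. The example below is not the authors' code: it computes exact Shapley values for a hypothetical three-feature scoring function, where features left out of a coalition are filled in from a hypothetical background dataset. The function names (`shapley_values`, `toy_model`), the weights, and all numbers are assumptions made for illustration only.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, background):
    """Exact Shapley values of model(x) relative to a background dataset.

    Illustrative sketch only: cost grows exponentially with the number
    of features, so this is practical for a handful of features at most.
    """
    n = len(x)

    def value(subset):
        # Average model output when the features in `subset` are fixed to
        # their values in x and the rest are taken from background rows.
        total = 0.0
        for row in background:
            mixed = [x[i] if i in subset else row[i] for i in range(n)]
            total += model(mixed)
        return total / len(background)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        contribution = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size: |S|!(n-|S|-1)!/n!
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for combo in combinations(others, size):
                s = set(combo)
                contribution += weight * (value(s | {i}) - value(s))
        phi.append(contribution)
    return phi, value(set())


def toy_model(features):
    # Hypothetical linear score; index 0 stands in for an "underweight"
    # indicator, indices 1 and 2 are unnamed placeholder predictors.
    return 2.0 * features[0] + 1.0 * features[1] + 0.0 * features[2]


# Hypothetical background data and the instance to explain.
background = [[0.0, 0.0, 0.0], [2.0, 2.0, 2.0]]
x = [3.0, 2.0, 5.0]

phi, base = shapley_values(toy_model, x, background)
print("attributions:", phi)
print("baseline:", base)
print("model output:", toy_model(x))
```

In this toy setup the attributions add up to the gap between the model output for `x` and the average output over the background data, which is the efficiency property, one of the fairness properties covered by the uniqueness theorem. The third feature has zero weight in the model and so receives zero attribution. In practice a library such as `shap` would approximate these values for a fitted random forest rather than enumerate every coalition.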

Journal Info

Abbrev: comtech
Publisher: Universitas Bina Nusantara
Subject: Computer Science & IT, Engineering, Mathematics
Description: The journal invites professionals in the world of education, research, and entrepreneurship to participate in disseminating ideas, concepts, new theories, or science development in the field of Information Systems, Architecture, Civil Engineering, Computer Engineering, Industrial Engineering, Food ...

Explainable Machine Learning Models SHAP-based for Feature Importance Affecting Stunting Prevalence
