Indonesian Journal of Electrical Engineering and Computer Science
Vol 41, No 2: February 2026

Machine learning models in the enhancement of PSE in high-dimensional socioeconomic data: a review

B. Catedrilla, Gene Marck
Aviles, Joey



Article Info

Publish Date
01 Feb 2026

Abstract

This study reviews the use of machine learning (ML) techniques to improve propensity score estimation (PSE) in high-dimensional socioeconomic data. Traditional logistic regression (LR) often performs poorly when covariate relationships are nonlinear or complex, which leads to model misspecification and biased estimates. Across the reviewed studies, ensemble methods such as random forests (RF) and gradient boosting, as well as deep learning models, consistently achieved better covariate balance, lower bias, and greater flexibility than conventional approaches, while classification-based methods improved performance in imbalanced datasets. The review also highlights practical considerations, including calibration, transparent reporting, and integration with doubly robust estimators to strengthen causal inference. The findings indicate that ML-based PSE can substantially enhance the validity and reliability of socioeconomic evaluations, provided that its implementation is guided by appropriate expertise and best-practice standards.
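
As an illustration of the covariate balance criterion mentioned in the abstract, the following is a minimal sketch of how balance might be checked once propensity scores are available from any model (LR, RF, gradient boosting, or another learner). It is not taken from the reviewed studies: the function names, the clipping threshold, and the toy data are all assumptions made for illustration. It computes inverse-probability weights and compares the standardized mean difference (SMD) of one covariate before and after weighting.

```python
import math


def ipw_weights(treated, scores, clip=0.01):
    """Inverse-probability weights: 1/p for treated units, 1/(1-p) for controls.

    Scores are clipped to [clip, 1 - clip]; the 0.01 default is an
    illustrative assumption, not a recommendation from the review.
    """
    weights = []
    for t, p in zip(treated, scores):
        p = min(max(p, clip), 1.0 - clip)
        weights.append(1.0 / p if t == 1 else 1.0 / (1.0 - p))
    return weights


def weighted_mean(values, weights):
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def sample_variance(values):
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)


def standardized_mean_difference(x, treated, weights=None):
    """Difference in (weighted) group means of x, divided by the
    unweighted pooled standard deviation of the two groups."""
    if weights is None:
        weights = [1.0] * len(x)
    x1 = [v for v, t in zip(x, treated) if t == 1]
    x0 = [v for v, t in zip(x, treated) if t == 0]
    w1 = [w for w, t in zip(weights, treated) if t == 1]
    w0 = [w for w, t in zip(weights, treated) if t == 0]
    pooled_sd = math.sqrt((sample_variance(x1) + sample_variance(x0)) / 2.0)
    return (weighted_mean(x1, w1) - weighted_mean(x0, w0)) / pooled_sd


# Hypothetical toy data: one binary covariate that drives treatment uptake.
x = [1, 1, 1, 0, 1, 0, 0, 0]
treated = [1, 1, 1, 1, 0, 0, 0, 0]
# Stand-in for model output: here the scores equal the treated share
# within each covariate stratum (0.75 when x == 1, 0.25 when x == 0).
scores = [0.75 if v == 1 else 0.25 for v in x]

smd_raw = standardized_mean_difference(x, treated)
smd_ipw = standardized_mean_difference(x, treated, ipw_weights(treated, scores))
print(f"SMD before weighting: {smd_raw:.3f}")
print(f"SMD after weighting:  {smd_ipw:.3f}")
```

In this toy case the unweighted SMD is 1.0 and the weighted SMD is approximately zero, because the stand-in scores match the stratum-level treatment shares exactly. With scores from a misspecified model the weighted SMD would generally stay away from zero, which is the kind of residual imbalance the abstract attributes to LR under complex covariate structures.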

Copyrights © 2026