Claim Missing Document
Check
Articles

Found 2 Documents
Search

Optimization of Energy Consumption Prediction with Random Forest Regressor and XGBoost Feature Importance Syafei, Risma Moulidya; Nikmah, Tiara Lailatul; Anisa, Devi Nurul; Kharisma, Sidiq Noor
Journal of Information System Exploration and Research Vol. 4 No. 1 (2026): January 2026
Publisher : shmpublisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/joiser.v4i1.653

Abstract

Energy consumption is increasing as industry and technology advance. However, it will have a bad impact if its use is not properly controlled. Therefore, predicting energy consumption is needed to prevent energy waste and to streamline its use across several influencing factors. Predictions are made using the Random Forest Regressor method. Where regression and Random Forest techniques can produce accurate results for continuous values such as total energy consumption. The feature importance method is also used to select the most influential features. Where of the 40 features in the energy consumption dataset in Southern California, only 24 features were selected based on the average threshold of the gain value. The results showed that the use of XGBoost feature importance lowered the Mean Absolute Error (MAE) value of the Random Forest Regressor, which was 16.56 to 16.55. This value is the difference between the actual data and the predicted data. This proves that the model successfully predicts with a small error value. The application of feature importance in energy consumption prediction using Random Forest Regressor is expected to be more efficient in energy consumption, especially in the sectors that most affect the increase in energy consumption.
Enhanced Out-of-Fold Stacking with Feature Grouping and Model-Specific Transformations for Diabetes Prediction Improvement Putro, Ari Nugroho; Kharisma, Sidiq Noor; Al-Zahra, Gea Destadia; Muslim, Much Aziz; Pertiwi, Dwika Ananda Agustina
Journal of Student Research Exploration Vol. 4 No. 1 (2026): January 2026
Publisher : SHM Publisher

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.52465/josre.v4i1.674

Abstract

Diabetes mellitus is a chronic disease with serious implications for global health. Early detection is essential to reduce these risks, and machine learning methods are widely used in diabetes prediction. However, improving accuracy remains a major challenge in the development of predictive models. This study proposes a stacking-based ensemble learning approach with an out-of-fold (OOF) scheme to improve classification performance. The proposed method consists of several systematic steps, namely (1) data preprocessing via median imputation of invalid values and feature transformation according to model characteristics, (2) the creation of base learners comprising Logistic Regression, Gaussian Naïve Bayes, Support Vector Machine, Random Forest, and XGBoost, (3) model training using Stratified Cross Validation 5 Fold to generate OOF predictions, (4) combining all OOF predictions into a meta-feature matrix, and (5) training an XGBoost-based meta-model to generate the final prediction. This approach enables the meta-model to optimally learn the relationships among the outputs of the baseline models. Experimental results show that the proposed method achieves an accuracy of 91.15%, precision of 90.65%, recall of 83.21%, and an F1-score of 86.77%. These results indicate that stacking is effective in improving the accuracy of diabetes predictions.