This study aims to conduct a comparative modelling of pineapple production at PT Great Giant Pineapple (GGP) using Gaussian GLM as parametric statistical approach and Random Forest Regression method as machine learning based on monthly data from 2014 to 2022. Multicollinearity testing and distribution fitting were conducted to validate the Gaussian assumption. For the Random Forest Regression, hyperparameters were optimized by tuning the number of trees (ntree) and the number of predictors at each split (mtry) with model stability evaluated using Out-of-Bag (OOB) error. The Gaussian GLM achieved a MAPE of 8.41% (R² = 0.106) for the GP3 clone and 11.27% (R² = 0.149) for the F180 clone. Random Forest Regression produced a testing MAPE of 9.28% (R² = 0.144) for GP3 and 12.11% (R² = 0.105) for F180. While both models achieved low prediction error based on MAPE, they differed in identifying influential variables and showed limited explanatory power as indicated by low R² values. The Gaussian GLM identifies air pressure as significant for both clones and rainfall for F180 clone, while Random Forest consistently identifies rainfall as the most influential predictor. These findings confirm the complementary strengths of parametric and machine learning approaches in supporting climate-based production planning and risk mitigation.
Copyrights © 2026