Sinkron : Jurnal dan Penelitian Teknik Informatika
Vol. 8 No. 4 (2024): Article Research Volume 8 Issue 4, October 2024

Comparative Study of XGBoost, Random Forest, and Logistic Regression Models for Predicting Customer Interest in Vehicle Insurance

Airlangga, Gregorius (Unknown)



Article Info

Publish Date
30 Oct 2024

Abstract

In today’s competitive insurance market, accurately predicting customer interest in additional products, such as vehicle insurance, is crucial for optimizing marketing strategies and maximizing sales. This study presents a comparative analysis of three machine learning models such as XGBoost, RandomForest, and Logistic Regression to predict customer interest in vehicle insurance based on a dataset that includes demographic, vehicle, and policy-related features. The dataset was analyzed using five-fold cross-validation, and the performance of the models was evaluated using AUC-ROC, precision, recall, and F1-score. XGBoost demonstrated the highest recall (0.9525) and AUC-ROC (0.7854), making it the most effective model for identifying customers interested in vehicle insurance, though at the expense of lower precision (0.2585). RandomForest showed a more balanced trade-off between precision (0.3064) and recall (0.5341) but performed lower overall. Logistic Regression, while the most interpretable model, exhibited high variability in performance across different folds, with a lower average precision (0.2372). The findings of this research highlight that XGBoost is ideal for maximizing recall in high-volume campaigns, while RandomForest may be better suited for applications requiring fewer false positives. These results offer valuable insights into model selection based on business objectives and resource allocation.

Copyrights © 2024






Journal Info

Abbrev

sinkron

Publisher

Subject

Computer Science & IT

Description

Scope of SinkrOns Scientific Discussion 1. Machine Learning 2. Cryptography 3. Steganography 4. Digital Image Processing 5. Networking 6. Security 7. Algorithm and Programming 8. Computer Vision 9. Troubleshooting 10. Internet and E-Commerce 11. Artificial Intelligence 12. Data Mining 13. Artificial ...