herman mawengkang
Faculty of Matthematics and Natural Science Universitas Sumatera Utara, Medan, 20155, Indonesia

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Enhancing Unbalanced Data Classification with Cross-Validation and Extreme Gradient Boosting: A Comprehensive Analysis muhammad riki atsauri; herman mawengkang; syahril efendi
JOURNAL OF INFORMATICS AND TELECOMMUNICATION ENGINEERING Vol. 7 No. 1 (2023): Issues July 2023
Publisher : Universitas Medan Area

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.31289/jite.v7i1.8690

Abstract

As a novel and efficient ensemble learning algorithm, XGBoost has been widely applied due to its multiple advantages, but its classification effect in cases of data imbalance is often not ideal. Aiming at this problem, efforts were made to optimize XGBoost and the Cross Validation algorithm. The main idea is to combine cross validation and XGBoost on unbalanced data for data processing, and then get the final model based on XGBoost through training. At the same time, optimal parameters are searched and adjusted automatically through optimization algorithms to realize more accurate classification predictions. In the testing phase, the area under the curve (AUC) is used as an evaluation indicator to compare and analyze the classification performance of various sampling methods and algorithm models. The results of the model analysis using AUC are expected to verify the feasibility and effectiveness of the proposed algorithm.