Garuda - Garba Rujukan Digital

Journal of Computer Science and Engineering (JCSE)

Vol 6, No 2: August (2025)

Wahyudi, Rizki (Unknown)

Publish Date
08 Aug 2025

This study evaluates the performance of three tree boosting algorithms, Random Forest (RF), XGBoost (XGB), and LightGBM (LGBM), in detecting phishing websites using a phishing dataset based on HTML, URLs, and network features. Two hyperparameter optimization strategies were tested: Hyperband search (HalvingRandomSearchCV) and stacking ensemble combining all three models. The evaluation was conducted based on five main metrics: accuracy, precision, recall, F1-score, and AUC‑ROC. The results indicate that LightGBM tuned via Hyperband achieved the highest performance (accuracy 0.9724; AUC‑ROC 0.9702), followed by ensemble tuned (accuracy 0.9697; AUC‑ROC 0.9684). SHAP analysis was used to interpret the contribution of key features in predicting phishing websites. The AUC‑ROC difference of 0.0034 points from the XGBoost baseline (0.9668) confirms the effectiveness of Hyperband tuning and stacking ensembles for phishing detection

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Journal of Computer Science and Engineering (JCSE)

Website

Abbrev

JCSE

Publisher

Institute of Computer Science and Engineering

Subject

Computer Science & IT

Description

Computer Architecture, Processor design, operating systems, high-performance computing, parallel processing, computer networks, embedded systems, theory of computation, design and analysis of algorithms, data structures and database systems, theory of computation, design and analysis of algorithms, ...

Article Info

Abstract

Hyperband‑Optimized LightGBM and Ensemble Learning for Web Phishing Detection with SHAP‑Based Interpretability

Article Info

Abstract