IAES International Journal of Artificial Intelligence (IJ-AI)
Vol 15, No 3: June 2026

Enhancing fake news detection: a hybrid BERT-XGBoost model for improved performance and interpretability

Nishant Vasantkumar Hegde (R. V. College of Engineering)
Suneesh Bare (R. V. College of Engineering)
Namruth Reddy (NVIDIA Corporation)
Rajat Gondkar Aravinda (R. V. College of Engineering)
Minal Moharir (R. V. College of Engineering)
Aamir Ibrahim (R. V. College of Engineering)



Article Info

Publish Date
01 Jun 2026

Abstract

The widespread spread of fake news poses a serious threat to the integrity of information. The dominant approach to detection involves end-to-end fine-tuning of large transformer models like bidirectional encoder representations from transformers (BERT), which, despite achieving high accuracy, often function as opaque “black boxes” with limited interpretability. This paper proposes and validates a hybrid, decoupled architecture that proves to be a more practical and powerful alternative. We first fine-tune a DistilBERT model on the full WELFake dataset of 71,537 articles after cleaning to create domain-specific embeddings. These high-dimensional vectors are then used as input features to train a robust extreme gradient boosting (XGBoost) classifier. The results demonstrate that the hybrid model achieves a state-of-the-art accuracy of 99.76%, slightly surpassing the already high performance of a standard end-to-end fine-tuned model. Crucially, this approach provides this top-tier performance while offering significant advantages in model interpretability through feature importance analysis. This work establishes that a decoupled architecture is not just a viable alternative but a superior practical strategy for combating misinformation, successfully balancing state-of-the-art accuracy with essential model transparency.

Copyrights © 2026






Journal Info

Abbrev

IJAI

Publisher

Subject

Computer Science & IT Engineering

Description

IAES International Journal of Artificial Intelligence (IJ-AI) publishes articles in the field of artificial intelligence (AI). The scope covers all artificial intelligence area and its application in the following topics: neural networks; fuzzy logic; simulated biological evolution algorithms (like ...