Garuda - Garba Rujukan Digital

JUITA : Jurnal Informatika

JUITA Vol. 13 Issue 3, November 2025

Rananggana Trustha Dewangga (Universitas Negeri Semarang)
Budi Prasetiyo (Universitas Negeri Semarang)

Publish Date
08 Nov 2025

Clickbait uses sensational or misleading headlines to attract readers, which can degrade information quality in online news. This study presents a comparative evaluation of BERT and DistilBERT for detecting clickbait headline structures in the Indonesian language using the CLICK-ID dataset. The approach examines how class imbalance influences performance by training models on multiple dataset variants created through oversampling, undersampling, and data augmentation. Inputs are tokenized with model specific tokenizers and evaluated with accuracy, precision, recall, and F1-score. Confusion matrices are used to interpret error patterns across classes. Experimental results show that DistilBERT trained on an oversampled dataset achieves 94% for accuracy, precision, recall, and F1-score, while BERT on the same oversampled setting reaches 93%. Models trained on unbalanced data yield the lowest recall and F1 for the clickbait class, confirming the adverse effect of skewed distributions. Augmented and undersampled variants produce slightly lower but competitive results in the 92% to 93% range. Error analysis shows that DistilBERT reduces missed clickbait while maintaining a similar level of false positives, producing more balanced behavior across classes. These results outperform prior CLICK-ID studies and highlight the advantage of transformer architectures combined with effective class balancing for Indonesian clickbait detection.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

JUITA : Jurnal Informatika

Website

Abbrev

JUITA

Publisher

Universitas Muhammadiyah Purwokerto

Subject

Computer Science & IT

Description

UITA: Jurnal Informatika is a science journal and informatics field application that presents articles on thoughts and research of the latest developments. JUITA is a journal peer reviewed and open access. JUITA is published by the Informatics Engineering Study Program, Universitas Muhammadiyah ...

Article Info

Abstract

Analisis Komparasi Model BERT dan Model DISTILBERT Pada Klasifikasi Struktur Judul Berita Clickbait Online Berbahasa Indonesia

Article Info

Abstract