Compiler
Vol 15, No 1 (2026): May

Evaluating IndoBERT for Fraudulent Tweet Detection on Social Media X

Imroatul Khuluqi Izzah (Universitas Ahmad Dahlan)
Imam Riadi (Universitas Ahmad Dahlan)
Abdul Fadlil (Universitas Ahmad Dahlan)



Article Info

Publish Date
07 Jun 2026

Abstract

The spread of fraudulent content on social media X has become an important issue because perpetrators often use persuasive, urgent, and misleading language to influence users to transfer money, share personal data, or access suspicious links. This research evaluates the performance of IndoBERT for binary classification of fraud and non-fraud Indonesian-language posts on social media X using a two-stage fine-tuning design. The dataset consists of 5,235 manually labeled posts, including 2,557 fraud and 2,678 non-fraud instances. In Stage 1, four IndoBERT variants, namely indobert-base-p1, indobert-base-p2, indobert-large-p1, and indobert-large-p2, were compared using a uniform training configuration to identify the best model. The results showed that indobert-large-p1 at epoch 5 achieved the best performance, with a validation F1-score for the fraud class of 0.8898 and a test accuracy of 0.8989. In Stage 2, the selected model was re-evaluated through a controlled grid search by varying epoch, learning rate, and batch size. Although the best Stage 2 configuration improved the validation F1-score to 0.8975, it did not surpass the best Stage 1 model on the test set. These findings indicate that IndoBERT is effective for fraud detection and that a two-stage evaluation design supports more systematic model selection.

Copyrights © 2026






Journal Info

Abbrev

compiler

Publisher

Subject

Computer Science & IT

Description

Jurnal "COMPILER" dengan ISSN Cetak : 2252-3839 dan ISSN On Line 2549-2403 adalah jurnal yang diterbitkan oleh Departement Informatika Sekolah Tinggi Teknologi Adisutjipto Yogyakarta. Jurnal ini memuat artikel yang merupakan hasil-hasil penelitian dengan bidang kajian Struktur Diskrit, Ilmu ...