LINGUISTS : JOURNAL OF LINGUISTICS AND LANGUAGE TEACHING
Vol 12, No 1 (2026): July (In Press)

INTEGRATING INTENTION ATTRIBUTION INTO HATE SPEECH CORPUS IN THE INDONESIAN CONTEXT: A PRAGMATIC FRAMEWORK FOR NLP FOUNDATIONS

Fauzan Novaldy Pratama (Universitas Pendidikan Indonesia Universitas Nasional Pasim)
Eri Kurniawan (Universitas Pendidikan Indonesia)
Andika Dutha Bachari (Universitas Pendidikan Indonesia)
Siti Sopiah (Universitas Islam Nusantara)
Zainul Muttaqin (Universitas Pendidikan Indonesia)



Article Info

Publish Date
02 Jun 2026

Abstract

This study develops a hate speech corpus by integrating Searle’s Speech Act theory to identify the illocutionary intentions behind offensive utterances, elaborated in two research objectives: 1) identifying illocutionary points within the corpus containing social identity content by employing Searle’s speech acts approach, and 2) evaluating corpus quality from a natural language processing perspective. Achieving these objectives requires a methodology that integrates linguistically qualitative description with quantitative machine learning measurement. The data was obtained from a readjusted corpus, with a focused annotation on 3,315 data points containing social identity markers. The study employed a qualitative linguistic framework for intention attribution, followed by a quantitative evaluation using a hybrid BiLSTM-IndoBERT algorithm to assess corpus consistency and model predictability. The findings indicate that hate speech in the Indonesian context is predominantly manifested through negatively expressive utterances, with religion being the most frequent target, followed by ethnicity-based directive attacks. The hybrid model achieved an F1-score of 87%, demonstrating the viability of the annotated corpus for automated detection. Integrating intention attribution provides a more granular linguistic foundation for language models compared to purely semantic-based approaches. This study offers a framework for stakeholders to map hate speech patterns, though future work should incorporate more diverse sociopolitical contexts.

Copyrights © 2026






Journal Info

Abbrev

linguists

Publisher

Subject

Humanities Education Languange, Linguistic, Communication & Media Social Sciences

Description

The aim of this Journal is to promote a principled approach to research on language and language-related concerns by encouraging enquiry into relationship between theoretical and practical studies. The journal welcomes contributions in such areas of current analysis in: Second and foreign language ...