Garuda - Garba Rujukan Digital

Article Per Year (5 Year)

p-Index From 2021 - 2026

0.23

P-Index

This Author published in this journals

All Journal LINGUISTS : JOURNAL OF LINGUISTICS AND LANGUAGE TEACHING

Fauzan Novaldy Pratama

Universitas Pendidikan Indonesia Universitas Nasional Pasim

Author-ID : 10230291

Humanities Education Languange, Linguistic, Communication & Media Social Sciences

Published : 1 Documents Claim Missing Document

Claim Missing Document

Articles

INTEGRATING INTENTION ATTRIBUTION INTO HATE SPEECH CORPUS IN THE INDONESIAN CONTEXT: A PRAGMATIC FRAMEWORK FOR NLP FOUNDATIONS Fauzan Novaldy Pratama; Eri Kurniawan; Andika Dutha Bachari; Siti Sopiah; Zainul Muttaqin
Linguists : Journal of Linguistics and Language Teaching Vol 12, No 1 (2026): July (In Press)
Publisher : Universitas Islam Negeri (UIN) Fatmawati Sukarno Bengkulu

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.29300/ling.v12i1.10932

This study develops a hate speech corpus by integrating Searle’s Speech Act theory to identify the illocutionary intentions behind offensive utterances, elaborated in two research objectives: 1) identifying illocutionary points within the corpus containing social identity content by employing Searle’s speech acts approach, and 2) evaluating corpus quality from a natural language processing perspective. Achieving these objectives requires a methodology that integrates linguistically qualitative description with quantitative machine learning measurement. The data was obtained from a readjusted corpus, with a focused annotation on 3,315 data points containing social identity markers. The study employed a qualitative linguistic framework for intention attribution, followed by a quantitative evaluation using a hybrid BiLSTM-IndoBERT algorithm to assess corpus consistency and model predictability. The findings indicate that hate speech in the Indonesian context is predominantly manifested through negatively expressive utterances, with religion being the most frequent target, followed by ethnicity-based directive attacks. The hybrid model achieved an F1-score of 87%, demonstrating the viability of the annotated corpus for automated detection. Integrating intention attribution provides a more granular linguistic foundation for language models compared to purely semantic-based approaches. This study offers a framework for stakeholders to map hate speech patterns, though future work should incorporate more diverse sociopolitical contexts.

Co-Authors Andika Dutha Bachari Eri Kurniawan Siti Sopiah Zainul Muttaqin

Title

Found 1 Documents
Search

Abstract

Title Search

Found 1 Documents Search

Abstract

Title

Found 1 Documents
Search