International Journal of Electrical and Computer Engineering
Vol 15, No 3: June 2025

Arabic offensive text classification using emojis: Including emoji data in Arabic natural language processing

Albalawi, Amal
Yafooz, Wael M. S.



Article Info

Publish Date
01 Jun 2025

Abstract

In the digital social media ecosystem, controlling offensive language requires advanced algorithmic tools. This study examines how translating emojis into text during preprocessing affects the classification of offensive Arabic text. A novel dataset of 10,000 Arabic tweets was developed and annotated as offensive or non-offensive, and annotation consistency was validated using Cohen's kappa (CK) and Krippendorff's alpha (α). The dataset was evaluated with the most common text classification models: seven machine learning (ML) classifiers and three deep learning (DL) models. Two experimental sets were conducted: one with emoji translation in preprocessing to enrich the text input, and one without emoji translation, so that the impact of emojis on classification accuracy could be assessed directly. The findings indicate that emojis significantly affect text classification models, with advanced DL models showing higher sensitivity to the contextual nuances conveyed by emojis than traditional ML classifiers. This research highlights the dual role of emojis, which are often linked to both positive emotions and offensive contexts, adding complexity to digital communication. It contributes to the development of more accurate and context-sensitive natural language processing (NLP) tools.
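
As an illustration of the preprocessing step described in the abstract, the following is a minimal sketch of emoji translation, not the authors' implementation. The lookup table `EMOJI_TO_ARABIC`, its three entries, and the function names `translate_emojis` and `strip_emojis` are hypothetical and chosen only for this example. The sketch assumes single-code-point emojis. It also assumes that the comparison condition drops emojis, which is only one possible reading of "without emoji translation".

```python
# Hypothetical emoji-to-Arabic lookup. The entries are illustrative only
# and are not the mapping used in the study.
EMOJI_TO_ARABIC = {
    "\U0001F602": "ضحك",        # face with tears of joy -> "laughter"
    "\U0001F621": "غضب",        # pouting face -> "anger"
    "\U0001F494": "قلب مكسور",  # broken heart -> "broken heart"
}


def translate_emojis(text, mapping=EMOJI_TO_ARABIC):
    """Replace each mapped emoji with its Arabic gloss.

    Only single-code-point emojis are handled; multi-code-point
    sequences (skin tones, ZWJ sequences) are outside this sketch.
    """
    pieces = []
    for ch in text:
        if ch in mapping:
            # Pad with spaces so the gloss becomes a separate token.
            pieces.append(f" {mapping[ch]} ")
        else:
            pieces.append(ch)
    # Collapse any repeated whitespace introduced by the padding.
    return " ".join("".join(pieces).split())


def strip_emojis(text, mapping=EMOJI_TO_ARABIC):
    """Assumed baseline: drop mapped emojis instead of translating them."""
    kept = "".join(ch for ch in text if ch not in mapping)
    return " ".join(kept.split())


if __name__ == "__main__":
    tweet = "انت غبي \U0001F621"
    print(translate_emojis(tweet))
    print(strip_emojis(tweet))
```

Running both functions over the same tweets would produce the two input variants that a classifier could then be trained and evaluated on separately.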

Copyright © 2025






Journal Info

Abbrev

IJECE

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal of Electrical and Computer Engineering (IJECE, ISSN: 2088-8708) is a Scopus-indexed journal (SNIP: 1.001; SJR: 0.296; CiteScore: 0.99; SJR and CiteScore Q2 in both Electrical & Electronics Engineering and Computer Science) and is the official publication of the Institute of ...