Claim Missing Document
Check
Articles

Found 2 Documents
Search

Arabic offensive text classification using emojis: Including emoji data in Arabic natural language processing Albalawi, Amal; Yafooz, Wael M. S.
International Journal of Electrical and Computer Engineering (IJECE) Vol 15, No 3: June 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijece.v15i3.pp3332-3345

Abstract

In the digital social media ecosystem, controlling offensive language requires advanced algorithmic tools. This study examines the influence of including emojis translation in the text preprocessing stage of the classification of offensive Arabic text. A novel dataset of 10,000 Arabic tweets was developed, with rigorous annotations to classify content as offensive or non-offensive. The dataset was meticulously annotated and validated using Cohen's kappa (CK) and Krippendorff's Alpha (α) to ensure consistency and accuracy. Several experiments evaluated the dataset with the most common text classification models: seven machine learning (ML) classifiers and three deep learning (DL) models. Two experimental sets were conducted: one with emoji translation in preprocessing to enrich text input and another without emoji translation to directly assess the impact of emojis on classification accuracy. The findings indicate that emojis significantly affect text classification models, with advanced DL models showing higher sensitivity to contextual nuances conveyed by emojis compared to traditional ML classifiers. This research highlights the dual role of emojis, which are often linked to positive emotions and offensive contexts, adding complexity to digital communication. It contributes to the development of more accurate and context-sensitive natural language processing (NLP) tools.
Exploring social media sentiment patterns for improved cyberbullying detection Yafooz, Wael M. S.; Yahya, Abdulsamad Ebrahim; Alsaeedi, Abdullah; Alluhaibi, Reyadh; Jamil, Faisal; Elsayed, Mahmoud Salaheldin
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 14, No 5: October 2025
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v14.i5.pp4211-4225

Abstract

Cases of online bullying and aggressive behaviors directed at social media users have surged in recent years. These behaviors have had negative impacts on victims from a wide range of demographic groups. While efforts have been made to address persistent digital harassment, the expected outcome has been limited due to the lack of effective tools to quickly identify cyberbullying behaviors and censor them accordingly on social media platforms. This study presents a scalable and systematic method to detect and analyze offensive behavior and bullying on Twitter (now known as X). Our methodology involves extracting textual, user-related, and network-related attributes to understand the traits of individuals involved in such behaviors. This approach aims to recognize distinctive characteristics that set them apart from regular users. This study proposes a novel model by employing an integrated deep-learning model, combining the bidirectional gated recurrent unit (BiGRU), transformer block, and convolutional neural network (CNN). This model aims to classify X comments into offensive and non-offensive categories. The proposed model’s efficiacy has been evaluated through several experiments by combining three widely recognized datasets of hate speech. The proposed model achieves an accuracy rate of approximately 98.95%, showing promising results in identifying and categorizing offensive behavior in cyberbullying.