IAES International Journal of Artificial Intelligence (IJ-AI)
Vol 14, No 6: December 2025

Arabic text classification using machine learning and deep learning algorithms

Alqahtani, Rawad Awad (Unknown)
Abdelhafez, Hoda A. (Unknown)



Article Info

Publish Date
01 Dec 2025

Abstract

The classification of Arabic textual content presents considerable challenges due to the language's rich morphological structure and the wide variation among its dialects. This study aims to enhance classification accuracy by leveraging ensemble learning techniques and a deep bidirectional transformer-based model, specifically the multilingual autoregressive BERT (MARBERT). To address linguistic variability, advanced preprocessing techniques were employed, including Farasa, Tashaphyne, and Assem stemming methods. The Al Khaleej dataset served as the basis for supervised learning, providing a representative sample of Arabic text. Furthermore, term frequency-inverse document frequency (TF-IDF) with bigram and trigram feature extraction was utilized to effectively capture contextual semantics. Experimental results indicate that the proposed approach, particularly with the integration of MARBERT, achieves a peak classification accuracy of 98.59%, outperforming existing models. This research underscores the efficacy of combining ensemble learning with deep transformer-based models for Arabic text classification and highlights the critical role of robust preprocessing techniques in managing linguistic complexity and improving model performance.

Copyrights © 2025






Journal Info

Abbrev

IJAI

Publisher

Subject

Computer Science & IT Engineering

Description

IAES International Journal of Artificial Intelligence (IJ-AI) publishes articles in the field of artificial intelligence (AI). The scope covers all artificial intelligence area and its application in the following topics: neural networks; fuzzy logic; simulated biological evolution algorithms (like ...