Indonesian Journal of Electrical Engineering and Computer Science
Vol 28, No 3: December 2022

Arabic authorship attribution on Twitter: what is really matters?

Anoual El kah (Mohamed First University)
Aymane El airej (Moulay Ismail University of Meknes)
Imad Zeroual (Moulay Ismail University of Meknes)



Article Info

Publish Date
01 Dec 2022

Abstract

Authorship attribution (AA) of texts from online social networks has recently gained more attention. However, since the first work addressing the AA of Arabic tweets was published in 2015, little further work has followed. This paper therefore presents an extensive study of how various factors affect the AA of Arabic short texts, especially tweets. We propose an architecture in which AA accuracy is examined as a function of the size of the training dataset, the number of classes covered, the text processing techniques applied, the methods used for feature selection and extraction, and the classifier implemented. In total, we performed 792 different tests. The highest accuracy recorded is 97.4%, which is among the best results published so far.
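A study of this kind can be pictured as a grid of experiment configurations, one per combination of factor levels. The following is only a minimal sketch of that idea: the abstract names the factors but not their values, so every level listed in `FACTORS` (and the helper name `experiment_grid`) is a hypothetical placeholder, and the resulting count is not meant to reproduce the 792 tests reported.

```python
from itertools import product

# Hypothetical factor levels, for illustration only; the abstract lists
# the factors but does not state which values were actually tested.
FACTORS = {
    "train_size": [100, 500, 1000],           # tweets per author (assumed)
    "num_authors": [5, 10],                   # number of classes (assumed)
    "preprocessing": ["raw", "normalized"],   # text processing (assumed)
    "features": ["char_ngrams", "word_ngrams"],  # feature extraction (assumed)
    "classifier": ["naive_bayes", "svm"],     # classifier (assumed)
}


def experiment_grid(factors):
    """Yield one configuration dict per combination of factor levels."""
    names = list(factors)
    for values in product(*(factors[name] for name in names)):
        yield dict(zip(names, values))


configs = list(experiment_grid(FACTORS))
print(len(configs))  # 3 * 2 * 2 * 2 * 2 = 48 configurations
```

Each configuration would then be trained and scored separately, so that accuracy can be compared across one factor while the others are held fixed.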

Copyrights © 2022