International Journal of Electrical and Computer Engineering
Vol 11, No 5: October 2021

Smart detection of offensive words in social media using the soundex algorithm and permuterm index

Malek Z. Alksasbeh (Al-Hussein Bin Talal University)
Bassam A. Y. Alqaralleh (Al Hussein Bin Talal University)
Tamer Abukhalil (Al Hussein Bin Talal University)
Anas Abukaraki (Al Hussein Bin Talal University)
Tawfiq Al Rawashdeh (Al Hussein Bin Talal University)
Moha'med Al-Jaafreh (Al Hussein Bin Talal University)



Article Info

Publish Date
01 Oct 2021

Abstract

Offensive posts in the social media that are inappropriate for a specific age, level of maturity, or impression are quite often destined more to unadult than adult participants. Nowadays, the growth in the number of the masked offensive words in the social media is one of the ethically challenging problems. Thus, there has been growing interest in development of methods that can automatically detect posts with such words. This study aimed at developing a method that can detect the masked offensive words in which partial alteration of the word may trick the conventional monitoring systems when being posted on social media. The proposed method progresses in a series of phases that can be broken down into a pre-processing phase, which includes filtering, tokenization, and stemming; offensive word extraction phase, which relies on using the soundex algorithm and permuterm index; and a post-processing phase that classifies the users’ posts in order to highlight the offensive content. Accordingly, the method detects the masked offensive words in the written text, thus forbidding certain types of offensive words from being published. Results of evaluation of performance of the proposed method indicate a 99% accuracy of detection of offensive words.

Copyrights © 2021






Journal Info

Abbrev

IJECE

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal of Electrical and Computer Engineering (IJECE, ISSN: 2088-8708, a SCOPUS indexed Journal, SNIP: 1.001; SJR: 0.296; CiteScore: 0.99; SJR & CiteScore Q2 on both of the Electrical & Electronics Engineering, and Computer Science) is the official publication of the Institute of ...