International Journal of Electrical and Computer Engineering
Vol 6, No 4: August 2016

Inferring Student's Chat Topic in Colloquial Arabic Text using Semantic Representation

Faisal T. Khamayseh (College of Information Technology and Computer Engineering Palestine Polytechnic University)



Article Info

Publish Date
01 Aug 2016

Abstract

Since the colloquial Arabic is now widespread it is required to describe the collection and classification of a multi-dialectal corpus of Arabic. Nowadays, colloquial multi-dialectal comes in almost country based forms such as Egyptian, Iraqi, Levantine, Tunisian, etc. This paper discusses a new method for analyzing the conversation of the educational chat room using Corpus for Palestinian Arabic and Stanford Tagger. This method represents the key words using semantic net-like representation to obtain the main subjects of the conversation. The main subject of the chat is obtained using the proposed method which shows a high accuracy. Using Arabic Corpus, Stanford Tagger and percentage of words will add more accuracy. The study also examines the effect of pivot distribution based on occurrences and betweeness values of the pivots over the text. This study examines some of the characteristics of the texts written in colloquial Arabic dialect and analyzes the free expressive Arabic statements. The results of the paper show that the core can be determined by combining both the occurrences and the distribution of the word over the conversation.

Copyrights © 2016






Journal Info

Abbrev

IJECE

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal of Electrical and Computer Engineering (IJECE, ISSN: 2088-8708, a SCOPUS indexed Journal, SNIP: 1.001; SJR: 0.296; CiteScore: 0.99; SJR & CiteScore Q2 on both of the Electrical & Electronics Engineering, and Computer Science) is the official publication of the Institute of ...