International Journal of Electrical and Computer Engineering
Vol 9, No 6: December 2019

A computational analysis of short sentences based on ensemble similarity model

Arifah Che Alhadi (Universiti Malaysia Terengganu)
Aziz Deraman (Universiti Malaysia Terengganu)
Masita Masila Abdul Jalil (Universiti Malaysia Terengganu)
Wan Nural Jawahir Wan Yussof (Universiti Malaysia Terengganu)
Rosmayati Mohemad (Universiti Malaysia Terengganu)



Article Info

Publish Date
01 Dec 2019

Abstract

The rapid development of Internet along with the wide use of social media applications produce huge volume of unstructured data in short text form such as tweets, text snippets and instant messages. This form of data rarely contains repeated word. It presents challenge in sentences similarity analysis as the standard text similarity models merely rely on the number of word occurrence, often resulting unreliable similarity value. Besides, the use of abbreviation, acronyms, slang, smiley, jargon, symbol or non-standard short form also contributes to the difficulty in similarity analysis. Thus, an extended ensemble similarity model approach is proposed. An experimental study has been conducted using datasets of English short sentences. The findings are very encouraging in improving the similarity value for short sentences.

Copyrights © 2019






Journal Info

Abbrev

IJECE

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal of Electrical and Computer Engineering (IJECE, ISSN: 2088-8708, a SCOPUS indexed Journal, SNIP: 1.001; SJR: 0.296; CiteScore: 0.99; SJR & CiteScore Q2 on both of the Electrical & Electronics Engineering, and Computer Science) is the official publication of the Institute of ...