Indonesian Journal of Electrical Engineering and Computer Science
Vol 10, No 3: June 2018

An Empirical Comparative Study of Instance-based Schema Matching

Mogahed Alzeber (Department of Computer Science, Kulliyyah of Information and Communication Technology, International Islamic University Malaysia)
Ali A. Alwan (Department of Computer Science, Kulliyyah of Information and Communication Technology, International Islamic University Malaysia)
Azlin Nordin (Department of Computer Science, Kulliyyah of Information and Communication Technology, International Islamic University Malaysia)
Abedallah Zaid Abualkishik (College of Computer Information Technology, American University in the Emirates)



Article Info

Publish Date
01 Jun 2018

Abstract

The main concern of schema matching is how to support the merging decision by identifying correspondences between attributes of different schemas. Many works in the literature use database instances to detect the correspondence between attributes, and most of them aim at improving match accuracy. We observed that no technique provides accurate matching for all types of data. In particular, some techniques treat numeric values as strings, while others process textual instances as numeric values; this negatively influences the process of discovering the match and compromises the matching result. Thus, a practical comparative study between syntactic and semantic techniques is needed. The study emphasizes analyzing these techniques to determine the strengths and weaknesses of each. This paper compares two instance-based matching techniques, namely (i) regular expression and (ii) Google similarity, for identifying the match between attributes. Several analyses have been conducted on real and synthetic data sets to evaluate the performance of these techniques with respect to Precision (P), Recall (R) and F-Measure.
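To make the syntactic side of the comparison concrete, the following is a minimal sketch of regular-expression-based instance matching and of the Precision, Recall and F-Measure computation. It is not the authors' implementation: the pattern classes, the function names (`profile`, `dominant_pattern`, `match_attributes`, `evaluate`) and the toy schemas are all assumptions made for illustration, and the semantic (Google similarity) technique is not covered.

```python
import re

# Hypothetical pattern classes; the abstract does not list the
# regular expressions actually used in the study.
PATTERNS = {
    "integer": re.compile(r"[+-]?\d+"),
    "decimal": re.compile(r"[+-]?\d+\.\d+"),
    "date": re.compile(r"\d{4}-\d{2}-\d{2}"),
    "text": re.compile(r"[A-Za-z][A-Za-z .'-]*"),
}


def profile(instances):
    """Return, per pattern, the fraction of instances that fully match it."""
    total = len(instances)
    if total == 0:
        return {name: 0.0 for name in PATTERNS}
    return {
        name: sum(1 for value in instances if pattern.fullmatch(value.strip())) / total
        for name, pattern in PATTERNS.items()
    }


def dominant_pattern(instances):
    """Return the best-matching pattern name, or None if nothing matches."""
    scores = profile(instances)
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def match_attributes(source, target):
    """Pair attributes whose instances share the same dominant pattern."""
    pairs = set()
    for s_name, s_values in source.items():
        s_pattern = dominant_pattern(s_values)
        if s_pattern is None:
            continue
        for t_name, t_values in target.items():
            if dominant_pattern(t_values) == s_pattern:
                pairs.add((s_name, t_name))
    return pairs


def evaluate(found, expected):
    """Compute Precision, Recall and F-Measure against a reference alignment."""
    true_positives = len(found & expected)
    precision = true_positives / len(found) if found else 0.0
    recall = true_positives / len(expected) if expected else 0.0
    denominator = precision + recall
    f_measure = 2 * precision * recall / denominator if denominator else 0.0
    return precision, recall, f_measure


# Toy schemas (illustrative data only, not from the paper's data sets).
source = {
    "emp_name": ["Alice Tan", "Omar Aziz"],
    "salary": ["4200.50", "3100.00"],
    "hired": ["2015-03-01", "2017-11-20"],
}
target = {
    "full_name": ["John Lee", "Siti Noor"],
    "pay": ["5000.00", "2750.25"],
    "joined": ["01/03/2015", "20/11/2017"],
}
expected = {("emp_name", "full_name"), ("salary", "pay"), ("hired", "joined")}

found = match_attributes(source, target)
precision, recall, f_measure = evaluate(found, expected)
print(sorted(found), precision, recall, f_measure)
```

In this toy case the two date attributes use different formats, so a purely syntactic matcher misses the pair and Recall drops while Precision stays high. This is one plausible illustration of the kind of weakness a semantic technique might address, not a result reported by the study.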

Copyright © 2018