Loading [MathJax]/extensions/TeX/mhchem.js
A Semi-automatic Approach to Reduce Uncertainty of Schema Matching | IEEE Conference Publication | IEEE Xplore
IEEE Xplore has reached the limit on seats for your organization so has automatically paused access. You can continue to browse and search. Please try again shortly or contact us if you have any questions.

A Semi-automatic Approach to Reduce Uncertainty of Schema Matching

Publisher: IEEE

Abstract:

Schema matching is a prime problem in data integration domain. Some well-known automated tools have been provided to accomplish the task of schema matching, but the resul...View more

Abstract:

Schema matching is a prime problem in data integration domain. Some well-known automated tools have been provided to accomplish the task of schema matching, but the results generated by these tools are often uncertain. The uncertainty is universally inherent because of the inability of schema to fully capture the semantics of the represented data. We propose the method to a semi-automatic approach to reduce the uncertainty of schema matching. Our experimental results shows that the approach is able to reduce the uncertainty and improve the precision and recall.
Date of Conference: 08-10 July 2016
Date Added to IEEE Xplore: 03 November 2016
ISBN Information:
Publisher: IEEE
Conference Location: Beijing, China

I. Introduction

Schema matching refers to get the mapping relation between elements of the two given schemata. As is shown in Figure 1, we give an example of schema matching. Schema matching is always an important issue in many fields, such as data integration, data warehouse and electronic commerce. A lot of significant researches have been proposed, and some of the ideas have been implemented as schema matching tools. COMA [8], iMap [7] and Clio [6] are classical among the proposed methods, and these methods are respectively based on the schema semantics or schemata instance information. It cannot be denied that these excellent methods can give comparatively accurate answers, but deficiencies inevitably exist. The accuracy of these methods is generally low, namely the high uncertainty that exists in other fields [13], [14]. The prime causes of the uncertainty's occurrence is the fuzziness of data's descriptions between two heterogeneous databases and the ambiguity of schema's descriptions between the source schemas and the target schemas [13] [14].

References

References is not available for this document.