Property-Based Semantic Reconciliation of Heterogeneous Information Sources

Integrating information from diverse sources is of great importance in the database area. The main difficulty in information integration is reconciling data semantics. Common approaches to semantic reconciliation are based on first identifying similar entity types in various sources, and then reconciling entity type properties (attributes and relationships). Such approaches assume all instances to be reconciled belong to well-defined types. We suggest an alternative approach based on two fundamental principles. First, reconciliation does not require that instances be assigned to specific types. Instead, sources can be reconciled by analyzing similarities of properties. Second, properties that appear different may be manifestations of a higher-level property that has the same meaning across sources. We present the fundamental ideas underlying our approach, analyze its potential advantages, suggest how the approach can be formalized, demonstrate with examples the feasibility of using it for semantic reconciliation, and suggest directions for further research.

[1]  Renée J. Miller Using schematically heterogeneous structures , 1998, SIGMOD '98.

[2]  CastanoS.,et al.  Conceptual schema analysis , 1998 .

[3]  Silvana Castano,et al.  Global Viewing of Heterogeneous Data Sources , 2001, IEEE Trans. Knowl. Data Eng..

[4]  Ron Weber,et al.  On the deep structure of information systems , 1995, Inf. Syst. J..

[5]  Arnon Rosenthal,et al.  Using semantic values to facilitate interoperability among heterogeneous information systems , 1994, TODS.

[6]  Leo Obrst,et al.  Unpacking the semantics of source and usage to perform semantic reconciliation in large-scale information systems , 1999, SGMD.

[7]  Yair Wand,et al.  Emancipating instances from the tyranny of classes in information modeling , 2000, TODS.

[8]  Veda C. Storey,et al.  An ontological analysis of the relationship construct in conceptual modeling , 1999, TODS.

[9]  Ron Weber,et al.  On the ontological expressiveness of information systems analysis and design grammars , 1993, Inf. Syst. J..

[10]  Stefano Spaccapietra,et al.  Issues and approaches of database integration , 1998, CACM.

[11]  Jeong-Oog Lee,et al.  SemQL: a semantic query language for multidatabase systems , 1999, CIKM '99.

[12]  Silvana Castano,et al.  Conceptual schema analysis: techniques and applications , 1998, TODS.

[13]  Santtu Toivonen,et al.  Using RDF(S) to provide multiple views into a single ontology , 2001, SemWeb.

[14]  Carole A. Goble,et al.  Conceptual Open Hypermedia = The Semantic Web? , 2001, SemWeb.

[15]  Ron Weber,et al.  An Ontological Model of an Information System , 1990, IEEE Trans. Software Eng..

[16]  Nick Roussopoulos,et al.  Interoperability of multiple autonomous databases , 1990, CSUR.

[17]  William W. Cohen Integration of heterogeneous databases without common domains using queries based on textual similarity , 1998, SIGMOD '98.

[18]  Aris M. Ouksel,et al.  A classification of semantic conflicts in heterogeneous database systems , 1995, J. Organ. Comput..

[19]  Yair Wand,et al.  A Proposal for a Formal Model of Objects , 1989, Object-Oriented Concepts, Databases, and Applications.

[20]  Steffen Staab,et al.  Learning Ontologies for the Semantic Web , 2001 .

[21]  Xiaolei Qian Semantic interoperation via intelligent mediation , 1993, Proceedings RIDE-IMS `93: Third International Workshop on Research Issues in Data Engineering: Interoperability in Multidatabase Systems.

[22]  Kevin Chen-Chuan Chang,et al.  Interoperability for digital libraries worldwide , 1998, CACM.

[23]  Chris Clifton,et al.  Experience with a Combined Approach to Attribute-Matching Across Heterogeneous Databases , 1997, DS-7.

[24]  Amar Gupta,et al.  A Methodology for Integration of Heterogeneous Databases , 1994, IEEE Trans. Knowl. Data Eng..