An a Priori Approach for Automatic Integration of Heterogeneous and Autonomous Databases

Data integration is the process that gives users access to multiple data sources through queries against a global schema. The semantic heterogeneity has been identified as the most important and toughest problem when integrating various sources. The mapping between the global schema and local schemas was done manually in the first generation of integrated systems, when ontologies are not used to make explicit data meaning. It is semi automatic when ontologies and ontology mappings are defined at integration level. In this paper, we propose a fully automatic integration approach based on ontologies. It supposes that each data source contains a formal ontology that references a shared ontology. The relationships between each local ontology and the shared ontology are defined at the database design time and also embedded in each source. We assume that a domain ontology exists, but each source may extend it by adding new concepts and properties. Our approach is currently prototyped in various environments: OODB, ORDB, and RDB.

[1]  Dieter Fensel,et al.  A Two-Layered Integration Approach for Product Information in B2B E-commerce , 2001, EC-Web.

[2]  Ioana Manolescu,et al.  Active XML: Peer-to-Peer Data and Web Services Integration , 2002, VLDB.

[3]  Erhard Rahm,et al.  On Matching Schemas Automatically , 2001 .

[4]  Trevor J. M. Bench-Capon,et al.  Resolving Ontological Heterogeneity in the KRAFT Project , 1999, DEXA.

[5]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[6]  Joann J. Ordille,et al.  The World Wide Web as a Collection of Views: Query Processing in the Information Manifold , 1996, VIEWS.

[7]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[8]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[9]  Silvana Castano,et al.  Global Viewing of Heterogeneous Data Sources , 2001, IEEE Trans. Knowl. Data Eng..

[10]  Ken Barker,et al.  Integrating relational database schemas using a standardized dictionary , 2001, SAC.

[11]  GIORGIO TERRACINA,et al.  A uniform methodology for extracting type conflicts and subscheme similarities from heterogeneous databases , 2000, Inf. Syst..

[12]  Farshad Hakimpour,et al.  Global Schema Generation Using Formal Ontologies , 2002, ER.

[13]  François Goasdoué,et al.  The Use of CARIN Language and Algorithms for Information Integration: The PICSEL System , 2000, Int. J. Cooperative Inf. Syst..

[14]  Vipul Kashyap,et al.  Managing Multiple Information Sources through Ontologies: Relationship between Vocabulary Heterogeneity and Loss of Information , 1996, KRDB.

[15]  Nicola Guarino,et al.  Ontological Analysis of Taxonomic Relationships , 2000, ER.

[16]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[17]  Guy Pierra,et al.  From digital libraries to electronic catalogues for engineering and manufacturing , 2003, Int. J. Comput. Appl. Technol..

[18]  Guy Pierra,et al.  Context-explication in conceptual ontologies: the PLIB approach , 2003, ISPE CE.

[19]  Kamalakar Karlapalem,et al.  Some issues in design of data warehousing systems , 2001 .

[20]  Stéphane Bressan,et al.  Context Interchange: New Features and Formalisms for the Intelligent Integration of Information Context Interchange: New Features and Formalisms for the Intelligent Integration of Information , 1997 .

[21]  Heiner Stuckenschmidt,et al.  Ontology-Based Integration of Information - A Survey of Existing Approaches , 2001, OIS@IJCAI.

[22]  Silvana Castano,et al.  Semantic dictionary design for database interoperability , 1997, Proceedings 13th International Conference on Data Engineering.