IBIS: Semantic Data Integration at Work

In this paper we present IBIS (Internet-Based Information System), a system for the semantic integration of heterogeneous data sources, which adopts innovative and state-of-the-art solutions to deal with all aspects of a complex data-integration environment, including query answering under integrity constraints and limitations on source access. IBIS is based on the global-as-view approach, using a relational mediated schema to query the data at the sources. Sources are wrapped so as to provide a relational view on them. A key issue is that the system allows the specification of integrity constraints (modeling constraints in the domain of interest) in the global schema. Since sources are autonomous, the extracted data in general do not satisfy the constraints. IBIS adapts and integrates the data extracted from the sources making use of the constraints in the global schema, so as to answer queries at best with the information available. IBIS deals with limitations in accessing data sources, and exploits techniques developed for querying sources with access limitations in order to retrieve the maximum set of answers. In particular, it may use integrity constraints available on the sources to improve the efficiency of the extraction process.

[1]  Ioana Manolescu,et al.  Query optimization in the presence of limited access patterns , 1999, SIGMOD '99.

[2]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[3]  Laura M. Haas,et al.  Towards heterogeneous multimedia information systems: the Garlic approach , 1995, Proceedings RIDE-DOM'95. Fifth International Workshop on Research Issues in Data Engineering-Distributed Object Management.

[4]  Andrea Calì,et al.  Accessing Data Integration Systems through Conceptual Schemas , 2001, ER.

[5]  Diego Calvanese,et al.  Data Integration in Data Warehousing (Keynote Address) , 2001, CAiSE Workshops.

[6]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[7]  Maurizio Lenzerini,et al.  Source inconsistency and incompleteness in data integration , 2002, KRDB.

[8]  Jennifer Widom,et al.  Maintenance of Materialized Views: Problems, Techniques, and Applications , 1999, IEEE Data Eng. Bull..

[9]  Edward Y. Chang,et al.  Query planning with limited source capabilities , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[10]  Andrea Calì,et al.  Optimized Querying of Integrated Data over the Web , 2002, Engineering Information Systems in the Internet Context.

[11]  Jeffrey D. Ullman,et al.  Capability based mediation in TSIMMIS , 1998, SIGMOD '98.

[12]  Stéphane Bressan,et al.  Context Interchange: New Features and Formalisms for the Intelligent Integration of Information Context Interchange: New Features and Formalisms for the Intelligent Integration of Information , 1997 .

[13]  Edward Y. Chang,et al.  Answering queries with useful bindings , 2001, TODS.

[14]  Inderpal Singh Mumick,et al.  Using Object Matching And Materialization To Integrate Heterogeneous Databases , 1999 .

[15]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[16]  Silvana Castano,et al.  Semantic integration of heterogeneous information sources , 2001, Data Knowl. Eng..

[17]  Andrea Calì,et al.  Data integration under integrity constraints , 2004, Inf. Syst..

[18]  Ron van der Meyden,et al.  Logical Approaches to Incomplete Information: A Survey , 1998, Logics for Databases and Information Systems.

[19]  Matthias Jarke,et al.  Fundamentals of Data Warehouses , 2000, Springer Berlin Heidelberg.

[20]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[21]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.