Knowledge representation for information integration

An information integration system provides a uniform query interface to a collection of distributed and heterogeneous information sources, giving users or other agents the illusion that they interrogate a centralized and homogeneous information system. In this paper, we focus on the use of knowledge representation techniques for building mediators for information integration. A mediator is based on the specification of a single mediated schema describing a domain of interest, and on a set of source descriptions expressing how the content of each source available to the system is related to the domain of interest. These source descriptions, also called mappings because they model the correspondence between the mediated schema and the schemas of the data sources, play a central role in the query answering process. We present two recent information integration systems, namely PICSEL and Xyleme, which are illustrative of two radically different choices concerning the expressivity of the mediated schema.

[1]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[2]  Akhil Kumar,et al.  A dynamic warehouse for XML Data of the Web. , 2001 .

[3]  Michael R. Genesereth,et al.  Infomaster: an information integration system , 1997, SIGMOD '97.

[4]  Diego Calvanese,et al.  Answering Queries Using Views over Description Logics Knowledge Bases , 2000, AAAI/IAAI.

[5]  Serge Abiteboul,et al.  Acquiring XML pages for a WebHouse , 2000, BDA.

[6]  Silvana Castano,et al.  Information Integration: The MOMIS Project Demonstration , 2000, VLDB.

[7]  Craig A. Knoblock,et al.  SIMS: Retrieving and integrating information from multiple sources , 1993, SIGMOD '93.

[8]  Guido Moerkotte,et al.  Efficient Storage of XML Data , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[9]  Marc Friedman,et al.  Efficiently Executing Information-Gathering Plans , 1997, IJCAI.

[10]  Alon Y. Halevy,et al.  Combining Horn Rules and Description Logics in CARIN , 1998, Artif. Intell..

[11]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[12]  Deborah L. McGuinness,et al.  CLASSIC: a structural data model for objects , 1989, SIGMOD '89.

[13]  Divesh Srivastava,et al.  The Information Manifold , 1995 .

[14]  Tova Milo,et al.  Views in a large-scale XML repository , 2002, The VLDB Journal.

[15]  Diego Calvanese,et al.  Answering Queries Using Views in Description Logics , 1999, KRDB.

[16]  Claude Delobel,et al.  Semantic integration in Xyleme: a uniform tree-based approach , 2003, Data Knowl. Eng..

[17]  Francois Goasdoue Reecriture de requetes en termes de vues dans carin et integration d'informations , 2001 .

[18]  Dan Suciu,et al.  Schema mediation in peer data management systems , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[20]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[21]  Y HalevyAlon Answering queries using views: A survey , 2001, VLDB 2001.

[22]  Serge Abiteboul,et al.  Monitoring XML data on the Web , 2001, SIGMOD '01.

[23]  Craig A. Knoblock,et al.  Query processing in the SIMS information mediator , 1997 .

[24]  François Goasdoué,et al.  Compilation and Approximation of Conjunctive Queries by Concept Descriptions , 2002, Description Logics.

[25]  Robert M. MacGregor,et al.  The Loom Knowledge Representation Language. , 1987 .

[26]  Christine Froidevaux,et al.  Repairing Queries in a Mediator Approach , 2000, ECAI.

[27]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[28]  Vipul Kashyap,et al.  Observer: an approach for query processing in global information systems based on interoperation across pre-existing ontologies , 1996, Proceedings First IFCIS International Conference on Cooperative Information Systems.

[29]  François Goasdoué,et al.  The Use of CARIN Language and Algorithms for Information Integration: The PICSEL System , 2000, Int. J. Cooperative Inf. Syst..

[30]  Oren Etzioni,et al.  A softbot-based interface to the Internet , 1994, CACM.

[31]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.