Query Processing in Ontology-Based Peer-to-Peer Systems

The unstructured, heterogeneous and dynamic nature of the Web poses a new challenge to query-answering over multiple data sources. The so-called Semantic Web aims at providing more and semantically richer structures in terms of ontologies and meta-data. A problem that remains is the combined use of heterogeneous sources. In a dynamic environment, it is no longer realistic to assume that the involved data sources act as if they were a single (virtual) source, modelled as a global schema, as is done in classical data integration approaches. In this paper, we propose an alternative approach where we replace the role of a single virtual data source schema with a peer-to-peer approach relying on limited shared (or: overlapping) vocabularies between peers. Since overlaps between vocabularies of peers will be limited and the dynamic nature of the system prohibits the design of accurate mappings, query processing will have to be approximate. We provide a formal model for such approximate query processing based on limited shared vocabularies between peers, and we show how the quality of the approximation can be adjusted in a gradual manner. The result is a flexible architecture for query-processing in heterogenous and dynamic environments, based on a formal foundation. We present the approach and discuss it on the basis of a case study.

[1]  Kevin Chen-Chuan Chang,et al.  Approximate query mapping: Accounting for translation closeness , 2001, The VLDB Journal.

[2]  Silvana Castano,et al.  Semantic integration of semistructured and structured data sources , 1999, SGMD.

[3]  Jérôme Euzenat,et al.  An infrastructure for formally ensuring interoperability in a heterogeneous semantic web , 2001, SWWS.

[4]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[5]  Diego Calvanese,et al.  Description Logics for Information Integration , 2002, Computational Logic: Logic Programming and Beyond.

[6]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[7]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[8]  François Goasdoué,et al.  Rewriting Conjunctive Queries using Views in Description Logics with Existential Restrictions , 2000, Description Logics.

[9]  Heiner Stuckenschmidt Approximate Information Filtering with Multiple Classification Hierarchies , 2002, Int. J. Comput. Intell. Appl..

[10]  Alon Y. Levy Combining artificial intelligence and databases for data integration , 1999 .

[11]  Andrew B. Whinston,et al.  P2P Networking: An Information-Sharing Alternative , 2001, Computer.

[12]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[13]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[14]  H. Stuckenschmidt,et al.  Ontology-Based Information Sharing in Weakly Structured Environments , 2003 .

[15]  Laura M. Haas,et al.  Capabilities-Based Query Rewriting in Mediator Systems , 2004, Distributed and Parallel Databases.

[16]  Ian Horrocks,et al.  A Conjunctive Query Language for Description Logic Aboxes , 2000, AAAI/IAAI.

[17]  Heiner Stuckenschmidt,et al.  Ontology-Based Integration of Information - A Survey of Existing Approaches , 2001, OIS@IJCAI.

[18]  Norman W. Paton,et al.  Estimating the quality of answers when querying over description logic ontologies , 2003, Data Knowl. Eng..

[19]  Francesco M. Donini,et al.  Reasoning in description logics , 1997 .

[20]  Pedro M. Domingos,et al.  Representing and reasoning about mappings between domain models , 2002, AAAI/IAAI.

[21]  Ian Horrocks The FaCT System , 1998, TABLEAUX.

[22]  Asunción Gómez-Pérez,et al.  Ontology Specification Languages for the Semantic Web , 2002, IEEE Intell. Syst..

[23]  Fausto Giunchiglia,et al.  Making Peer Databases Interact - A Vision for an Architecture Supporting Data Coordination , 2002, CIA.

[24]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[25]  Bart Selman,et al.  Knowledge compilation and theory approximation , 1996, JACM.