Efficient Query Routing in RDF/S schema based P2P Systems

P2P computing gains increasing attention lately, since it provides the means for realizing computing systems that scale to very large numbers of participating peers, while ensuring high autonomy and fault-tolerance. However, existing P2P systems offer poor data management facilities, limited to IR-style keyword search using eventually a fixed set of document attributes. Schema-based P2P systems have been proposed to support richer facilities in exchanging, querying and integrating structured and semi-structured data hosted by peers. In this paper, we address routing issues for expressive queries in schema-based P2P systems, where peers advertise their local bases using fragments of a community RDF/S schema (i.e., views). We then introduce an original encoding of RDF/S schema fragments, allowing to efficiently check whether a peer view is subsumed by a query. We rely on this encoding to design an RDF/S view lookup service featuring a centralized and distributed execution over a DHT-based P2P system. We finally evaluate experimentally the proposed framework to demonstrate its scalability for large P2P networks and arbitrary RDF/S schema fragments, as well as, to estimate the number of routing hops required by the two versions of our lookup service.

[1]  David J. DeWitt,et al.  Locating Data Sources in Large Distributed Systems , 2003, VLDB.

[2]  Min Cai,et al.  RDFPeers: a scalable distributed RDF repository based on a structured peer-to-peer network , 2004, WWW '04.

[3]  Vassilis Christophides,et al.  Benchmarking RDF Schemas for the Semantic Web , 2002, SEMWEB.

[4]  Theoni Pitoura,et al.  Towards a Unifying Framework for Complex Query Processing over Structured Peer-to-Peer Data Networks , 2003, DBISP2P.

[5]  Hector Garcia-Molina,et al.  Semantic Overlay Networks for P2P Systems , 2004, AP2PC.

[6]  Vassilis Christophides,et al.  On labeling schemes for the semantic web , 2003, WWW '03.

[7]  Wolfgang Nejdl,et al.  Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks , 2003, WWW '03.

[8]  Peter A. Boncz,et al.  AmbientDB: Relational Query Processing in a P2P Network , 2003, DBISP2P.

[9]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[10]  Vassilis Christophides,et al.  Viewing the Semantic Web through RVL Lenses , 2003, SEMWEB.

[11]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[12]  Johannes Gehrke,et al.  Querying peer-to-peer networks using P-trees , 2004, WebDB '04.

[13]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[14]  Karl Aberer,et al.  GridVine: Building Internet-Scale Semantic Overlay Networks , 2004, SEMWEB.

[15]  Alexander Borgida,et al.  Efficient management of transitive relationships in large data and knowledge bases , 1989, SIGMOD '89.

[16]  Steffen Staab,et al.  SWAP - Ontology-based Knowledge Management with Peer-to-Peer Technology , 2003, WOW.

[17]  Alon Y. Halevy,et al.  Piazza: data management infrastructure for semantic web applications , 2003, WWW '03.

[18]  Beng Chin Ooi,et al.  BATON: A Balanced Tree Structure for Peer-to-Peer Networks , 2005, VLDB.

[19]  Vassilis Christophides,et al.  RQL: a declarative query language for RDF , 2002, WWW.

[20]  Vassilis Christophides,et al.  The ICS-FORTH SWIM: A Powerful Semantic Web Integration Middleware , 2003, SWDB.

[21]  Karl Aberer,et al.  P-Grid: A Self-Organizing Access Structure for P2P Information Systems , 2001, CoopIS.

[22]  Raphael Volz,et al.  A Comparison of RDF Query Languages , 2004, SEMWEB.