Distributed Query Processing in P2P Systems with Incomplete Schema Information

The peer-to-peer (P2P) paradigm has emerged recently, mainly by file sharing systems like Napster or Gnutella and in terms of scalable distributed data structures. Because of the decentralization P2P systems promise an improved scalability and robustness, and they open a new view on data integration approaches, too. By exploiting already available mappings between pairs of peers a new peer joining the systems can immediately participate and access all the available data after establishing a correspondence mapping to at least one other peer. One of the technical challenges in building scalable P2P based integration systems is the efficient processing of queries which is complicated by the locally restricted knowledge about data placement and schema information. In this paper, we address this problem by investigating query processing strategies dealing with incomplete schemas and present results of our experimental evaluation.

[1]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[2]  Björn Þór Jónsson,et al.  Performance tradeoffs for client-server query processing , 1996, SIGMOD '96.

[3]  Wolfgang Nejdl,et al.  Distributed Queries and Query Optimization in Schema-Based P2P-Systems , 2003, DBISP2P.

[4]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[5]  Karl Aberer,et al.  P2P information systems , 2002, Proceedings / International Conference on Data Engineering.

[6]  Donald Kossmann,et al.  The state of the art in distributed query processing , 2000, CSUR.

[7]  David J. DeWitt,et al.  Locating Data Sources in Large Distributed Systems , 2003, VLDB.

[8]  Dan Suciu,et al.  The Piazza peer data management project , 2003, SGMD.

[9]  Hector Garcia-Molina,et al.  Routing indices for peer-to-peer systems , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[10]  Dan Suciu,et al.  What Can Database Do for Peer-to-Peer? , 2001, WebDB.

[11]  Karl Aberer,et al.  Peer-to-peer information systems: concepts and models, state-of-the-art, and future systems , 2001, ESEC/FSE-9.

[12]  Norman W. Paton,et al.  Adaptive Query Processing: A Survey , 2002, BNCOD.

[13]  Peter A. Boncz,et al.  AmbientDB: Relational Query Processing in a P2P Network , 2003, DBISP2P.

[14]  Robert Morris,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM 2001.

[15]  David J. DeWitt,et al.  Putting XML Query Algebras into Context , 2002 .

[16]  David Maier,et al.  Mutant Query Plans , 2002, Inf. Softw. Technol..