P2P Data Management

Peer-to-peer (P2P) computing is a key technology for rebuilding future distributed architectures and has promising application prospects. Because most problems in P2P systems come down to data placement and retrieval, P2P data management has recently become an active topic in the database community. This paper first describes the advantages of P2P systems and then presents the goals of P2P data management research. Next, it surveys that research from three facets: P2P information retrieval, P2P database-style queries, and P2P continuous queries. For P2P database-style queries in particular, index construction methods, semantic coordination, query semantics, query processing strategies, supported query types, and query optimization are discussed in detail. Finally, issues requiring further study are identified.
