A conceptual model for data management and distribution in peer-to-peer systems

While the Peer-to-Peer (P2P) model has gained significant attention in distributed computing, it is also expected to be a powerful model for information sharing. P2P systems are expected to provide extensive, reliable computational resources and scalable accessibility. Data management and distribution in such systems require storage, replication, data modeling, indexing, querying, retrieval, streaming, and topology management. Although many data management strategies have been proposed in recent years, they have not been investigated with respect to a common model for P2P systems. Because the services provided by P2P systems are so diverse, it is very challenging to devise a common layer-based model for all of them. In this paper, we first propose a conceptual model for P2P systems, and then classify and summarize data management and distribution strategies with reference to this model. The horizontal layers of the model correspond to the modules of a P2P system, whereas the columns correspond to the services provided. The modules are the base P2P service, storage, indexing, logical, service, and application modules. The services are security, querying, publish, join/leave, collaboration, and streaming. The paper concludes with a comprehensive list of the data management and distribution strategies used in existing P2P systems.
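
The layer-by-service structure described in the abstract can be pictured as a grid in which each cell collects the strategies that fall under one module and one service. The following is a minimal sketch under stated assumptions, not the paper's own formalization: the function names `empty_grid` and `classify` are hypothetical, and the example placement of a strategy in a cell is purely illustrative.

```python
from itertools import product

# Module layers and service columns, in the order the abstract lists them.
MODULES = ["base P2P service", "storage", "indexing",
           "logical", "service", "application"]
SERVICES = ["security", "querying", "publish",
            "join/leave", "collaboration", "streaming"]


def empty_grid():
    """Return one empty cell per (module, service) pair."""
    return {(m, s): [] for m, s in product(MODULES, SERVICES)}


def classify(grid, module, service, strategy):
    """Record a strategy in the cell for the given module and service."""
    if (module, service) not in grid:
        raise KeyError(f"unknown cell: {(module, service)}")
    grid[(module, service)].append(strategy)


grid = empty_grid()
# Hypothetical placement, chosen only to illustrate the idea.
classify(grid, "storage", "publish", "replication")
```

A dictionary keyed by (module, service) pairs keeps the sketch small; a real classification might well let one strategy span several cells.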
