P2P Data Replication: Techniques and Applications

Peer-to-Peer (P2P) computing systems offer many of the advantages of decentralized distributed systems but suffer from limited availability and reliability. To improve both, data replication techniques are commonplace in P2P computing systems. Replication is best seen as a family of techniques: full documents or just chunks can be replicated. Because the same data can be found at multiple peers, it remains available even if some peers fail. Consistency, however, is a challenge in replication systems that allow dynamic updates of replicas. Fundamental to any replication technique is the degree of replication (full vs. partial), as well as the source of the updates and the way updates are propagated in the system. Because distributed systems differ in their characteristics and in their system and application requirements, a variety of data replication techniques have been proposed in the distributed computing field. One important distributed computing paradigm is that of P2P systems, which are distinguished by their large scale and unreliable nature. In this chapter we study some data replication techniques and requirements for different P2P applications. We identify several contexts and use cases where data replication can greatly support collaboration. We also discuss existing optimistic replication solutions and P2P replication strategies and analyze their advantages and disadvantages. Finally, we propose a fuzzy-based system for finding the best replication factor in a P2P network and evaluate its performance.
