Policies for Efficient Data Replication in P2P Systems

This paper addresses the problem of maintaining replicated data in large scale P2P systems. Although this topic has been extensively studied in the literature, to maintain replicated data in this setting, in an efficient manner, still remains a significant challenge. This paper proposes novel policies to address this problem and evaluates their performance against different criteria, such as monitoring costs, data transfer costs, and load unbalance costs. We show that one of these new policies significantly outperforms previous work. Interestingly, this policy is based on a somehow counter-intuitive approach, that uses less reliable nodes to store the most accessed data items. The insights to derive this policy were obtained from an in depth analysis of existing solutions, that is also captured in the paper.

[1]  Pierre Sens,et al.  Churn-Resilient Replication Strategy for Peer-to-Peer Distributed Hash-Tables , 2009, SSS.

[2]  Andreas Wombacher,et al.  Enabling High Data Availability in a DHT , 2005, 16th International Workshop on Database and Expert Systems Applications (DEXA'05).

[3]  Brian D. Noble,et al.  Exploiting Availability Prediction in Distributed Systems , 2006, NSDI.

[4]  Ivan Beschastnikh,et al.  Scalable consistency in Scatter , 2011, SOSP.

[5]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[6]  Rodrigo Rodrigues,et al.  Proceedings of Hotos Ix: the 9th Workshop on Hot Topics in Operating Systems Hotos Ix: the 9th Workshop on Hot Topics in Operating Systems High Availability, Scalable Storage, Dynamic Peer Networks: Pick Two , 2022 .

[7]  Robbert van Renesse,et al.  Elastic Replication for Scalable Consistent Services , 2011 .

[8]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[9]  João Leitão,et al.  Rollerchain: A DHT for Efficient Replication , 2013, 2013 IEEE 12th International Symposium on Network Computing and Applications.

[10]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[11]  Richard M. Karp,et al.  Load balancing in dynamic structured P2P systems , 2004, IEEE INFOCOM 2004.

[12]  Erwan Le Merrer,et al.  Finding Good Partners in Availability-Aware P2P Networks , 2009, SSS.

[13]  David R. Karger,et al.  Wide-area cooperative storage with CFS , 2001, SOSP.

[14]  Seif Haridi,et al.  ID-Replication for Structured Peer-to-Peer Systems , 2012, Euro-Par.

[15]  Kam-Wing Ng,et al.  Analytical Study on Improving DHT Lookup Performance under Churn , 2006, Sixth IEEE International Conference on Peer-to-Peer Computing (P2P'06).

[16]  Valerio Schiavoni,et al.  Exploiting Node Connection Regularity for DHT Replication , 2011, 2011 IEEE 30th International Symposium on Reliable Distributed Systems.

[17]  Seif Haridi,et al.  Symmetric Replication for Structured Peer-to-Peer Systems , 2005, DBISP2P.