Randomized network coding in distributed storage systems with layered overlay

In this paper, we analyze the performance of a peer-to-peer (P2P) distributed storage architecture based on a layered overlay scheme, where some nodes provide their disk capacities for hosting data fragments generated by other active users. This system has been introduced and studied, by means of simulations, in a previous paper, where the use of randomized network coding has been proposed as an appealing alternative to classical erasure coding to generate the required redundancy in a distributed storage system. In fact, the proposed network coding-based scheme provides proactive network maintenance to guarantee sufficient resource redundancy: this leads to a reduction of maintenance complexity and to a larger amount of free disk space on storage nodes. In the current paper, we provide a preliminary analytical framework that encompasses the use of a few characteristic parameters of the considered distributed storage system, such as the resource availability and the available free disk space portion. Our results show a good agreement between analysis and simulations, thus confirming the effectiveness of our network coding-based approach.

[1]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[2]  Muriel Medard,et al.  How good is random linear coding based distributed networked storage , 2005 .

[3]  Rudolf Ahlswede,et al.  Network information flow , 2000, IEEE Trans. Inf. Theory.

[4]  GhemawatSanjay,et al.  The Google file system , 2003 .

[5]  Christina Fragouli,et al.  Network Coding Applications , 2008, Found. Trends Netw..

[6]  Rodrigo Rodrigues,et al.  High Availability in DHTs: Erasure Coding vs. Replication , 2005, IPTPS.

[7]  Christos Gkantsidis,et al.  Network coding for large scale content distribution , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[8]  Werner Vogels,et al.  Dynamo: amazon's highly available key-value store , 2007, SOSP.

[9]  Michele Amoretti,et al.  DEUS: a discrete event universal simulator , 2009, SimuTools.

[10]  Yeh-Ching Chung,et al.  Malugo: A peer-to-peer storage system , 2010, Int. J. Ad Hoc Ubiquitous Comput..

[11]  Ben Y. Zhao,et al.  Distributed Object Location in a Dynamic Network , 2002, SPAA '02.

[12]  Christina Fragouli,et al.  Network Coding Fundamentals , 2007, Found. Trends Netw..

[13]  Alexandros G. Dimakis,et al.  Network Coding for Distributed Storage Systems , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[14]  ChanYu-Wei,et al.  Malugo: A peer-to-peer storage system , 2010 .

[15]  Ben Y. Zhao,et al.  Pond: The OceanStore Prototype , 2003, FAST.

[16]  Michele Amoretti,et al.  A Practical Network Coding Approach for Peer-to-Peer Distributed Storage , 2010, 2010 IEEE International Symposium on Network Coding (NetCod).

[17]  Michele Amoretti,et al.  A Survey of Peer- to -Peer Overlay Schemes: Effectiveness, Efficiency and Security , 2009 .

[18]  Stefan Savage,et al.  Total Recall: System Support for Automated Availability Management , 2004, NSDI.

[19]  Tracey Ho,et al.  A Random Linear Network Coding Approach to Multicast , 2006, IEEE Transactions on Information Theory.