A Novel Replica Placement Strategy for Data Center Network

With the rapid development of the cloud and data-intensive computing, many data center network are growing more large scale, and the number of servers is increasing at an exponential rate. As commodity-class PCs is used in the current DCN, failures caused by node failure, rack failures, link failures and routing failures become a very common phenomenon in the parallel processing of data blocks, tasks, and job scheduling. DCell is a novel network structure adapted to the DCN, which uses a recursively-defined structure to interconnect servers. Although DCell is fault tolerant, and addresses various failures, it has many questions. If the destination nodes are the failure, no matter what cannot route to them. The existing technology of DCell fail to guarantee durability and reliable for the data on the node. This paper proposes a novel idea that a replica is placed in the node of the layer by the fixed-point single-source shortest path, and there is at least one replica in each layer of the DCell. According to the proposed placement strategy of replicas, we suppose a mechanism to restore lost replica based on the neighboring nodes in the same path. Finally, we simulate and validate algorithm of placement and repair for replica. Experiments show that our algorithm has obvious effect to improve the reliability of data.

[1]  Lei Shi,et al.  Dcell: a scalable and fault-tolerant network structure for data centers , 2008, SIGCOMM '08.

[2]  André Schiper,et al.  Optimistic active replication , 2001, Proceedings 21st International Conference on Distributed Computing Systems.

[3]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[4]  Jason Lee,et al.  Generalized DCell Structure for Load-Balanced Data Center Networks , 2010, 2010 INFOCOM IEEE Conference on Computer Communications Workshops.

[5]  Lili Qiu,et al.  On the placement of Web server replicas , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[6]  Jun Li,et al.  The Generalized DCell Network Structures and Their Graph Properties , 2009 .

[7]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.

[8]  Ming Zhong,et al.  Replication degree customization for high availability , 2008, Eurosys '08.

[9]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[10]  Michal Szymaniak,et al.  Latency-driven replica placement , 2005, The 2005 Symposium on Applications and the Internet.

[11]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[12]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[13]  Jianliang Xu,et al.  Optimal replica placement on transparent replication proxies for read/write data , 2002, Conference Proceedings of the IEEE International Performance, Computing, and Communications Conference (Cat. No.02CH37326).

[14]  Veronika Rehn-Sonigo Optimal Replica Placement in Tree Networks with QoS and Bandwidth Constraints and the Closest Allocation Policy , 2007, ArXiv.

[15]  John A. Chandy A generalized replica placement strategy to optimize latency in a wide area distributed storage system , 2008, DADC '08.

[16]  Luiz André Barroso,et al.  Web Search for a Planet: The Google Cluster Architecture , 2003, IEEE Micro.

[17]  Andreas Haeberlen,et al.  Efficient Replica Maintenance for Distributed Storage Systems , 2006, NSDI.

[18]  Stefan Savage,et al.  Total Recall: System Support for Automated Availability Management , 2004, NSDI.