Datacast: A Scalable and Efficient Reliable Group Data Delivery Service for Data Centers

Reliable Group Data Delivery (RGDD) is a pervasive traffic pattern in data centers. In an RGDD group, a sender needs to reliably deliver a copy of data to all the receivers. Existing solutions either do not scale due to the large number of RGDD groups (e.g., IP multicast) or cannot efficiently use network bandwidth (e.g., end-host overlays). Motivated by recent advances on data center network topology designs (multiple edge-disjoint Steiner trees for RGDD) and innovations on network devices (practical in-network packet caching), we propose Datacast for RGDD. Datacast explores two design spaces: 1) Datacast uses multiple edge-disjoint Steiner trees for data delivery acceleration. 2) Datacast leverages in-network packet caching and introduces a simple soft-state based congestion control algorithm to address the scalability and efficiency issues of RGDD. Our analysis reveals that Datacast congestion control works well with small cache sizes (e.g., 125KB) and causes few duplicate data transmissions (e.g., 1.19%). Both simulations and experiments confirm our theoretical analysis. We also use experiments to compare the performance of Datacast and BitTorrent. In a BCube(4, 1) with 1Gbps links, we use both Datacast and BitTorrent to transmit 4GB data. The link stress of Datacast is 1.01, while it is 1.39 for BitTorrent. By using two Steiner trees, Datacast finishes the transmission in 16.9s, while BitTorrent uses 52s.

[1]  Luigi Rizzo,et al.  pgmcc: a TCP-friendly single-rate multicast , 2000, SIGCOMM.

[2]  Chuanxiong Guo,et al.  Datacast: A Scalable and Efficient Group Data Delivery Service for Data Centers , 2011 .

[3]  Miguel Castro,et al.  SplitStream: high-bandwidth multicast in cooperative environments , 2003, SOSP '03.

[4]  Pierre Fraigniaud,et al.  Arc-Disjoint Spanning Trees on Cube-Connected Cycles Networks , 1991, ICPP.

[5]  R. Graham,et al.  Unlikelihood that minimal phylogenies for a realistic biological study can be constructed in reasonable computational time , 1982 .

[6]  Srinivasan Seshan,et al.  A case for end system multicast , 2002, IEEE J. Sel. Areas Commun..

[7]  Mark Handley,et al.  The Reliable Multicast Design Space for Bulk Data Transfer , 2000, RFC.

[8]  David Wetherall,et al.  Towards an active network architecture , 1996, CCRV.

[9]  Jörg Widmer,et al.  TCP-Friendly Multicast Congestion Control (TFMCC): Protocol Specification , 2006, RFC.

[10]  B. Cohen,et al.  Incentives Build Robustness in Bit-Torrent , 2003 .

[11]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.

[12]  Eugene L. Lawler,et al.  A Faster Algorithm for Finding Edge-Disjoint Branchings , 1983, Inf. Process. Lett..

[13]  Haitao Wu,et al.  BCube: a high performance, server-centric network architecture for modular data centers , 2009, SIGCOMM '09.

[14]  Haitao Wu,et al.  ServerSwitch: A Programmable and High Performance Platform for Data Center Networks , 2011, NSDI.

[15]  Jou-Ming Chang,et al.  Independent Spanning Trees on Multidimensional Torus Networks , 2010, IEEE Transactions on Computers.

[16]  Luigi Rizzo,et al.  pgmcc: a TCP-friendly single-rate multicast congestion control scheme , 2000, SIGCOMM.

[17]  Antony Rowstron,et al.  Symbiotic routing in future data centers , 2010, SIGCOMM 2010.

[18]  GhemawatSanjay,et al.  The Google file system , 2003 .

[19]  Van Jacobson,et al.  Networking named content , 2009, CoNEXT '09.

[20]  Srinivasan Seshan,et al.  Packet caches on routers: the implications of universal redundant traffic elimination , 2008, SIGCOMM '08.

[21]  Jou-Ming Chang,et al.  Parallel construction of optimal independent spanning trees on hypercubes , 2007, Parallel Comput..

[22]  Michael I. Jordan,et al.  Managing data transfers in computer clusters with orchestra , 2011, SIGCOMM.

[23]  Stephen J. Garland,et al.  Active reliable multicast , 1998, Proceedings. IEEE INFOCOM '98, the Conference on Computer Communications. Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies. Gateway to the 21st Century (Cat. No.98.

[24]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[25]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[26]  D. R. Fulkerson,et al.  On edge-disjoint branchings , 1976, Networks.

[27]  Injong Rhee,et al.  CUBIC: a new TCP-friendly high-speed TCP variant , 2008, OPSR.

[28]  ZHANGLi-xia,et al.  A reliable multicast framework for light-weight sessions and application level framing , 1995 .