Datacast: A Scalable and Efficient Reliable Group Data Delivery Service for Data Centers

Reliable Group Data Delivery (RGDD) is a pervasive traffic pattern in data centers. In an RGDD group, a sender needs to reliably deliver a copy of data to all the receivers. Existing solutions either do not scale due to the large number of RGDD groups (e.g., IP multicast) or cannot efficiently use network bandwidth (e.g., end-host overlays). Motivated by recent advances on data center network topology designs (multiple edge-disjoint Steiner trees for RGDD) and innovations on network devices (practical in-network packet caching), we propose Datacast for RGDD. Datacast explores two design spaces: 1) Datacast uses multiple edge-disjoint Steiner trees for data delivery acceleration. 2) Datacast leverages in-network packet caching and introduces a simple soft-state based congestion control algorithm to address the scalability and efficiency issues of RGDD. Our analysis reveals that Datacast congestion control works well with small cache sizes (e.g., 125KB) and causes few duplicate data transmissions (e.g., 1.19%). Both simulations and experiments confirm our theoretical analysis. We also use experiments to compare the performance of Datacast and BitTorrent. In a BCube(4, 1) with 1Gbps links, we use both Datacast and BitTorrent to transmit 4GB data. The link stress of Datacast is 1.01, while it is 1.39 for BitTorrent. By using two Steiner trees, Datacast finishes the transmission in 16.9s, while BitTorrent uses 52s.

[1]  Injong Rhee,et al.  CUBIC: a new TCP-friendly high-speed TCP variant , 2008, OPSR.

[2]  Antony I. T. Rowstron,et al.  Symbiotic routing in future data centers , 2010, SIGCOMM '10.

[3]  Jörg Widmer,et al.  TCP-Friendly Multicast Congestion Control (TFMCC): Protocol Specification , 2006, RFC.

[4]  B. Cohen,et al.  Incentives Build Robustness in Bit-Torrent , 2003 .

[5]  Amin Vahdat,et al.  A scalable, commodity data center network architecture , 2008, SIGCOMM '08.

[6]  Stephen J. Garland,et al.  Active reliable multicast , 1998, Proceedings. IEEE INFOCOM '98, the Conference on Computer Communications. Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies. Gateway to the 21st Century (Cat. No.98.

[7]  David L. Tennenhouse,et al.  Towards an active network architecture , 2007, Comput. Commun. Rev..

[8]  R. Graham,et al.  Unlikelihood that minimal phylogenies for a realistic biological study can be constructed in reasonable computational time , 1982 .

[9]  Srinivasan Seshan,et al.  Packet caches on routers: the implications of universal redundant traffic elimination , 2008, SIGCOMM '08.

[10]  Luigi Rizzo,et al.  pgmcc: a TCP-friendly single-rate multicast , 2000, SIGCOMM.

[11]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[12]  Jou-Ming Chang,et al.  Independent Spanning Trees on Multidimensional Torus Networks , 2010, IEEE Transactions on Computers.

[13]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[14]  Eugene L. Lawler,et al.  A Faster Algorithm for Finding Edge-Disjoint Branchings , 1983, Inf. Process. Lett..

[15]  Michael I. Jordan,et al.  Managing data transfers in computer clusters with orchestra , 2011, SIGCOMM 2011.

[16]  Haitao Wu,et al.  BCube: a high performance, server-centric network architecture for modular data centers , 2009, SIGCOMM '09.

[17]  Haitao Wu,et al.  ServerSwitch: A Programmable and High Performance Platform for Data Center Networks , 2011, NSDI.

[18]  Miguel Castro,et al.  SplitStream: high-bandwidth multicast in cooperative environments , 2003, SOSP '03.

[19]  Chen Chen,et al.  Datacast: A Scalable and Efficient Reliable Group Data Delivery Service for Data Centers , 2012, IEEE Journal on Selected Areas in Communications.

[20]  Chuanxiong Guo,et al.  Datacast: A Scalable and Efficient Group Data Delivery Service for Data Centers , 2011 .

[21]  Srinivasan Seshan,et al.  A case for end system multicast , 2002, IEEE J. Sel. Areas Commun..

[22]  Van Jacobson,et al.  Networking named content , 2009, CoNEXT '09.

[23]  Mark Handley,et al.  The Reliable Multicast Design Space for Bulk Data Transfer , 2000, RFC.

[24]  Steven McCanne,et al.  A reliable multicast framework for light-weight sessions and application level framing , 1995, SIGCOMM '95.

[25]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.