Collective Receiver-Initiated Multicast for Grid Applications

Grid applications often need to distribute large amounts of data efficiently from one cluster to multiple others (multicast). Existing sender-initiated methods arrange nodes in optimized tree structures, based on external network monitoring data. This dependence on monitoring data severely impacts both ease of deployment and adaptivity to dynamically changing network conditions. In this paper, we present Robber, a collective, receiver-initiated, high-throughput multicast approach inspired by the BitTorrent protocol. Unlike BitTorrent, Robber is specifically designed to maximize the throughput between multiple cluster computers. Nodes in the same cluster work together as a collective that tries to steal data from peer clusters. Instead of using potentially outdated monitoring data, Robber automatically adapts to the currently achievable bandwidth ratios. Within a collective, nodes automatically tune the amount of data they steal remotely to their relative performance. Our experimental evaluation compares Robber to BitTorrent, to Balanced Multicasting, and to its predecessor MOB. Balanced Multicasting optimizes multicast trees based on external monitoring data, while MOB uses collective, receiver-initiated multicast with static load balancing. We show that both Robber and MOB outperform BitTorrent. They are competitive with Balanced Multicasting as long as the network bandwidth remains stable, and outperform it by wide margins when bandwidth changes dynamically. In large environments and heterogeneous clusters, Robber outperforms MOB.

[1]  Jason Maassen,et al.  Satin: Simple and Efficient Java-based Grid Programming , 2005, Scalable Comput. Pract. Exp..

[2]  Miguel Castro,et al.  SplitStream: high-bandwidth multicast in cooperative environments , 2003, SOSP '03.

[3]  Klara Nahrstedt,et al.  Maxmin overlay multicast: rate allocation and tree construction , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[4]  Alan M. Frieze,et al.  On the connectivity of random m-orientable graphs and digraphs , 1982, Comb..

[5]  Amin Vahdat,et al.  Bullet: high bandwidth data dissemination using an overlay mesh , 2003, SOSP '03.

[6]  Vinay S. Pai,et al.  Chainsaw: Eliminating Trees from Overlay Multicast , 2005, IPTPS.

[7]  Kai Lai Chung,et al.  Markov Chains with Stationary Transition Probabilities , 1961 .

[8]  Jason Maassen,et al.  Smartsockets: solving the connectivity problems in grid computing , 2007, HPDC '07.

[9]  Christos Gkantsidis,et al.  Network coding for large scale content distribution , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[10]  Marcel Worring,et al.  High-Performance Distributed Image and Video Content Analysis with Parallel-Horus , 2007 .

[11]  Rauf Izmailov,et al.  Fast Parallel File Replication in Data Grid , 2004 .

[12]  Antonio J. Plaza,et al.  Commodity cluster-based parallel processing of hyperspectral imagery , 2006, J. Parallel Distributed Comput..

[13]  B. Cohen,et al.  Incentives Build Robustness in Bit-Torrent , 2003 .

[14]  Richard Wolski,et al.  Experiences with predicting resource performance on-line in computational grid settings , 2003, PERV.

[15]  Reuven Cohen,et al.  A unicast-based approach for streaming multicast , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[16]  Jason Maassen,et al.  Ibis: a flexible and efficient Java‐based Grid programming environment , 2005, Concurr. Pract. Exp..

[17]  Mingchu Li,et al.  On the connectedness of peer-to-peer overlay networks , 2005, 11th International Conference on Parallel and Distributed Systems (ICPADS'05).

[18]  Jason Maassen,et al.  Middleware adaptation with the Delphoi service , 2006, Concurr. Comput. Pract. Exp..

[19]  Brian E. Smith,et al.  Massively Parallel BLAST for the Blue Gene / L , 2005 .

[20]  Kees Verstoep,et al.  Efficient reliable multicast on Myrinet , 1996, Proceedings of the 1996 ICPP Workshop on Challenges for Parallel Processing.

[21]  Bronis R. de Supinski,et al.  Exploiting hierarchy in parallel computer networks to optimize collective operation performance , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.

[22]  Klara Nahrstedt,et al.  On achieving optimized capacity utilization in application overlay networks with multiple competing sessions , 2004, SPAA '04.

[23]  Min Sik Kim,et al.  Optimal distribution tree for Internet streaming media , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[24]  Yves Robert,et al.  Broadcast trees for heterogeneous platforms , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[25]  Kees Verstoep,et al.  Network performance-aware collective communication for clustered wide-area systems , 2001, Parallel Comput..

[26]  Henri E. Bal,et al.  MagPIe: MPI's collective communication operations for clustered wide area systems , 1999, PPoPP '99.

[27]  Jun Wang,et al.  TRIBLER: a social‐based peer‐to‐peer system , 2008, IPTPS.

[28]  S. Hemminger Network Emulation with NetEm , 2022 .

[29]  William Chan,et al.  Improving Traffic Locality in BitTorrent via Biased Neighbor Selection , 2006, 26th IEEE International Conference on Distributed Computing Systems (ICDCS'06).

[30]  B Lowekamp,et al.  A Hierarchy of Network Performance Characeristics for Grid Applications and Services , 2003 .

[31]  Thomas R. Gross,et al.  Design, Implementation, and Evaluation of the Remos Network Monitoring System , 2004, Journal of Grid Computing.

[32]  Marcel Worring,et al.  High-Performance Distributed Video Content Analysis with Parallel-Horus , 2007, IEEE MultiMedia.

[33]  Thilo Kielmann,et al.  MOB: zero-configuration high-throughput multicasting for grid applications , 2007, HPDC '07.