Efficient multicast on Myrinet using link-level flow control

This paper studies the implementation of efficient multicast protocols for Myrinet, a switched, wormhole-routed, gigabit-per-second network technology. Because Myrinet does not support multicast in hardware, multicast services must be implemented in software. We present a new, reliable software multicast protocol that uses the network interface to forward multicast traffic efficiently. The protocol is built on top of reliable, flow-controlled channels between pairs of network interfaces. We describe its design and compare it in detail with a previous multicast protocol. We show that our protocol is simpler and scales better than the previous one, a claim supported by extensive performance measurements on a 64-node Myrinet cluster.
