A new scalable and cost-effective congestion management strategy for lossless multistage interconnection networks

In this paper, we propose a new congestion management strategy for lossless multistage interconnection networks that scales as network size and/or link bandwidth increase. Instead of eliminating congestion, our strategy avoids performance degradation beyond the saturation point by eliminating the head-of-line (HOL) blocking produced by congestion trees. This is achieved in a scalable manner by using separate queues for congested flows. These queues are dynamically allocated only when congestion arises and deallocated when it subsides. Performance evaluation results show that our strategy responds to congestion immediately and completely eliminates the performance degradation produced by HOL blocking while using only a small number of additional queues.
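The idea of setting congested flows aside can be illustrated with a minimal sketch. This is not the mechanism evaluated in the paper: the class `InputPort`, the `max_extra_queues` limit, the `blocked` set standing in for congestion detection, and the release policy are all assumptions made for illustration. Packets are modeled as `(destination, sequence_number)` tuples.

```python
from collections import deque


class InputPort:
    """Hypothetical switch input port: one shared FIFO plus a small pool of
    set-aside queues that exist only while a destination is congested."""

    def __init__(self, max_extra_queues=2):
        self.normal = deque()          # shared FIFO for non-congested traffic
        self.extra = {}                # congested destination -> its own queue
        self.max_extra_queues = max_extra_queues  # assumed small, fixed pool

    def enqueue(self, packet):
        # Packets for a congested destination go to its set-aside queue.
        dest = packet[0]
        self.extra.get(dest, self.normal).append(packet)

    def mark_congested(self, dest):
        """Allocate a set-aside queue for dest; return False if the pool is full."""
        if dest in self.extra:
            return True
        if len(self.extra) >= self.max_extra_queues:
            return False
        # Move already-queued packets for dest out of the shared FIFO,
        # preserving their relative order.
        self.extra[dest] = deque(p for p in self.normal if p[0] == dest)
        self.normal = deque(p for p in self.normal if p[0] != dest)
        return True

    def dequeue(self, blocked):
        """Return the next packet that can advance, or None if none can."""
        if self.normal and self.normal[0][0] not in blocked:
            return self.normal.popleft()
        for dest, queue in self.extra.items():
            if queue and dest not in blocked:
                return queue.popleft()
        return None

    def release_idle(self, blocked):
        """Deallocate set-aside queues that are empty and no longer blocked."""
        idle = [d for d, q in self.extra.items() if not q and d not in blocked]
        for dest in idle:
            del self.extra[dest]


def drain(port, blocked):
    """Forward packets until nothing more can advance."""
    sent = []
    while (packet := port.dequeue(blocked)) is not None:
        sent.append(packet)
    return sent
```

In this sketch, a packet for a blocked destination at the head of the shared FIFO stalls everything behind it. Once `mark_congested` moves that flow into its own queue, packets for other destinations advance, and `release_idle` returns the queue to the pool after the flow has drained.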
