Model and simulation of exascale communication networks

Exascale supercomputers will have millions or even hundreds of millions of processing cores and the potential for nearly billion-way parallelism. Exascale compute and data storage architectures will depend critically on the interconnection network. The most popular interconnection network for current and future supercomputer systems is the torus (e.g., the k-ary n-cube). This paper focuses on the modelling and simulation of ultra-large-scale torus networks using Rensselaer's Optimistic Simulation System (ROSS). We validate the model by comparing its communication delays against those measured on the actual Blue Gene/L torus network using 2048 processors. Our performance experiments demonstrate the ability to simulate million-node to billion-node torus networks. The torus network model for a 16-million-node configuration shows a high degree of strong scaling when going from 1024 cores to 32 768 cores on Blue Gene/L, with a peak event rate of nearly 5 billion events per second. We also demonstrate the performance of our torus network model configured with 1 billion nodes on both Blue Gene/L and Blue Gene/P systems. The best observed event rate is 12.36 billion events per second, achieved at 128 K cores on Blue Gene/P.
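To make the k-ary n-cube terminology concrete, the sketch below illustrates the topology only: a torus with n dimensions and k nodes per dimension has k^n nodes, each linked to its two wraparound neighbours in every dimension. The function names (`torus_node_count`, `torus_neighbors`, `torus_hops`) are hypothetical and chosen for illustration; this is not the paper's simulation model, which also accounts for routing, buffering, and link delays.

```python
from typing import List, Tuple

Coord = Tuple[int, ...]


def torus_node_count(k: int, n: int) -> int:
    """Number of nodes in a k-ary n-cube: k nodes along each of n dimensions."""
    return k ** n


def torus_neighbors(coord: Coord, k: int) -> List[Coord]:
    """Direct neighbours of a node: one step in each direction per dimension,
    with wraparound (modulo k). For k >= 3 this yields 2*n distinct nodes."""
    neighbors = []
    for dim in range(len(coord)):
        for step in (-1, 1):
            nxt = list(coord)
            nxt[dim] = (coord[dim] + step) % k
            neighbors.append(tuple(nxt))
    return neighbors


def torus_hops(src: Coord, dst: Coord, k: int) -> int:
    """Minimal hop count between two nodes: per dimension, take the shorter
    way around the ring, then sum over all dimensions."""
    total = 0
    for a, b in zip(src, dst):
        d = abs(a - b)
        total += min(d, k - d)
    return total


if __name__ == "__main__":
    # Illustrative 8-ary 3-cube (an 8 x 8 x 8 torus); sizes are assumptions.
    print(torus_node_count(8, 3))
    print(torus_neighbors((0, 0, 0), 8))
    print(torus_hops((0, 0, 0), (7, 4, 1), 8))
```

The wraparound links are what distinguish a torus from a mesh: in the example, node (0, 0, 0) reaches (7, 0, 0) in a single hop, so the worst-case distance per dimension is roughly k/2 rather than k - 1.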
