Prediction of communication delay in torus networks under multiple time-scale correlated traffic

The efficiency of a large-scale multicomputer is critically dependent on the performance of its interconnection network. Current multicomputers have widely employed the torus as their underlying network topology for efficient interprocessor communication. In order to ensure a successful exploitation of the computational power offered by multicomputers it is essential to obtain a clear understanding of the performance capabilities of their interconnection networks under various system configurations. Analytical modelling plays an important role in achieving this goal. This study proposes a concise performance model for computing communication delay in the torus network with circuit switching in the presence of multiple time-scale correlated traffic which is found in many real-world parallel computation environments and has strong impact on network performance. The tractability and reasonable accuracy of the analytical model demonstrated by extensive simulation experiments make it a practical and cost-effective evaluation tool to investigate network performance with various alternative design solutions and under different operating conditions.

[1]  Anwar Elwalid,et al.  The Importance of Long-Range Dependence of VBR Video Traffic in ATM Traffic Engineering: Myths and Realities , 1996, SIGCOMM.

[2]  Randolph D. Nelson,et al.  Probability, stochastic processes, and queueing theory - the mathematics of computer performance modeling , 1995 .

[3]  Yutaka Takahashi,et al.  Practical Time-Scale Fitting of Self-Similar Traffic with Markov-Modulated Poisson Process , 2001, Telecommun. Syst..

[4]  W. Whitt,et al.  The transient BMAP/G/l queue , 1994 .

[5]  Sudhakar Yalamanchili,et al.  Interconnection Networks: An Engineering Approach , 2002 .

[6]  Walter Willinger,et al.  Long-range dependence in variable-bit-rate video traffic , 1995, IEEE Trans. Commun..

[7]  Péter Kacsuk,et al.  Advanced Computer Architectures , 1997 .

[8]  Anant Agarwal,et al.  Limits on Interconnection Network Performance , 1991, IEEE Trans. Parallel Distributed Syst..

[9]  D. Manjunath,et al.  Input queued switches for variable length packets: analysis for Poisson and self-similar traffic , 2002, Comput. Commun..

[10]  H. Heffes,et al.  A class of data traffic processes — covariance function characterization and related queuing results , 1980, The Bell System Technical Journal.

[11]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[12]  Mahmut T. Kandemir,et al.  Array regrouping and its use in compiling data-intensive, embedded applications , 2004, IEEE Transactions on Computers.

[13]  Péter Kacsuk,et al.  Advanced computer architectures - a design space approach , 1997, International computer science series.

[14]  Imrich Chlamtac,et al.  A Performance Model of a Connection-Oriented Hypercube Interconnection System , 1996, Perform. Evaluation.

[15]  Emmanouel A. Varvarigos,et al.  Circuit Switching with Input Queuing: An Analysis for the d-Dimensional Wraparound Mesh and the Hypercube , 1997, IEEE Trans. Parallel Distributed Syst..

[16]  Mohamed Ould-Khaoua,et al.  Modelling adaptive routing in circuit switched networks , 2001, J. Syst. Archit..

[17]  Mohamed Ould-Khaoua,et al.  Analysis of fully adaptive wormhole routing in tori , 1999, Parallel Comput..

[18]  William J. Dally Virtual-channel flow control , 1990, ISCA '90.

[19]  Lester Lipsky,et al.  Impact of aggregated, self-similar ON/OFF traffic on delay in stationary queueing models (extended version) , 2001, Perform. Evaluation.

[20]  Hamid Sarbazi-Azad,et al.  An accurate analytical model of adaptive wormhole routing in k-ary n-cubes interconnection networks , 2001, Perform. Evaluation.

[21]  John Riordan,et al.  Introduction to Combinatorial Analysis , 1959 .

[22]  Mohamed Ould-Khaoua,et al.  A Performance Model for Duato's Fully Adaptive Routing Algorithm in k-Ary n-Cubes , 1999, IEEE Trans. Computers.

[23]  Walter Willinger,et al.  On the self-similar nature of Ethernet traffic , 1993, SIGCOMM '93.

[24]  Andrew S. Tanenbaum,et al.  Computer Networks , 1981 .

[25]  Chita R. Das,et al.  Performance Analysis of Buffering Schemes on Wormhole Routers , 1997, IEEE Trans. Computers.

[26]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[27]  Juan-Carlos Cano,et al.  Self-similarity in SPLASH-2 workloads on shared memory multiprocessors systems , 2000, Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing.

[28]  Nicolas D. Georganas,et al.  On self-similar traffic in ATM queues: definitions, overflow probability bound, and cell delay distribution , 1997, TNET.

[29]  Walter Willinger,et al.  Experimental queueing analysis with long-range dependent packet traffic , 1996, TNET.

[30]  Bruno Ciciani,et al.  Performance Analysis of Circuit-Switching Interconnection Networks with Deterministic and Adaptive Routing , 1998, Perform. Evaluation.

[31]  Bo Friis Nielsen,et al.  A Markovian approach for modeling packet traffic with long-range dependence , 1998, IEEE J. Sel. Areas Commun..

[32]  John Riordan,et al.  Introduction to Combinatorial Analysis , 1958 .

[33]  Leonard Kleinrock,et al.  Theory, Volume 1, Queueing Systems , 1975 .

[34]  Mark S. Squillante,et al.  Analysis of Job Arrival Patterns and Parallel Scheduling Performance , 1999, Perform. Evaluation.

[35]  Walter Willinger,et al.  Self-Similar Network Traffic and Performance Evaluation , 2000 .

[36]  Wolfgang Fischer,et al.  The Markov-Modulated Poisson Process (MMPP) Cookbook , 1993, Perform. Evaluation.

[37]  Hugh Garraway Parallel Computer Architecture: A Hardware/Software Approach , 1999, IEEE Concurrency.

[38]  Mohamed Ould-Khaoua,et al.  A performance model for wormhole-switched interconnection networks under self-similar traffic , 2004, IEEE Transactions on Computers.

[39]  Dimiter R. Avresky,et al.  Performance evaluation of the ServerNet(R) SAN under self-similar traffic , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.