Simulation-Based Performance Prediction for Large Parallel Machines

We present a performance prediction environment for large-scale parallel computers such as the Blue Gene machine. It consists of BigSim, a parallel simulator for predicting the performance of machines with a very large number of processors, and BigNetSim, which adds a pluggable, detailed contention-based network model. Together, the simulators make it possible to predict performance for very large machines such as Blue Gene/L. We illustrate their utility with validation and prediction studies of several applications, with the simulations themselves run on smaller numbers of processors.
