A tool for performance modeling of parallel programs

Current performance prediction analytical models try to characterize the performance behavior of actual machines through a small set of parameters. In practice, substantial deviations are observed. These differences are due to factors as memory hierarchies or network latency. A natural approach is to associate a different proportionality constant with each basic block, and analogously, to associate different latencies and bandwidths with each "communication block". Unfortunately, to use this approach implies that the evaluation of parameters must be done for each algorithm. This is a heavy task, implying experiment design, timing, statistics, pattern recognition and multi-parameter fitting algorithms. Software support is required. We present a compiler that takes as source a C program annotated with complexity formulas and produces as output an instrumented code. The trace files obtained from the execution of the resulting code are analyzed with an interactive interpreter, giving us, among other information, the values of those parameters.

[1]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.

[2]  Michael T. Heath,et al.  Visualizing the performance of parallel programs , 1991, IEEE Software.

[3]  Tomàs Margalef,et al.  Automatic performance evaluation of parallel programs , 1998, Proceedings of the Sixth Euromicro Workshop on Parallel and Distributed Processing - PDP '98 -.

[4]  Andrea Zavanella,et al.  Predictability of bulk synchronous programs using MPI , 2000, Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing.

[5]  Olaf Bonorden,et al.  The Paderborn university BSP (PUB) library-design, implementation and performance , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[6]  Jack J. Dongarra,et al.  A Portable Programming Interface for Performance Evaluation on Modern Processors , 2000, Int. J. High Perform. Comput. Appl..

[7]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[8]  Ben H. H. Juurlink,et al.  The E-BSP Model: Incorporating General Locality and Unbalanced Communication into the BSP Model , 1996, Euro-Par, Vol. II.

[9]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[10]  Jesús Labarta,et al.  DiP: A Parallel Program Development Environment , 1996, Euro-Par, Vol. II.

[11]  Torsten Suel,et al.  BSPlib: The BSP programming library , 1998, Parallel Comput..

[12]  José Luis Roda García,et al.  h-Relation Models for Current Standard Parallel Platforms , 1998, Euro-Par.

[13]  Thomas Fahringer,et al.  A static parameter based performance prediction tool for parallel programs , 1993, ICS '93.

[14]  Interner Bericht VAMPIR: Visualization and Analysis of MPI Resources , 1996 .