Analytical performance prediction on multicomputers

Multicomputers have the potential to deliver gigaflop performance on many scientific applications. Initial implementations of parallel programs on these machines, however, are often inefficient and require significant optimization before they can harness that potential. Performance prediction tools can indicate which optimizations will increase performance. This paper describes an analytical performance prediction model designed to provide performance data to compilers, programmers, and system architects, helping them make choices that lead to more efficient implementations. Efficient performance prediction tools of this kind can help programmers make better use of the power of multicomputers.
