Multivariate statistical techniques for parallel performance prediction

Performance prediction can play an important role in improving the efficiency of multicomputers in executing scalable parallel applications. An accurate model of program execution time must include detailed algorithmic and architectural characterizations. The exact values for critical model parameters such as message latency and cache miss penalty can often be difficult to determine. This research uses multivariate data analysis to estimate the values of these coefficients in an analytical model. Representing the coefficients as random variables with a specified mean and variance improves the utility of a performance model. Confidence intervals for predicted execution time can be generated using the standard error values for model parameters. Improvements in the model can also be made by investigating the cause of large variance values for a particular architecture.<<ETX>>

[1]  M. R. Sampford,et al.  Introduction to Sampling Theory , 1962 .

[2]  Cherri M. Pancake,et al.  Where are we headed? , 1991, CACM.

[3]  Mark Crovella,et al.  The Search for Lost Cycles: A New Approach to Parallel Program Performance Evaluation , 1993 .

[4]  Trevor Hastie,et al.  Statistical Models in S , 1991 .

[5]  Cherri M. Pancake,et al.  Software Support for Parallel Computing: Where Are We headed? , 1991 .

[6]  William G. Ruesink,et al.  Introduction to Sampling Theory , 1980 .

[7]  Philip J. Hatcher,et al.  Data-Parallel Programming on MIMD Computers , 1991, IEEE Trans. Parallel Distributed Syst..

[8]  Ricardo Bianchini,et al.  Using communication-to-computation ratio in parallel program design and performance prediction , 1992, [1992] Proceedings of the Fourth IEEE Symposium on Parallel and Distributed Processing.

[9]  Henk Sol,et al.  Proceedings of the 54th Hawaii International Conference on System Sciences , 1997, HICSS 2015.

[10]  R. Ruhl,et al.  Balancing interprocessor communication and computation on torus-connected multicomputers running compiler-parallelized code , 1992, Proceedings Scalable High Performance Computing Conference SHPCC-92..

[11]  Thomas Fahringer,et al.  A static parameter based performance prediction tool for parallel programs , 1993, ICS '93.

[12]  M. J. Quinn,et al.  Analytical performance prediction on multicomputers , 1993, Supercomputing '93.

[13]  Richard J. Harris A primer of multivariate statistics , 1975 .

[14]  Barton P. Miller,et al.  Dynamic program instrumentation for scalable performance tools , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[15]  Bruce W. Char,et al.  Maple V Language Reference Manual , 1993, Springer US.

[16]  Rolph E. Anderson,et al.  Multivariate Data Analysis with Readings , 1979 .

[17]  Ken Kennedy,et al.  A static performance estimator to guide data partitioning decisions , 1991, PPOPP '91.

[18]  Mark J. Clement,et al.  Medium Grain Size Applications on Distributed Memory Multicomputers , 1993 .