Automated Performance Prediction of Message-Passing Parallel Programs

The increasing use of massively parallel supercomputers to solve large-scale scientific problems has generated a need for tools that can predict scalability trends of applications written for these machines. Much work has been done to create simple models that represent important characteristics of parallel programs, such as latency, network contention, and communication volume. But many of these methods still require substantial manual effort to represent an application in the model's format. The MK toolkit described in this paper is the result of an on-going effort to automate the formation of analytic expressions of program execution time, with a minimum of programmer assistance. In this paper we demonstrate the feasibility of our approach, by extending previous work to detect and model communication patterns automatically, with and without overlapped computations. The predictions derived from these models agree, within reasonable limits, with execution times of programs measured on the Intel iPSC/860 and Paragon. Further, we demonstrate the use of MK in selecting optimal computational grain size and studying various scalability metrics.

[1]  Michael T. Heath,et al.  Recent developments and case studies in performance visualization using ParaGraph , 1993 .

[2]  Pankaj Mehra,et al.  Automated modeling of message-passing programs , 1994, Proceedings of International Workshop on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[3]  Pankaj Mehra,et al.  Automated scalability analysis of message-passing parallel programs , 1995, IEEE Parallel Distributed Technol. Syst. Appl..

[4]  Jerry C. Yan Performance Tuning with AIMS - An Automated Instrumentation and Monitoring System for Multicomputers , 1994, HICSS.

[5]  Thomas J. LeBlanc,et al.  Parallel performance prediction using lost cycles analysis , 1994, Proceedings of Supercomputing '94.

[6]  Peter C. Jurs,et al.  Mathematica , 2019, J. Chem. Inf. Comput. Sci..

[7]  M. J. Quinn,et al.  Analytical performance prediction on multicomputers , 1993, Supercomputing '93.

[8]  Jenq Kuen Lee,et al.  Sigma II: A Tool Kit for Building Parallelizing Compilers and Performance Analysis Systems , 1992, Programming Environments for Parallel Computing.

[9]  Mark Crovella,et al.  Parallel performance using lost cycles analysis , 1994, SC.

[10]  Pankaj Mehra,et al.  Analysis and Optimization of Software Pipeline Performance on MIMD Parallel Computers , 1996, J. Parallel Distributed Comput..

[11]  Sekhar R. Sarukkai,et al.  Scalability analysis tools for SPMD message-passing parallel programs , 1994, Proceedings of International Workshop on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[12]  J. C. Yan,et al.  Performance tuning with AIMS/spl minus/an Automated Instrumentation and Monitoring System for multicomputers , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.