Automatic analysis of speedup of MPI applications

The intricacy of high performance computing applications has been growing very fast in the last years. Only skilled analysts are able to determine the factors that are undermining the performance of up-to-date applications. Analyst time is a very expensive resource and, for that reason, a strong effort to develop automatic performance analysis methodologies has been made by the scientific community. In this paper, we propose a methodology that is able to automatically detect the main performance problems of applications. This methodology is based on, first, a size reduction of the performance data obtained from the executions and, second, an analytical model obtained from this performance data which fits the speedup of the applications in terms of several parameters related to several performance issues. The paper also shows results obtained from real up-to-date applications and validates the conclusions automatically derived from the methodology.

[1]  Barton P. Miller,et al.  On-line automated performance diagnosis on thousands of processes , 2006, PPoPP '06.

[2]  Bernd Mohr,et al.  Scalable Parallel Trace-Based Performance Analysis , 2006, PVM/MPI.

[3]  Timothy Sherwood,et al.  Wavelet-based phase classification , 2006, 2006 International Conference on Parallel Architectures and Compilation Techniques (PACT).

[4]  Chen Ding,et al.  Locality phase prediction , 2004, ASPLOS XI.

[5]  Jesús Labarta,et al.  Automatic Phase Detection of MPI Applications , 2007, PARCO.

[6]  Dieter Kranzlmüller,et al.  DeWiz - A Modular tool Architecture for Parallel Program Analysis , 2003, Euro-Par.

[7]  Jesús Labarta,et al.  Automatic Structure Extraction from MPI Applications Tracefiles , 2007, Euro-Par.

[8]  Michael Gerndt,et al.  Distributed Application Monitoring for Clustered SMP Architectures , 2003, Euro-Par.

[9]  Wolfgang E. Nagel,et al.  High performance event trace visualization , 2005, 13th Euromicro Conference on Parallel, Distributed and Network-Based Processing.

[10]  Jesús Labarta,et al.  Validation of Dimemas Communication Model for MPI Collective Operations , 2000, PVM/MPI.

[11]  Wolfgang E. Nagel,et al.  VAMPIR: Visualization and Analysis of MPI Resources , 2010 .

[12]  Bernd Mohr,et al.  Automatic Trace-Based Performance Analysis of Metacomputing Applications , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[13]  Jesús Labarta,et al.  DiP: A Parallel Program Development Environment , 1996, Euro-Par, Vol. II.

[14]  Dieter Kranzlmüller,et al.  Tools for Scalable Parallel Program Analysis - Vampir VNG and DeWiz , 2004, DAPSYS.

[15]  Brad Calder,et al.  Discovering and Exploiting Program Phases , 2003, IEEE Micro.