Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications

scalasca is a performance toolset that has been specifically designed to analyze parallel application behavior on large-scale systems, but is also well-suited for small- and medium-scale hpc platforms. scalasca offers an incremental performance-analysis process that integrates runtime summaries with in-depth studies of concurrent behavior via event tracing, adopting a strategy of successively refined measurement configurations. A distinctive feature of scalasca is its ability to identify wait states even for very large processor counts. The current version supports the mpi, Openmp and hybrid programming constructs most widely used in highly-scalable hpc applications.

[1]  Jack Dongarra,et al.  An algebra for cross-experiment performance analysis , 2004 .

[2]  Leonid Oliker,et al.  Scientific Application Performance on Candidate PetaScale Platforms , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[3]  Felix Wolf,et al.  Timestamp Synchronization for Event Traces of Large-Scale Message-Passing Applications , 2007, PVM/MPI.

[4]  Wolfgang E. Nagel,et al.  VAMPIR: Visualization and Analysis of MPI Resources , 2010 .

[5]  Bernd Mohr,et al.  Holistic Hardware Counter Performance Analysis of Parallel Programs , 2005, PARCO.

[6]  Bernd Mohr,et al.  Scalable Parallel Trace-Based Performance Analysis , 2006, PVM/MPI.

[7]  Bernd Mohr,et al.  Large Event Traces in Parallel Performance Analysis , 2006, ARCS Workshops.

[8]  Jack J. Dongarra,et al.  A Portable Programming Interface for Performance Evaluation on Modern Processors , 2000, Int. J. High Perform. Comput. Appl..

[9]  Jesús Labarta,et al.  DiP: A Parallel Program Development Environment , 1996, Euro-Par, Vol. II.

[10]  D. E. Post,et al.  HPC needs a tool strategy , 2005, SE-HPCS '05.

[11]  Bernd Mohr,et al.  Automatic performance analysis of hybrid MPI/OpenMP applications , 2003, Eleventh Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2003. Proceedings..