Performance analysis of multilevel parallel applications on shared memory architectures

In this paper we describe how to apply powerful performance analysis techniques to understand the behavior of multilevel parallel applications. We use the Paraver/OMPItrace performance analysis system for our study. This system consists of two major components: The OMPItrace dynamic instrumentation mechanism, which allows the tracing of processes and threads and the Paraver graphical user interface for inspection and analyses of the generated traces. We apply the system to conduct a detailed comparative study of a benchmark code implemented in five different programming paradigms applicable for shared memory computer architectures.

[1]  James R. Taft,et al.  Achieving 60 GFLOP/s on the production CFD code OVERFLOW-MLP , 2001, Parallel Comput..

[2]  Michael A. Frumkin,et al.  Automatic Generation of OpenMP Directives and Its Application to Computational Fluid Dynamics Codes , 2000, ISHPC.

[3]  Eduard Ayguadé,et al.  NanosCompiler: supporting flexible multilevel parallelism exploitation in OpenMP , 2000 .

[4]  J. Taft PERFORMANCE OF THE OVERFLOW-MLP CFD CODE ON THE NASA AMES 512 CPU ORIGIN SYSTEM , 2000 .

[5]  Eduard Ayguadé,et al.  Automatic multilevel parallelization using OpenMP , 2003, Sci. Program..

[6]  Nacho Navarro,et al.  DITools: Application-level Support for Dynamic Extension and Flexible Composition , 2000, USENIX Annual Technical Conference, General Track.

[7]  Jaswinder Pal Singh,et al.  A Comparison of MPI, SHMEM and Cache-Coherent Shared Address Space Programming Models on a Tightly-Coupled Multiprocessors , 2001, International Journal of Parallel Programming.

[8]  Dimitri J. Mavriplis,et al.  Parallel Performance Investigations of an Unstructured Mesh Navier-Stokes Solver , 2000, Int. J. High Perform. Comput. Appl..

[9]  R.D. Loft,et al.  Terascale Spectral Element Dynamical Core for Atmospheric General Circulation Models , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[10]  Leonid Oliker,et al.  A Comparison of Three Programming Models for Adaptive Applications on the Origin2000 , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[11]  Cos S. Ierotheou,et al.  Computer Aided Parallelisation Tools (CAPTools) - Conceptual Overview and Performance on the Parallelisation of Structured Mesh Codes , 1996, Parallel Comput..