Event-Based Measurement and Analysis of One-Sided Communication

To analyze the correctness and the performance of a program, information about the dynamic behavior of all participating processes is needed. The dynamic behavior can be modeled as a stream of events required for a later analysis including appropriate attributes. Based on this idea, kojak, a trace-based toolkit for performance analysis, records and analyzes the activities of mpi-1 point-to-point and collective communication. To support remote-memory access (rma) hardware in a portable way, mpi-2 introduced a standardized interface for remote memory access. However, potential performance gains come at the expense of more complex semantics. From a programmer's point of view, an mpi-2 data transfer is only completed after a sequence of communication and associated synchronization calls. This paper describes the integration of performance measurement and analysis methods for rma communication into the kojak toolkit. Special emphasis is put on the underlying event model used to represent the dynamic behavior of mpi-2 rma operations. We show that our model reflects the relationships between communication and synchronization more accurately than existing models. In addition, the model is general enough to also cover alternate but simpler rma interfaces, such as shmem and Co-Array Fortran.

[1]  Bernd Mohr,et al.  A Performance Measurement Infrastructure for Co-array Fortran , 2005, Euro-Par.

[2]  Felix Wolf,et al.  Automatic performance analysis on parallel computers with SMP nodes , 2003 .

[3]  Marco Danelutto,et al.  Euro-Par 2004 Parallel Processing , 2004, Lecture Notes in Computer Science.

[4]  William Gropp,et al.  Mpi - The Complete Reference: Volume 2, the Mpi Extensions , 1998 .

[5]  Bernd Mohr,et al.  Efficient Pattern Search in Large Traces Through Successive Refinement , 2004, Euro-Par.

[6]  Robert W. Numrich,et al.  Co-array Fortran for parallel programming , 1998, FORF.

[7]  Jack Dongarra,et al.  MPI: The Complete Reference , 1996 .

[8]  Karen L. Karavanic,et al.  Performance Tool Support for MPI-2 on Linux , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[9]  Jack Dongarra,et al.  MPI - The Complete Reference: Volume 1, The MPI Core , 1998 .

[10]  Arthur A. Mirin,et al.  A Scalable Implementation of a Finite-Volume Dynamical Core in the Community Atmosphere Model , 2005, Int. J. High Perform. Comput. Appl..

[11]  Marc Snir,et al.  The MPI core , 1998 .

[12]  Bernd Mohr,et al.  Automatic performance analysis of hybrid MPI/OpenMP applications , 2003, Eleventh Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2003. Proceedings..

[13]  Stephen Gilmore,et al.  Flexible Skeletal Programming with eSkel , 2005, Euro-Par.

[14]  Allen D. Malony,et al.  Portable profiling and tracing for parallel, scientific applications using C++ , 1998, SPDT '98.