Automatic Trace-Based Performance Analysis of Metacomputing Applications

The processing power and memory capacity of independent and heterogeneous parallel machines can be combined to form a single parallel system that is more powerful than any of its constituents. However, achieving satisfactory application performance on such a metacomputer is hard because the high latency of inter-machine communication as well as differences in hardware of constituent machines may introduce various types of wait states. In our earlier work, we have demonstrated that automatic pattern search in event traces can identify the sources of wait states in parallel applications running on a single computer. In this article, we describe how this approach can be extended to metacomputing environments with special emphasis on performance problems related to inter-machine communication. In addition, we demonstrate the benefits of our solution using a real-world multi-physics application.

[1]  Michael M. Resch,et al.  Performance Analysis of a Parallel Application in the GRID , 2003, International Conference on Computational Science.

[2]  Thomas Bemmerl,et al.  The New Multidevice Architecture of MetaMPICH in the Context of Other Approaches to Grid-Enabled MPI , 2006, PVM/MPI.

[3]  Péter Kacsuk,et al.  Prewsentation and Analysis of Grid Performance Data , 2003, Euro-Par.

[4]  Jesús Labarta,et al.  DiP: A Parallel Program Development Environment , 1996, Euro-Par, Vol. II.

[5]  Felix Wolf,et al.  Automatic performance analysis on parallel computers with SMP nodes , 2003 .

[6]  Thomas Fahringer,et al.  SCALEA-G: A unified monitoring and performance analysis system for the grid , 2004 .

[7]  Wolfgang E. Nagel,et al.  The unicore grid and its options for performance analysis , 2004 .

[8]  Hong Linh Truong,et al.  SCALEA-G: A Unified Monitoring and Performance Analysis System for the Grid , 2004, European Across Grids Conference.

[9]  Flaviu Cristian,et al.  Probabilistic clock synchronization , 1989, Distributed Computing.

[10]  Wolfgang Ziegler,et al.  Reliable Orchestration of Distributed MPI-Applications in a UNICORE-Based Grid with MetaMPICH and MetaScheduling , 2006, PVM/MPI.

[11]  Jack J. Dongarra,et al.  An algebra for cross-experiment performance analysis , 2004, International Conference on Parallel Processing, 2004. ICPP 2004..

[12]  Bernd Mohr,et al.  Automatic performance analysis of hybrid MPI/OpenMP applications , 2003, Eleventh Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2003. Proceedings..

[13]  Marian Bubak,et al.  Performance Tools for the Grid: State of the Art and Future , 2004 .

[14]  Matthias S. Müller,et al.  Performance Prediction in a Grid Environment , 2003, European Across Grids Conference.

[15]  Bernd Mohr,et al.  Scalable Parallel Trace-Based Performance Analysis , 2006, PVM/MPI.

[16]  Andrzej M. Goscinski,et al.  Using an Enterprise Grid for Execution of MPI Parallel Applications - A Case Study , 2006, PVM/MPI.