Design and implementation of a parallel performance data management framework

Empirical performance evaluation of parallel systems and applications can generate significant amounts of performance data and analysis results from multiple experiments as performance is investigated and problems diagnosed. Hence, the management of performance information is a core component of performance analysis tools. To better support tool integration, portability; and reuse, there is a strong motivation to develop performance data management technology that can provide a common foundation for performance data storage, access, merging, and analysis. This paper presents the design and implementation of the performance data management framework (PerfDMF). PerfDMF addresses objectives of performance tool integration, interoperation, and reuse by providing common data storage, access, and analysis infrastructure for parallel performance profiles. PerfDMF includes an extensible parallel profile data schema and relational database schema, a profile query and analysis programming interface, and an extendible toolkit for profile import/export and standard analysis. We describe the PerfDMF objectives and architecture, give detailed explanation of the major components, and show examples of PerfDMF application.

[1]  Susan L. Graham,et al.  Gprof: A call graph execution profiler , 1982, SIGPLAN '82.

[2]  Dennis Gannon,et al.  SIEVE: A Performance Debugging Environment for Parallel Programs , 1993, J. Parallel Distributed Comput..

[3]  Brian H. Larose The development and implementation of a performance database server , 1993, CS / Technical report / Knoxville / University of Tennessee / Computer Science Department.

[4]  Anthony J. G. Hey,et al.  The Graphical Benchmark Information Service , 1995, Sci. Program..

[5]  Barton P. Miller,et al.  The Paradyn Parallel Performance Measurement Tool , 1995, Computer.

[6]  Ying Zhang,et al.  SvPablo: A Multi-language Performance Analysis System , 1998, Computer Performance Evaluation.

[7]  Jack J. Dongarra,et al.  A Portable Programming Interface for Performance Evaluation on Modern Processors , 2000, Int. J. High Perform. Comput. Appl..

[8]  Barton P. Miller,et al.  A Framework for Multi-Execution Performance Tuning , 2001, Scalable Comput. Pract. Exp..

[9]  Luiz De Rose The Hardware Performance Monitor Toolkit , 2001, Euro-Par.

[10]  Bernd Mohr,et al.  Automatic Performance Analysis of SMP Cluster Applications , 2001 .

[11]  Jeffrey S. Vetter,et al.  Scalable Analysis Techniques for Microprocessor Performance Counter Metrics , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[12]  Franz Franchetti,et al.  On using ZENTURIO for performance and parameter studies on cluster and Grid architectures , 2003, Eleventh Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2003. Proceedings..

[13]  Xingfu Wu,et al.  Prophesy: an infrastructure for performance analysis and modeling of parallel and grid applications , 2003, PERV.

[14]  Robert J. Fowler,et al.  HPCToolkit : Multi-platform Tools for Profile-based Performance Analysis , 2003 .

[15]  Allen D. Malony,et al.  ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis , 2003, Euro-Par.

[16]  Barton P. Miller,et al.  On-line monitoring systems and computer tool interoperability , 2004 .

[17]  Chong-Sun Hwang,et al.  Efficient Garbage Collection Schemes for Causal Message Logging with Independent Checkpointing , 2004, The Journal of Supercomputing.

[18]  Robert J. Fowler,et al.  HPCVIEW: A Tool for Top-down Analysis of Node Performance , 2002, The Journal of Supercomputing.

[19]  J. Dongarra,et al.  An algebra for cross-experiment performance analysis , 2004, International Conference on Parallel Processing, 2004. ICPP 2004..

[20]  Karen L. Karavanic,et al.  PPerfGrid: A Grid Services-Based Tool for the Exchange of Heterogeneous Parallel Performance Data , 2004 .