Visualizing Large‐scale Parallel Communication Traces Using a Particle Animation Technique

Large‐scale scientific simulations require execution on parallel computing systems in order to yield useful results in a reasonable time frame. But parallel execution adds communication overhead. The impact that this overhead has on performance may be difficult to gauge, as parallel application behaviors are typically harder to understand than the sequential types. We introduce an animation‐based interactive visualization technique for the analysis of communication patterns occurring in parallel application execution. Our method has the advantages of illustrating the dynamic communication patterns in the system as well as a static image of MPI (Message Passing Interface) utilization history. We also devise a data streaming mechanism that allows for the exploration of very large data sets. We demonstrate the effectiveness of our approach scaling up to 16 thousand processes using a series of trace data sets of ScaLAPACK matrix operations functions.

[1]  Eileen Kraemer,et al.  Toward flexible control of the temporal mapping from concurrent program events to animations , 1994, Proceedings of 8th International Parallel Processing Symposium.

[2]  John T. Stasko,et al.  Effectiveness of Animation in Trend Visualization , 2008, IEEE Transactions on Visualization and Computer Graphics.

[3]  Barbara Tversky,et al.  Animation: can it facilitate? , 2002, Int. J. Hum. Comput. Stud..

[4]  Bernd Mohr,et al.  KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Programs , 2003, Euro-Par.

[5]  Allen D. Malony,et al.  An Approach to Creating Performance Visualizations in a Parallel Profile Analysis Tool , 2011, Euro-Par Workshops.

[6]  Michael T. Heath,et al.  ParaGraph: A Tool for Visualizing Performance of Parallel Programs , 2007 .

[7]  Daniel Kressner,et al.  A Novel Parallel QR Algorithm for Hybrid Distributed Memory HPC Systems , 2010, SIAM J. Sci. Comput..

[8]  John T. Stasko,et al.  Rethinking the evaluation of algorithm animations as learning aids: an observational study , 2001, Int. J. Hum. Comput. Stud..

[9]  Kwan-Liu Ma,et al.  code_swarm: A Design Study in Organic Software Visualization , 2009, IEEE Transactions on Visualization and Computer Graphics.

[10]  Daniel A. Reed,et al.  Virtue: Performance Visualization of Parallel and Distributed Applications , 1999, Computer.

[11]  Ewing Lusk,et al.  Studying parallel program behavior with upshot , 1991 .

[12]  Kwan-Liu Ma,et al.  Visual Analysis of Inter-Process Communication for Large-Scale Parallel Computing , 2009, IEEE Transactions on Visualization and Computer Graphics.

[13]  Martin Schulz,et al.  Visualizing Network Traffic to Understand the Performance of Massively Parallel Simulations , 2012, IEEE Transactions on Visualization and Computer Graphics.

[14]  Jaeyoung Choi,et al.  Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines , 1994, Sci. Program..

[15]  Lucas Mello Schnorr,et al.  3D approach to the visualization of parallel applications and Grid monitoring information , 2008, 2008 9th IEEE/ACM International Conference on Grid Computing.

[16]  Michael T. Heath,et al.  Visualizing the performance of parallel programs , 1991, IEEE Software.

[17]  M. Cooper,et al.  Revealing structure within clustered parallel coordinates displays , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[18]  Kwan-Liu Ma,et al.  Visual analysis of I/O system behavior for high-end computing , 2011, LSAP '11.

[19]  Elena Breitmoser,et al.  A performance study of the PLAPACK and ScaLAPACK Eigensolvers on HPCx for the standard problem , 2003 .

[20]  Bernd Mohr,et al.  Automatic Performance Analysis of MPI Applications Based on Event Traces , 2000, Euro-Par.

[21]  Ewing Lusk,et al.  Performance analysis of MPI programs , 1994 .

[22]  Paul Green-Armytage,et al.  A Colour Alphabet and the Limits of Colour Coding , 2010 .

[23]  Allen D. Malony,et al.  The Tau Parallel Performance System , 2006, Int. J. High Perform. Comput. Appl..

[24]  Bernd Mohr,et al.  Automatic performance analysis of hybrid MPI/OpenMP applications , 2003, Eleventh Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2003. Proceedings..

[25]  Bernd Mohr,et al.  The Scalasca performance toolset architecture , 2010, Concurr. Comput. Pract. Exp..

[26]  William Gropp,et al.  Toward Scalable Performance Visualization with Jumpshot , 1999, Int. J. High Perform. Comput. Appl..

[27]  Vaidy S. Sunderam,et al.  The Dual Timestamping Methodology for Visualizing Distributed Applications , 1995 .

[28]  Wolfgang E. Nagel,et al.  VAMPIR: Visualization and Analysis of MPI Resources , 2010 .

[29]  William Gropp,et al.  An efficient format for nearly constant-time access to arbitrary time intervals in large trace files , 2008, Sci. Program..

[30]  Eileen Kraemer,et al.  A Methodology for Building Application-Specific Visualizations of Parallel Programs , 1993, J. Parallel Distributed Comput..

[31]  Paul Rosen,et al.  Abstract visualization of runtime memory behavior , 2011, 2011 6th International Workshop on Visualizing Software for Understanding and Analysis (VISSOFT).