Visual Analysis of Inter-Process Communication for Large-Scale Parallel Computing

In serial computation, program profiling is often helpful for optimizing key sections of code. When moving to parallel computation, one must consider not only code execution but also communication between processes, which can induce delays that are detrimental to performance. As the number of processes increases, so does the impact of communication delays on performance. For large-scale parallel applications, it is critical to understand how communication affects performance in order to make the code more efficient. Several tools are available for visualizing program execution and communication on parallel systems. These tools generally provide either views that statistically summarize the entire program execution or process-centric views. However, process-centric visualizations do not scale well as the number of processes becomes very large. In particular, the most common representation of parallel processes is a Gantt chart with a row for each process. As the number of processes increases, these charts become difficult to work with and can even exceed the screen resolution. We propose a new visualization approach that affords greater scalability and demonstrate it on systems running with up to 16,384 processes.
