Parallel Computational Steering for HPC Applications Using HDF5 Files in Distributed Shared Memory

Interfacing a GUI driven visualization/analysis package to an HPC application enables a supercomputer to be used as an interactive instrument. We achieve this by replacing the IO layer in the HDF5 library with a custom driver which transfers data in parallel between simulation and analysis. Our implementation using ParaView as the interface, allows a flexible combination of parallel simulation, concurrent parallel analysis, and GUI client, either on the same or separate machines. Each MPI job may use different core counts or hardware configurations, allowing fine tuning of the amount of resources dedicated to each part of the workload. By making use of a distributed shared memory file, one may read data from the simulation, modify it using ParaView pipelines, write it back, to be reused by the simulation (or vice versa). This allows not only simple parameter changes, but complete remeshing of grids, or operations involving regeneration of field values over the entire domain. To avoid the problem of manually customizing the GUI for each application that is to be steered, we make use of XML templates that describe outputs from the simulation (and inputs back to it) to automatically generate GUI controls for manipulation of the simulation.

[1]  John Biddiscombe,et al.  Data Redistribution Using One-sided Transfers to In-Memory HDF5 Files , 2011, EuroMPI.

[2]  John Biddiscombe,et al.  Computational Steering and Parallel Online Monitoring Using RMA through the HDF5 DSM Virtual File Driver , 2011, ICCS.

[3]  John Biddiscombe,et al.  Parallel computational steering and analysis for HPC applications using a paraview interface and the HDF5 DSM virtual file driver , 2011, EGPGV '11.

[4]  Olivier Coulaud,et al.  Toward a Computational Steering Environment for Legacy Coupled Simulations , 2007, Sixth International Symposium on Parallel and Distributed Computing (ISPDC'07).

[5]  Michael E. Papka,et al.  Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[6]  James P. Ahrens,et al.  Remote large data visualization in the paraview framework , 2006, EGPGV '06.

[7]  Kenneth Moreland,et al.  Sandia National Laboratories , 2000 .

[8]  Renato Pajarola,et al.  Interactive SPH simulation and rendering on the GPU , 2010, SCA '10.

[9]  John Brooke,et al.  Computational steering in realitygrid , 2003 .

[10]  Fan Zhang,et al.  Enabling Multi-physics Coupled Simulations within the PGAS Programming Framework , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[11]  Scott Klasky,et al.  DataSpaces: an interaction and coordination framework for coupled simulation workflows , 2012, HPDC '10.

[12]  Nelson L. Max,et al.  A contract based system for large data visualization , 2005, VIS 05. IEEE Visualization, 2005..

[13]  Bertrand Alessandrini,et al.  Simulations of complex hydro-elastic problems using the parallel SPH model SPH-Flow , 2009 .

[14]  E.R. Mark,et al.  Enhancements to the eXtensible Data Model and Format (XDMF) , 2007, 2007 DoD High Performance Computing Modernization Program Users Group Conference.

[15]  Rajeev Thakur,et al.  Revealing the Performance of MPI RMA Implementations , 2007, PVM/MPI.

[16]  John Biddiscombe,et al.  An HDF5 MPI Virtual File Driver for Parallel In-situ Post-processing , 2010, EuroMPI.

[17]  Karsten Schwan,et al.  Adaptable, metadata rich IO methods for portable high performance IO , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[18]  Jeremy S. Meredith,et al.  Parallel in situ coupling of simulation with a fully featured visualization system , 2011, EGPGV '11.

[19]  Ivan Janciak,et al.  UK e-Science All Hands Meeting , 2009 .