PaDaWAn: a Python infrastructure for loosely coupled in situ workflows

This paper presents PaDaWAn, an infrastructure written in Python to provide loosely coupled in situ capabilities to accelerate filebased simulation workflows. It provides services for in-memory data exchange between applications and a simple configuration model to switch from a file-based workflow to a loosely coupled in situ workflow. The infrastructure is currently based on CEA-DAM Hercule parallel I/O library by providing an ABI-compatible library to intercept simulation data in a transparent way and to facilitate integration into existing simulation codes and tools. PaDaWAn implements a producer-consumer pattern with buffering of data in an in-memorystaging service with automatic memory management and running on dedicated resources. We describe the key design decisions and main architectural features, and share the lessons learned from the development of the infrastructure and from setting up test runs on two production-like workflow cases. We conclude on the perspectives for our infrastructure.

[1]  Utkarsh Ayachit,et al.  ParaView Catalyst: Enabling In Situ Data Analysis and Visualization , 2015, ISAV@SC.

[2]  Scott Klasky,et al.  DataSpaces: an interaction and coordination framework for coupled simulation workflows , 2012, HPDC '10.

[3]  Bruno Raffin,et al.  Melissa: Large Scale In Transit Sensitivity Analysis Avoiding Intermediate Files , 2017, SC17: International Conference for High Performance Computing, Networking, Storage and Analysis.

[4]  Michael E. Papka,et al.  Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[5]  Arie Shoshani,et al.  Hello ADIOS: the challenges and lessons of developing leadership class I/O frameworks , 2014, Concurr. Comput. Pract. Exp..

[6]  Franck Cappello,et al.  Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O , 2012, 2012 IEEE International Conference on Cluster Computing.

[7]  Scott Klasky,et al.  Loosely Coupled In Situ Visualization: A Perspective on Why It's Here to Stay , 2015, ISAV@SC.

[8]  Jeremy S. Meredith,et al.  Parallel in situ coupling of simulation with a fully featured visualization system , 2011, EGPGV '11.

[9]  Scott Klasky,et al.  Enabling high-speed asynchronous data extraction and transfer using DART , 2010 .

[10]  Jayson Luc Peterson,et al.  A HYDRA UQ Workflow for NIF Ignition Experiments , 2016, 2016 Second Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization (ISAV).

[11]  H. Jourdren HERA: A Hydrodynamic AMR Platform for Multi-Physics Simulations , 2005 .

[12]  Scott Klasky,et al.  Enabling high-speed asynchronous data extraction and transfer using DART , 2010, Concurr. Comput. Pract. Exp..

[13]  Matthieu Dreher,et al.  Decaf: Decoupled Dataflows for In Situ High-Performance Workflows , 2017 .

[14]  James P. Ahrens,et al.  The ALPINE In Situ Infrastructure: Ascending from the Ashes of Strawman , 2017, ISAV@SC.