Staged deployment of interactive multi-application HPC workflows

Running scientific workflows on a supercomputer can be a daunting task for a scientific domain specialist. Workflow management solutions (WMS) are a standard method for reducing the complexity of application deployment on high performance computing (HPC) infrastructure. We introduce the design for a middleware system that extends and combines the functionality from existing solutions in order to create a high-level, staged usercentric operation/deployment model. This design addresses the requirements of several use cases in the life sciences, with a focus on neuroscience. In this manuscript we focus on two use cases: 1) three coupled neuronal simulators (for three different space/time scales) with in-transit visualization and 2) a closed-loop workflow optimized by machine learning, coupling a robot with a neural network simulation. We provide a detailed overview of the application-integrated monitoring in relationship with the HPC job. We present here a novel usage model for large scale interactive multi-application workflows running on HPC systems which aims at reducing the complexity of deployment and execution, thus enabling new science.

[1]  Christopher D. Carothers,et al.  Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[2]  Wolfram Schenck,et al.  The NEST Dry-Run Mode: Efficient Dynamic Analysis of Neuronal Network Simulation Code , 2017, Front. Neuroinform..

[3]  Dietmar W. Erwin,et al.  UNICORE—a Grid computing environment , 2002, Concurr. Comput. Pract. Exp..

[4]  Julian M. Kunkel,et al.  Real-Time I/O-Monitoring of HPC Applications with SIOX, Elasticsearch, Grafana and FUSE , 2017, ISC Workshops.

[5]  Viktor K. Jirsa,et al.  The Virtual Brain: a simulator of primate brain network dynamics , 2013, Front. Neuroinform..

[6]  Radu Prodan,et al.  Fault Detection, Prevention and Recovery in Current Grid Workflow Systems , 2008, CoreGRID Workshop on Grid Middleware.

[7]  Steve B. Furber,et al.  The SpiNNaker Project , 2014, Proceedings of the IEEE.

[8]  Rizos Sakellariou,et al.  A characterization of workflow management systems for extreme-scale applications , 2016, Future Gener. Comput. Syst..

[9]  Philip K. Maini,et al.  Multiscale Modeling in Biology New insights into cancer illustrate how mathematical tools are enhancing the understanding of life from the smallest scale to the grandest , 2022 .

[10]  Mikael Djurfeldt,et al.  Closed Loop Interactions between Spiking Neural Network and Robotic Simulators Based on MUSIC and ROS , 2016, Front. Neuroinform..

[11]  Grant Potter Grafana: open platform for analytics and monitoring , 2018 .

[12]  Alexander Peyser,et al.  Streaming Live Neuronal Simulation Data into Visualization and Analysis , 2018, ISC Workshops.

[13]  Johannes Schemmel,et al.  A wafer-scale neuromorphic hardware system for large-scale neural modeling , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[14]  Sonja Grün,et al.  Elephant – Open-Source Tool for the Analysis of Electrophysiological Data Sets , 2015 .

[15]  Dieter Schmalstieg,et al.  RFA Guardian: Comprehensive Simulation of Radiofrequency Ablation Treatment of Liver Tumors , 2018, Scientific Reports.

[16]  Shantenu Jha,et al.  Synapse: Synthetic Application Profiler and Emulator , 2015, 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[17]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[18]  Michael Wilde,et al.  Kickstarting remote applications , 2006 .

[19]  Takumi Washio,et al.  Multi-scale, tailor-made heart simulation can predict the effect of cardiac resynchronization therapy. , 2017, Journal of molecular and cellular cardiology.

[20]  Gaute T. Einevoll,et al.  LFPy: a tool for biophysical simulation of extracellular potentials generated by detailed model neurons , 2014, Front. Neuroinform..

[21]  Shantenu Jha,et al.  A Comprehensive Perspective on Pilot-Job Systems , 2015, ACM Comput. Surv..

[22]  Alois Knoll,et al.  The Human Brain Project: Creating a European Research Infrastructure to Decode the Human Brain , 2016, Neuron.

[23]  Jie Tan,et al.  Big Data Bioinformatics , 2014, Journal of cellular physiology.

[24]  Karlheinz Meier,et al.  Introducing the Human Brain Project , 2011, FET.

[25]  Christopher D. Carothers,et al.  PANORAMA: An approach to performance modeling and diagnosis of extreme-scale workflows , 2017, Int. J. High Perform. Comput. Appl..

[26]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[27]  Alexander Peyser,et al.  Arbor — A Morphologically-Detailed Neural Network Simulation Library for Contemporary High-Performance Computing Architectures , 2019, 2019 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP).

[28]  David Gavaghan,et al.  Multi-scale computational modelling in biology and physiology , 2007, Progress in Biophysics and Molecular Biology.

[29]  Alban Gaignard,et al.  Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities , 2017, Future Gener. Comput. Syst..

[30]  Tsuyoshi Murata,et al.  {m , 1934, ACML.