The role of scalable high-performance workflows and flexible workflow management systems that can support multiple simulations will continue to increase in importance. For example, with the end of Dennard scaling, there is a need to substitute a single long running simulation with multiple repeats of shorter simulations, or concurrent replicas. Further, many scientific problems involve ensembles of simulations in order to solve a higher-level problem or produce statistically meaningful results. However most supercomputing software development and performance enhancements have focused on optimizing single- simulation performance. On the other hand, there is a strong inconsistency in the definition and practice of workflows and workflow management systems. This inconsistency often centers around the difference between several different types of workflows, including modeling and simulation, grid, uncertainty quantification, and purely conceptual workflows. This work explores this phenomenon by examining the different types of workflows and workflow management systems, reviewing the perspective of a large supercomputing facility, examining the common features and problems of workflow management systems, and finally presenting a proposed solution based on the concept of common building blocks. The implications of the continuing proliferation of workflow management systems and the lack of interoperability between these systems are discussed from a practical perspective. In doing so, we have begun an investigation of the design and implementation of open workflow systems for supercomputers based upon common components.
[1]
Gregory R. Watson,et al.
The Eclipse Integrated Computational Environment
,
2018,
SoftwareX.
[2]
Rizos Sakellariou,et al.
A characterization of workflow management systems for extreme-scale applications
,
2016,
Future Gener. Comput. Syst..
[3]
Jano I. van Hemert,et al.
Scientific Workflows
,
2016,
ACM Comput. Surv..
[4]
Robert L. Clay,et al.
Incorporating Workflow for V&V/UQ in the Sandia Analysis Workbench.
,
2015
.
[5]
Shantenu Jha,et al.
A Building Blocks Approach towards Domain Specific Workflow Systems?
,
2016,
ArXiv.
[6]
Tadashi Maeno,et al.
PanDA: Evolution and Recent Trends in LHC Computing
,
2015
.
[7]
Rajkumar Buyya,et al.
A Taxonomy of Workflow Management Systems for Grid Computing
,
2005,
Proceedings of the 38th Annual Hawaii International Conference on System Sciences.
[8]
Ewa Deelman,et al.
Integrating existing scientific workflow systems: the Kepler/Pegasus example
,
2007,
WORKS '07.
[9]
Edward A. Lee,et al.
Scientific workflow management and the Kepler system
,
2006,
Concurr. Comput. Pract. Exp..
[10]
Wil M. P. van der Aalst,et al.
The Application of Petri Nets to Workflow Management
,
1998,
J. Circuits Syst. Comput..
[11]
Shantenu Jha,et al.
Designing Workflow Systems Using Building Blocks
,
2016,
1609.03484.
[12]
Boris Kozinsky,et al.
AiiDA: Automated Interactive Infrastructure and Database for Computational Science
,
2015,
ArXiv.
[13]
Lavanya Ramakrishnan,et al.
The future of scientific workflows
,
2018,
Int. J. High Perform. Comput. Appl..