Workflow Performance Profiles: Development and Analysis

This paper presents a method for performance profiles development of scientific workflow. It addresses issues related to: workflows execution in a parameter sweep manner, collecting performance information about each workflow task, and analysis of the collected data with statistical learning methods. The main goal of this work is to increase the understanding about the performance of studied workflows in a systematic and predictable way. The evaluation of the presented approach is based on a real scientific workflow developed by the Spallation Neutron Source - a DOE research facility at the Oak Ridge National Laboratory. The workflow executes an ensemble of molecular dynamics and neutron scattering intensity calculations to optimize a model parameter value.

[1]  Saltelli Andrea,et al.  Global Sensitivity Analysis: The Primer , 2008 .

[2]  Rizos Sakellariou,et al.  A Performance Model to Estimate Execution Time of Scientific Workflows on the Cloud , 2014, 2014 9th Workshop on Workflows in Support of Large-Scale Science.

[3]  Michal Wrzeszcz,et al.  A Novel Way of Using Simulations to Support Urban Security Operations , 2015, Comput. Informatics.

[4]  Andrew A. Chien,et al.  Automatic resource specification generation for resource selection , 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing (SC '07).

[5]  Miron Livny,et al.  Online Task Resource Consumption Prediction for Scientific Workflows , 2015, Parallel Process. Lett..

[6]  Jeremy C. Smith,et al.  Sassena - X-ray and neutron scattering calculated from molecular dynamics trajectories using massively parallel computers , 2012, Comput. Phys. Commun..

[7]  Douglas Thain,et al.  Practical Resource Monitoring for Robust High Throughput Computing , 2015, 2015 IEEE International Conference on Cluster Computing.

[8]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD on the IBM Blue Gene/L system , 2008, IBM J. Res. Dev..

[9]  P. F. Peterson,et al.  Mantid - Data Analysis and Visualization Package for Neutron Scattering and $μ SR$ Experiments , 2014, 1407.5860.

[10]  Jin-Soo Kim,et al.  Estimating Resource Needs for Time-Constrained Workflows , 2008, 2008 IEEE Fourth International Conference on eScience.

[11]  Radu Prodan,et al.  A Hybrid Intelligent Method for Performance Modeling and Prediction of Workflow Activities in Grids , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[12]  Miron Livny,et al.  Pegasus, a workflow management system for science automation , 2015, Future Gener. Comput. Syst..

[13]  Rafael Ferreira da Silva,et al.  Climate Science Performance, Data and Productivity on Titan , 2015 .

[14]  Jacek Kitowski,et al.  Self-scalable services in service oriented software for cost-effective data farming , 2016, Future Gener. Comput. Syst..

[15]  Dennis Gannon,et al.  Workflows for e-Science, Scientific Workflows for Grids , 2014 .

[16]  Vickie E Lynch,et al.  Molecular Dynamics Force-Field Refinement against Quasi-Elastic Neutron Scattering Data. , 2016, Journal of chemical theory and computation.

[17]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[18]  Ross C. Walker,et al.  An overview of the Amber biomolecular simulation package , 2013 .

[19]  J. D. Crow,et al.  The application of GaAlAs lasers to high-resolution liquid-crystal projection displays , 1982 .