Scientific Workflow Interchanging through Patterns: Reversals and Lessons Learned

Scientific workflows are used for dealing with complex problems in different e-science domains. These workflows are modeled and executed using Scientific Workflow Management Systems (SWfMSs). Generally, SWfMSs provide their own Workflow Specification Language (WfSL), and this is a challenge considering the possibility of interchanging workflow specifications between different SWfMSs. Nevertheless, the reuse of workflows gains growing importance as it helps with fostering the collaboration and cross-fertilization across different research groups. This paper presents a research proposal, including its mishaps and assimilations, on the use of workflow patterns combined with software architecture concepts to capture the key semantics expressed in scientific workflows specified in different WfSLs and to allow the interchanging of these specifications between different SWfMSs. This paper also shows how our findings based on real world specifications led us to reformulate our initial proposal and discuss the new results.

[1]  Carole A. Goble,et al.  Common motifs in scientific workflows: An empirical analysis , 2012, 2012 IEEE 8th International Conference on E-Science.

[2]  Moustafa Ghanem,et al.  Tavaxy: Integrating Taverna and Galaxy workflows with cloud computing support , 2012, BMC Bioinformatics.

[3]  Markus Völter,et al.  Model-Driven Software Development: Technology, Engineering, Management , 2006 .

[4]  Fernanda Campos,et al.  SciProv: An Architecture for Semantic Query in Provenance Metadata on e-Science Context , 2011, ITBAM.

[5]  Yolanda Gil,et al.  PROV Model Primer , 2012 .

[6]  David Garlan,et al.  Acme: an architecture description interchange language , 2010, CASCON.

[7]  Cláudio T. Silva,et al.  VisTrails: visualization meets data management , 2006, SIGMOD Conference.

[8]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[9]  Anne H. H. Ngu,et al.  Towards scientific workflow patterns , 2009, WORKS '09.

[10]  W. Freudling,et al.  Automated data reduction workflows for astronomy , 2013, 1311.5411.

[11]  Gábor Terstyánszky,et al.  The demand for consistent web-based workflow editors , 2013, WORKS@SC.

[12]  Johan Montagnat,et al.  Computer-Assisted Scientific Workflow Design , 2013, Journal of Grid Computing.

[13]  Johan Montagnat,et al.  Fine-Grain Interoperability of Scientific Workflows in Distributed Computing Infrastructures , 2013, Journal of Grid Computing.

[14]  Arie Shoshani,et al.  Accelerating the scientific exploration process with scientific workflows , 2006 .

[15]  Johan Montagnat,et al.  Flexible and Efficient Workflow Deployment of Data-Intensive Applications On Grids With MOTEUR , 2008, Int. J. High Perform. Comput. Appl..

[16]  Gary M. Olson The next generation of science collaboratories , 2009 .

[17]  Péter Kacsuk,et al.  P‐GRADE portal family for grid infrastructures , 2011, Concurr. Comput. Pract. Exp..

[18]  Daniel J. Blankenberg,et al.  Galaxy: a platform for interactive large-scale genome analysis. , 2005, Genome research.

[19]  Carole A. Goble,et al.  Opening new gateways to workflows for life scientists , 2012, HealthGrid.

[20]  Jun Qin,et al.  ASKALON: A Development and Grid Computing Environment for Scientific Workflows , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[21]  Marta Mattoso,et al.  ProvManager: a provenance management system for scientific workflows , 2012, Concurr. Comput. Pract. Exp..

[22]  Ian J. Taylor,et al.  Triana Applications within Grid Computing and Peer to Peer Environments , 2003, Journal of Grid Computing.

[23]  Ivona Brandic,et al.  Optimizing bioinformatics workflows for data analysis using cloud management techniques , 2011, WORKS '11.

[24]  Yogesh L. Simmhan,et al.  The Open Provenance Model core specification (v1.1) , 2011, Future Gener. Comput. Syst..