Experience with adapting a WS-BPEL runtime for eScience workflows

Scientists believe in the concept of collective intelligence and are increasingly collaborating with their peers, sharing data and simulation techniques. These collaborations are made possible by building eScience infrastructures. eScience infrastructures assemble various scientific workflow and data management tools that provide rich end-user functionality while abstracting the complexities of many underlying technologies. For instance, workflow systems provide a means to execute complex sequences of tasks with or without intensive user intervention, and in ways that support flexible reordering and reconfiguration of the workflow. As workflow technologies continue to emerge, the need for interoperability and standardization grows more pressing. The Web Services Business Process Execution Language (WS-BPEL) provides one such standard way of defining workflows. The WS-BPEL specification encompasses a broad range of workflow composition and description capabilities that can be applied to both abstract and concrete executable components. Scientific workflows, with their agile characteristics, present significant challenges in embracing WS-BPEL for eScience purposes. In this paper we discuss our experience in adopting a WS-BPEL runtime within an eScience infrastructure, with reference to an early implementation of a custom, eScience-motivated, BPEL-like workflow engine. Specifically, the paper focuses on replacing the early-adopter research system with a widely used open source WS-BPEL runtime, Apache ODE, while retaining an interoperable design that allows switching to any WS-BPEL-compliant workflow runtime in the future. The paper discusses the challenges encountered in extending a business-motivated workflow engine for scientific workflow execution. Further, the paper presents performance benchmarks for the developed system.
