Scientific workflow management: between generality and applicability

In a problem solving environment (PSE), a scientific workflow management system (SWMS) provides a meta environment for managing activities and data in scientific experiments, for prototyping experimental computing systems and for orchestrating the runtime system behaviour. The realisation of a SWMS is often driven by domain specific applications and thus is at application level. Investigating the common characteristics in domain specific SWMSs and encapsulating them in a generic framework improve the reusability of the SWMS components and reduce the costs for introducing an e-science framework in a new science domain. In this position paper, we present our research in an ongoing project: virtual laboratory for e-science (VL-e). In the VL-e project, we are building a generic e-science framework which would support scientists from different domains to share their knowledge and to perform specific experiments. We summarise the lessons we have learned from a previous VL-e implementation, and discuss the plan for improving the quality of the SWMS support in the VL-e framework.

[1]  E. Gallopoulos,et al.  Computer as thinker/doer: problem-solving environments for computational science , 1994, IEEE Computational Science and Engineering.

[2]  Louis O. Hertzberger,et al.  VL-E: Approaches to design a Grid-based Virtual Laboratory , 2004, DAPSYS.

[3]  Carole A. Goble,et al.  myGrid: personalised bioinformatics on the information grid , 2003, ISMB.

[4]  Russ B. Altman,et al.  Modelling biological processes using workflow and Petri Net models , 2002, Bioinform..

[5]  Gottfried Vossen,et al.  Workshop on workflow management in scientific and engineering applications—report , 1997, SGMD.

[6]  Irfan-Ullah Awan,et al.  Analysis of grid service composition with BPEL4WS , 2004, 18th International Conference on Advanced Information Networking and Applications, 2004. AINA 2004..

[7]  Arun Jagatheesan,et al.  Gridflow description, query, and execution at SCEC using the SDSC matrix , 2004, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004..

[8]  Zhiming Zhao,et al.  An agent based architecture for constructing Interactive Simulation Systems , 2004 .

[9]  John H. Holmes,et al.  Modeling the workflow of prescription writing , 2003, SAC '03.

[10]  Rajkumar Buyya Grid economy comes of age: emerging gridbus tools for service-oriented cluster and grid computing , 2002, Proceedings. Second International Conference on Peer-to-Peer Computing,.

[11]  H. Casanova,et al.  ACM SIGACT news distributed computing column 8 , 2002, SIGA.

[12]  Karen Schuchardt,et al.  New paradigms in problem solving environments for scientific computing , 2002, IUI '02.

[13]  Francine Berman,et al.  New Grid Scheduling and Rescheduling Methods in the GrADS Project , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[14]  Kaizar Amin,et al.  GridAnt: a client-controllable grid workflow system , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[15]  Yogesh L. Simmhan,et al.  The XCAT Science Portal , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[16]  Steven G. Parker,et al.  Large-scale Computational Science Applications using the SCIRun Problem Solving Environment , 2000 .

[17]  Peter M. A. Sloot,et al.  Agent-Based Flow Control for HLA Components , 2005, Simul..

[18]  Subhash Saini,et al.  GridFlow: workflow management for grid computing , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[19]  César Garita,et al.  VLAM-G: A Grid-based virtual laboratory , 2002 .

[20]  Mathilde Romberg,et al.  The UNICORE Grid infrastructure , 2002, Sci. Program..

[21]  Wil M. P. van der Aalst,et al.  Workflow Patterns , 2004, Distributed and Parallel Databases.

[22]  Ian J. Taylor,et al.  Triana Applications within Grid Computing and Peer to Peer Environments , 2003, Journal of Grid Computing.

[23]  ZhaoZhiming,et al.  Agent-Based Flow Control for HLA Components , 2005 .

[24]  G. Allen,et al.  The Cactus Code: a problem solving environment for the grid , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[25]  Adam Arbree,et al.  Mapping Abstract Complex Workflows onto Grid Environments , 2003, Journal of Grid Computing.

[26]  Louis O. Hertzberger,et al.  Grid Service Based Collaboration for VL-e: Requirements, Analysis and Design , 2005, EGC.

[27]  Anne E. Trefethen,et al.  The UK e-Science Core Programme and the Grid , 2002, Future Gener. Comput. Syst..

[28]  Yolanda Gil,et al.  Workflow management in GriPhyN , 2004 .

[29]  Matthias S. Müller,et al.  The emerging role of biogrids , 2004, CACM.

[30]  Matthew R. Pocock,et al.  Taverna: a tool for the composition and enactment of bioinformatics workflows , 2004, Bioinform..

[31]  Joel Closier,et al.  HEP Applications Evaluation of the EDG Testbed and Middleware , 2003, ArXiv.

[32]  Louis O. Hertzberger,et al.  A Grid-Based Virtual Laboratory , 2002 .

[33]  M. Z. Muehlen,et al.  Workflow Management Coalition , 2000 .

[34]  Arnoud Visser,et al.  INTELLIGENT ADAPTIVE TRAFFIC FORECASTING SYSTEM USING DATA ASSIMILATION FOR USE IN TRAVELER INFORMATION SYSTEMS , 2004 .

[35]  Henri E. Bal,et al.  Virtual Laboratory for e-Science , 2007 .

[36]  Ludwik Czaja,et al.  Place/Transition Petri Net Evolutions: Recording Ways, Analysis and Synthesis , 2002, Fundam. Informaticae.