The advent of grid technology has provided a promising methodology for usage of distributed resources for complex scientific workflow applications. Grid workflow management systems (GWMS) enable even a non-expert grid user to compose workflows and further provide functionality to coordinate execution of the complete workflow thereby masking the grid-specific details from the user. We have designed a non- DAG workflow specification model for workflow composition. Our model allows a user to compose a workflow using directed graphs, thereby allowing modeling of sequence, parallel, choice and iteration patterns in the workflow. We have also provided for structural verification of workflows using Petri net based analysis techniques. A workflow is said to be structurally correct if it does not have errors like deadlock and lack of synchronization. A workflow found correct by structural verification techniques can be executed correctly for all possible workflow instances. We have incorporated our workflow model into a previously existing workflow management system for Sun grid engine [1]. Experiments show that the model allows for composition of wider range of workflow applications. Also the verification procedure provides appropriate error messages to user, indicating the nature and cause of error, when a structural error is detected. We have composed and executed the workflow of EMAN [2], a real-world bio-informatics application. The experimentation results show the efficient utilization of compute power of the grid by the inherent parallel tasks in the EMAN workflow.
[1]
Rajkumar Buyya,et al.
A taxonomy of scientific workflow systems for grid computing
,
2005,
SGMD.
[2]
Yolanda Gil,et al.
Pegasus: Mapping Scientific Workflows onto the Grid
,
2004,
European Across Grids Conference.
[3]
Wil M. P. van der Aalst,et al.
An Alternative Way to Analyze Workflow Graphs
,
2002,
CAiSE.
[4]
Maria E. Orlowska,et al.
Analyzing Process Models Using Graph Reduction Techniques
,
2000,
Inf. Syst..
[5]
Falko Bause,et al.
An Efficient Polynomial-Time Algorithm to Decide Liveness and Boundedness of Free-Choice Nets
,
1992,
Application and Theory of Petri Nets.
[6]
Matthew R. Pocock,et al.
Taverna: a tool for the composition and enactment of bioinformatics workflows
,
2004,
Bioinform..
[7]
Sanjeev K. Aggarwal,et al.
A workflow editor and scheduler for composing applications on computational grids
,
2006,
12th International Conference on Parallel and Distributed Systems - (ICPADS'06).
[8]
W Chiu,et al.
EMAN: semiautomated software for high-resolution single-particle reconstructions.
,
1999,
Journal of structural biology.
[9]
Simon J. Cox,et al.
Tuning GENIE earth system model components using a Grid enabled data management system
,
2004
.
[10]
Ian T. Foster,et al.
The Anatomy of the Grid: Enabling Scalable Virtual Organizations
,
2001,
Int. J. High Perform. Comput. Appl..