A distributed workflow management model for grid middleware

Workflow management enables grid service composition and user collaboration in grid middleware, it includes the workflow of grid service invoking and data exchanging among services. Because of the large requirements of grid service composition, it's necessary for the workflow management to be scalable in many grid application systems. This paper proposes a distributed workflow management model for grid middleware, it is comprised of three modules including workflow definition tool (WDT), workflow balancer (WB), workflow execution engine (WEE) and defines the interfaces between workflow management and other grid components. Firstly, WDT composites grid services into workflow service, secondly WB dispatches workflow services and workflow job requests, thirdly WEE is up to the execution of workflow jobs. Furthermore, it describes the interfaces between workflow management and related grid components, such as job management, information service and grid portal. Also it divides the interfaces into WEE management, workflow service management and workflow job management. With the implementation and experiments of this model for Chinese education and research, its support platform middleware, its usability is verified through the usage of workflow tools in image processing scenario, its scalability and load balancing ability are proved in workflow scheduling test.

[1]  Adam Arbree,et al.  Mapping Abstract Complex Workflows onto Grid Environments , 2003, Journal of Grid Computing.

[2]  Chunming Hu,et al.  CGSP: An Extensible and Reconfigurable Grid Framework , 2005, APPT.

[3]  Hai Jin ChinaGrid: Making Grid Computing a Reality , 2004, ICADL.

[4]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[5]  Paul W. P. J. Grefen,et al.  WIDE-a distributed architecture for workflow management , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[6]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[7]  Subhash Saini,et al.  GridFlow: workflow management for grid computing , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[8]  Carole A. Goble,et al.  myGrid: personalised bioinformatics on the information grid , 2003, ISMB.

[9]  David F. Snelling,et al.  UNICORE—a Grid computing environment , 2002, Concurr. Comput. Pract. Exp..

[10]  Zhi Xu,et al.  UDMGrid: A Grid Application for University Digital Museums , 2004, GCC.

[11]  Yolanda Gil,et al.  Pegasus: Mapping Scientific Workflows onto the Grid , 2004, European Across Grids Conference.