DynaSched: a dynamic Web service scheduling and deployment framework for data-intensive Grid workflows

Abstract: Grid computing boosts productivity by maximizing resource utilization and simplifying access to resources that are shared among virtual organizations. Recently, the Grid and Web Service communities have established a set of common interests and requirements. The latest version of the Globus Toolkit implements the Web Service Resource Framework (WSRF) specifications, which were formulated to cover these interests. We leverage the Globus Toolkit to address some limitations in supporting the dynamic nature of large-scale Grid and data-intensive workflow executions. Dynamic Web Service deployment fits well with the dynamic nature of the Grid and opens new ways of managing workflow executions on the Grid. In this article, we present the design and evaluation of DynaSched, a dynamic Web Service scheduling and deployment framework that supports the workflow management of dynamic services. Dynamic Web Service deployment on the Grid allows jobs to be executed at the site where their input data is located. Our empirical studies show that the framework decreases data-intensive workflow execution time by minimizing communication costs. We argue that the framework also makes workflows more flexible and fault-tolerant. The system is based on the Open Grid Services Architecture (OGSA) specifications and is WSRF-compliant.
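The data-locality idea in the abstract (run a job at the site that already holds its input, deploying the service there if needed) can be illustrated with a minimal Python sketch. This is not the DynaSched implementation and does not use any Globus Toolkit API; the `Site` and `Job` classes, the `transfer_cost` and `schedule` functions, and the cost model (megabytes of input that would have to move) are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Site:
    """Hypothetical Grid site; `deployed` holds names of services already installed."""
    name: str
    deployed: set = field(default_factory=set)


@dataclass
class Job:
    """Hypothetical workflow job: the service it needs and where its inputs live."""
    service: str
    inputs: dict  # site name -> size in MB of input data stored at that site


def transfer_cost(job, site_name):
    """MB of input data that must be moved if the job runs at `site_name`."""
    return sum(size for loc, size in job.inputs.items() if loc != site_name)


def schedule(job, sites):
    """Pick the site with the least data to move (assumed cost model).

    Ties are broken in favor of sites that already have the service,
    then by site name. If the chosen site lacks the service, it is
    "deployed" there by recording it in the site's `deployed` set.
    Returns (site name, whether a deployment was needed).
    """
    best = min(
        sites,
        key=lambda s: (transfer_cost(job, s.name), job.service not in s.deployed, s.name),
    )
    needs_deploy = job.service not in best.deployed
    if needs_deploy:
        best.deployed.add(job.service)
    return best.name, needs_deploy


if __name__ == "__main__":
    sites = [Site("A", {"blast"}), Site("B")]
    job = Job("blast", {"A": 10, "B": 500})
    print(schedule(job, sites))
```

In this sketch the service is deployed at site `B`, because moving 10 MB to `B` is cheaper than moving 500 MB to `A`, even though `A` already hosts the service. A real scheduler would also weigh deployment time, site load, and network bandwidth, which are left out here.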
