Towards the Scientific Cloud Workflow Architecture

Scientific workflows emerged as a technology that enables scientists to undertake computational scientific experiments. Workflow enactors map workflow tasks onto distributed resources, hiding the inherent complexity of distributed infrastructures to the users. In the past, while the emphasis has been focused in adapting the workflow structure onto the resources, today the emergence of the cloud computing paradigm enables us to adapt the resources to the workflow tasks based on their characteristics. In this paper, we examine the concept of Cloud Scientific Workflow, and propose a new architectural approach based on autonomic principles, guided by a combination of high-level and low-level policies. High-level policies enable the workflow enactor to choose among a number of workflow structure transformations that better suit the underlying resources dynamically depending on the context, whereas low-level policies enable the autonomic resource manager to adjust the required computational power to the workload derived from a scientific workflow specification, exploiting the cloud elasticity property, and to cope with performance fluctuations or unexpected events in the cloud infrastructure. The novelty of our approach is the combination of both policies that can lead to higher degrees of dynamism. The key enabling architectural components for such a dynamism are the petri-net based performance models for implementing high-level policies and MOSt-CB system for the adaptation to multi-cloud environments.

[1]  Yolanda Gil,et al.  A new approach for publishing workflows: abstractions, standards, and linked data , 2011, WORKS '11.

[2]  G. Bruce Berriman,et al.  Scientific workflow applications on Amazon EC2 , 2010, 2009 5th IEEE International Conference on E-Science Workshops.

[3]  Matthew A. Brown,et al.  Automatic Panoramic Image Stitching using Invariant Features , 2007, International Journal of Computer Vision.

[4]  T. Priol,et al.  Dynamicity in Scientific Workflows , 2008 .

[5]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[6]  Paul T. Groth,et al.  Wings: Intelligent Workflow-Based Design of Computational Experiments , 2011, IEEE Intelligent Systems.

[7]  Yash Goyal,et al.  CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service , 2015, Mobile Cloud Visual Media Computing.

[8]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[9]  G. Bruce Berriman,et al.  On the Use of Cloud Computing for Scientific Workflows , 2008, 2008 IEEE Fourth International Conference on eScience.

[10]  Elie Rachkidi,et al.  MOSt-CB: SLA enforcement and smart VNE (Virtual network embedding) in a multi cloud providers environment , 2014, 2014 IEEE Globecom Workshops (GC Wkshps).

[11]  Ewa Deelman,et al.  Experiences using cloud computing for a scientific workflow application , 2011, ScienceCloud '11.

[12]  Rizos Sakellariou,et al.  Scheduling Data-IntensiveWorkflows onto Storage-Constrained Distributed Resources , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[13]  Nazim Agoulmine,et al.  Cost-effective complex service mapping in cloud infrastructures , 2012, 2012 IEEE Network Operations and Management Symposium.

[14]  Dennis Gannon,et al.  Workflows for e-Science, Scientific Workflows for Grids , 2014 .

[15]  G. Alonso,et al.  Parallel computing patterns for Grid workflows , 2006, 2006 Workshop on Workflows in Support of Large-Scale Science.

[16]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[17]  Jeffrey S. Chase,et al.  Adapting Scientific Workflows on Networked Clouds Using Proactive Introspection , 2015, 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC).

[18]  Xiao Liu,et al.  The Design of Cloud Workflow Systems , 2012, SpringerBriefs in Computer Science.

[19]  Omer F. Rana,et al.  Adaptive exception handling for scientific workflows , 2010, Concurr. Comput. Pract. Exp..