A Declarative Optimization Engine for Resource Provisioning of Scientific Workflows in IaaS Clouds

Resource provisioning for scientific workflows in Infrastructure-as-a-service (IaaS) clouds is an important and complicated problem for budget and performance optimizations of workflows. Scientists are facing the complexities resulting from severe cloud performance dynamics and various user requirements on performance and cost. To address those complexity issues, we propose a declarative optimization engine named Deco for resource provisioning of scientific workflows in IaaS clouds. Deco allows users to specify their workflow optimization goals and constraints of specific problems with an extended declarative language. We propose a novel probabilistic optimization approach for evaluating the declarative optimization goals and constraints in dynamic clouds. To accelerate the solution finding, Deco leverages the available power of GPUs to find the solution in a fast and timely manner. We evaluate Deco with several common provisioning problems. We integrate Deco into a popular workflow management system (Pegasus) and show that Deco can achieve more effective performance/cost optimizations than the state-of-the-art approaches.

[1]  Richard Wolski,et al.  The Eucalyptus Open-Source Cloud-Computing System , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[2]  Rizos Sakellariou,et al.  A Performance Model to Estimate Execution Time of Scientific Workflows on the Cloud , 2014, 2014 9th Workshop on Workflows in Support of Large-Scale Science.

[3]  Armin Haller,et al.  A Declarative Recommender System for Cloud Infrastructure Services Selection , 2012, GECON.

[4]  Luc De Raedt,et al.  ProbLog: A Probabilistic Prolog and its Application in Link Discovery , 2007, IJCAI.

[5]  Jianwu Wang,et al.  Early Cloud Experiences with the Kepler Scientific Workflow System , 2012, ICCS.

[6]  Rajkumar Buyya,et al.  Cost-based scheduling of scientific workflow applications on utility grids , 2005, First International Conference on e-Science and Grid Computing (e-Science'05).

[7]  Kaijun Ren,et al.  Exploring portfolio scheduling for long-term execution of scientific workloads in IaaS clouds , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[8]  Bertram Ludäscher,et al.  Scientific workflow management and the Kepler system: Research Articles , 2006 .

[9]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[10]  Johannes Gehrke,et al.  ClouDiA: A Deployment Advisor for Public Clouds , 2012, Proc. VLDB Endow..

[11]  Jorge-Arnulfo Quiané-Ruiz,et al.  Runtime measurements in the cloud , 2010, Proc. VLDB Endow..

[12]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[13]  M. Kunze,et al.  The Cumulus project: Build a scientific cloud for a data center , 2009 .

[14]  Renato Figueiredo,et al.  Science Clouds: Early Experiences in Cloud Computing for Scientific Applications , 2008 .

[15]  Jin-Soo Kim,et al.  Cost optimized provisioning of elastic resources for application workflows , 2011, Future Gener. Comput. Syst..

[16]  Bingsheng He,et al.  Simplified Resource Provisioning for Workflows in IaaS Clouds , 2014, 2014 IEEE 6th International Conference on Cloud Computing Technology and Science.

[17]  El-Ghazali Talbi,et al.  GPU Computing for Parallel Local Search Metaheuristic Algorithms , 2013, IEEE Transactions on Computers.

[18]  Bingsheng He,et al.  Transformation-Based Monetary CostOptimizations for Workflows in the Cloud , 2014, IEEE Transactions on Cloud Computing.

[19]  Seung Woo Son,et al.  NUMARCK: Machine Learning Algorithm for Resiliency and Checkpointing , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[20]  Luc De Raedt,et al.  On the implementation of the probabilistic logic programming language ProbLog , 2010, Theory and Practice of Logic Programming.

[21]  Bingsheng He,et al.  Monetary Cost Optimizations for Hosting Workflow-as-a-Service in IaaS Clouds , 2013, IEEE Transactions on Cloud Computing.

[22]  Philippe Codognet,et al.  A GPU Implementation of Parallel Constraint-Based Local Search , 2014, 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[23]  Dick H. J. Epema,et al.  Deadline-constrained workflow scheduling algorithms for Infrastructure as a Service Clouds , 2013, Future Gener. Comput. Syst..

[24]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[25]  Giorgio C. Buttazzo,et al.  QoS guarantee using probabilistic deadlines , 1999, Proceedings of 11th Euromicro Conference on Real-Time Systems. Euromicro RTS'99.

[26]  J. Lloyd Foundations of Logic Programming , 1984, Symbolic Computation.

[27]  Andreas Wilke,et al.  A scalable data analysis platform for metagenomics , 2013, 2013 IEEE International Conference on Big Data.

[28]  Rajkumar Buyya,et al.  Meeting Deadlines of Scientific Workflows in Public Clouds with Tasks Replication , 2014, IEEE Transactions on Parallel and Distributed Systems.

[29]  K. K. Ramakrishnan,et al.  Towards a ubiquitous cloud computing infrastructure , 2010, 2010 17th IEEE Workshop on Local & Metropolitan Area Networks (LANMAN).

[30]  Jarek Nabrzyski,et al.  Cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[31]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[32]  Richard A. O'Keefe,et al.  The Craft of Prolog , 1990 .

[33]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[34]  Hannu Toivonen,et al.  Link Discovery in Graphs Derived from Biological Databases , 2006, DILS.

[35]  Subhash Saini,et al.  GridFlow: workflow management for grid computing , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[36]  Rajkumar Buyya,et al.  A taxonomy of scientific workflow systems for grid computing , 2005, SGMD.

[37]  Marty Humphrey,et al.  Scaling and Scheduling to Maximize Application Performance within Budget Constraints in Cloud Workflows , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[38]  Joseph M. Hellerstein,et al.  Boom analytics: exploring data-centric, declarative programming for the cloud , 2010, EuroSys '10.

[39]  Saikat Guha,et al.  Generalized resource allocation for the cloud , 2012, SoCC '12.

[40]  Alexandru Iosup,et al.  Performance Analysis of Cloud Computing Services for Many-Tasks Scientific Computing , 2011, IEEE Transactions on Parallel and Distributed Systems.

[41]  Boon Thau Loo,et al.  Declarative automated cloud resource orchestration , 2011, SoCC.