Fluctuation-Aware and Predictive Workflow Scheduling in Cost-Effective Infrastructure-as-a-Service Clouds

Cloud computing is becoming an increasingly popular platform for the execution of scientific applications such as scientific workflows. In contrast to grids and other traditional high-performance computing systems, clouds provide a customizable infrastructure where scientific workflows can provision desired resources ahead of the execution and set up a required software environment on virtual machines (VMs). Nevertheless, various challenges, especially its quality-of-service prediction and optimal scheduling, are yet to be addressed. Existing studies mainly consider workflow tasks to be executed with VMs having time-invariant, stochastic, or bounded performance and focus on minimizing workflow execution time or execution cost while meeting the quality-of-service requirements. This work considers time-varying performance and aims at minimizing the execution cost of workflow deployed on Infrastructure-as-a-Service clouds while satisfying Service-Level-Agreements with users. We employ time-series-based approaches to capture dynamic performance fluctuations, feed a genetic algorithm with predicted performance of VMs, and generate schedules at run-time. A case study based on real-world third-party IaaS clouds and some well-known scientific workflows show that our proposed approach outperforms traditional approaches, especially those considering time-invariant or bounded performance only.

[1]  Dick H. J. Epema,et al.  Deadline-constrained workflow scheduling algorithms for Infrastructure as a Service Clouds , 2013, Future Gener. Comput. Syst..

[2]  Jun Zhang,et al.  Deadline constrained cloud computing resources scheduling for cost optimization based on dynamic objective genetic algorithm , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[3]  Rajkumar Buyya,et al.  Deadline Based Resource Provisioningand Scheduling Algorithm for Scientific Workflows on Clouds , 2014, IEEE Transactions on Cloud Computing.

[4]  MengChu Zhou,et al.  Optimal Supervisory Control of Flexible Manufacturing Systems by Petri Nets: A Set Classification Approach , 2014, IEEE Transactions on Automation Science and Engineering.

[5]  Rajkumar Buyya,et al.  A Particle Swarm Optimization-Based Heuristic for Scheduling Workflow Applications in Cloud Computing Environments , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[6]  Hongye Su,et al.  An improved approach to test diagnosability of bounded petri nets , 2017, IEEE/CAA Journal of Automatica Sinica.

[7]  MengChu Zhou,et al.  A Nonnegative Latent Factor Model for Large-Scale Sparse Matrices in Recommender Systems via Alternating Direction Method , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Qingsheng Zhu,et al.  A time series and reduction‐based model for modeling and QoS prediction of service compositions , 2015, Concurr. Comput. Pract. Exp..

[9]  MengChu Zhou,et al.  Business and Scientific Workflows: A Web Service-Oriented Approach , 2013 .

[10]  Jarek Nabrzyski,et al.  Cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[11]  Qingsheng Zhu,et al.  Percentile Performance Estimation of Unreliable IaaS Clouds and Their Cost-Optimal Capacity Decision , 2017, IEEE Access.

[12]  MengChu Zhou,et al.  Stochastic Modeling and Quality Evaluation of Infrastructure-as-a-Service Clouds , 2015, IEEE Transactions on Automation Science and Engineering.

[13]  Fang Dong,et al.  A Performance Fluctuation-Aware Stochastic Scheduling Mechanism for Workflow Applications in Cloud Environment , 2014, IEICE Trans. Inf. Syst..

[14]  MengChu Zhou,et al.  Stochastic Modeling and Performance Analysis of Migration-Enabled and Error-Prone Clouds , 2015, IEEE Transactions on Industrial Informatics.

[15]  Manu Vardhan,et al.  Cost Effective Genetic Algorithm for Workflow Scheduling in Cloud Under Deadline Constraint , 2016, IEEE Access.

[16]  Wensheng Tang,et al.  Multi-valued collaborative QoS prediction for cloud service via time series analysis , 2017, Future Gener. Comput. Syst..

[17]  Kishor S. Trivedi,et al.  An Interacting Stochastic Models Approach for the Performance Evaluation of DSRC Vehicular Safety Communication , 2013, IEEE Transactions on Computers.

[18]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[19]  Quanwang Wu,et al.  Broker-based SLA-aware composite service provisioning , 2014, J. Syst. Softw..

[20]  MengChu Zhou,et al.  Design, Analysis and Verification of Real-Time Systems Based on Time Petri Net Refinement , 2013, TECS.

[21]  John Shalf,et al.  Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[22]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[23]  Rajkumar Buyya,et al.  Deadline‐constrained coevolutionary genetic algorithm for scientific workflow scheduling in cloud computing , 2017, Concurr. Comput. Pract. Exp..

[24]  Hao Wu,et al.  Resource and Instance Hour Minimization for Deadline Constrained DAG Applications Using Computer Clouds , 2016, IEEE Transactions on Parallel and Distributed Systems.

[25]  Jin-Soo Kim,et al.  BTS: Resource capacity estimate for time-targeted science workflows , 2011, J. Parallel Distributed Comput..

[26]  Jorge-Arnulfo Quiané-Ruiz,et al.  Runtime measurements in the cloud , 2010, Proc. VLDB Endow..

[27]  Jin-Soo Kim,et al.  Cost optimized provisioning of elastic resources for application workflows , 2011, Future Gener. Comput. Syst..

[28]  Rajkumar Buyya,et al.  Meeting Deadlines of Scientific Workflows in Public Clouds with Tasks Replication , 2014, IEEE Transactions on Parallel and Distributed Systems.

[29]  MengChu Zhou,et al.  Toward cloud computing QoS architecture: analysis of cloud systems and cloud services , 2017, IEEE/CAA Journal of Automatica Sinica.

[30]  Franco Romerio,et al.  A parametric genetic algorithm approach to assess complementary options of large scale windsolar coupling , 2017, IEEE/CAA Journal of Automatica Sinica.

[31]  MengChu Zhou,et al.  A Novel Approach to Extracting Non-Negative Latent Factors From Non-Negative Big Sparse Matrices , 2016, IEEE Access.

[32]  Xiaohui Liu,et al.  Evolutionary Multi-Objective Workflow Scheduling in Cloud , 2016, IEEE Transactions on Parallel and Distributed Systems.

[33]  Rajkumar Buyya,et al.  A taxonomy and survey on scheduling algorithms for scientific workflows in IaaS cloud computing environments , 2017, Concurr. Comput. Pract. Exp..

[34]  Kishor S. Trivedi,et al.  Scalable Analytics for IaaS Cloud Availability , 2014, IEEE Transactions on Cloud Computing.

[35]  Yun Yang,et al.  Robust Scheduling of Scientific Workflows with Deadline and Budget Constraints in Clouds , 2014, 2014 IEEE 28th International Conference on Advanced Information Networking and Applications.

[36]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.