Performance-Feedback Autoscaling with Budget Constraints for Cloud-based Workloads of Workflows

The growing popularity of workflows in the cloud domain promoted the development of sophisticated autoscaling policies that allow automatic allocation and deallocation of resources. However, many state-of-the-art autoscaling policies for workflows are mostly plan-based or designed for batches (ensembles) of workflows. This reduces their flexibility when dealing with workloads of workflows, as the workloads are often subject to unpredictable resource demand fluctuations. Moreover, autoscaling in clouds almost always imposes budget constraints that should be satisfied. The budget-aware autoscalers for workflows usually require task runtime estimates to be provided beforehand, which is not always possible when dealing with workloads due to their dynamic nature. To address these issues, we propose a novel Performance-Feedback Autoscaler (PFA) that is budget-aware and does not require task runtime estimates for its operation. Instead, it uses the performance-feedback loop that monitors the average throughput on each resource type. We implement PFA in the popular Apache Airflow workflow management system, and compare the performance of our autoscaler with other two state-of-the-art autoscalers, and with the optimal solution obtained with the Mixed Integer Programming approach. Our results show that PFA outperforms other considered online autoscalers, as it effectively minimizes the average job slowdown by up to 47% while still satisfying the budget constraints. Moreover, PFA shows by up to 76% lower average runtime than the competitors.

[1]  Rizos Sakellariou,et al.  Scheduling multiple DAGs onto heterogeneous systems , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[2]  et al,et al.  Search for gravitational waves from binary inspirals in S3 and S4 LIGO data , 2007, 0704.3368.

[3]  Bingsheng He,et al.  Monetary Cost Optimizations for Hosting Workflow-as-a-Service in IaaS Clouds , 2013, IEEE Transactions on Cloud Computing.

[4]  Alexandru Iosup,et al.  A Trace-Based Performance Study of Autoscaling Workloads of Workflows in Datacenters , 2018, 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[5]  V PapadopoulosAlessandro,et al.  An Experimental Performance Evaluation of Autoscalers for Complex Workflows , 2018 .

[6]  Mei-Hui Su,et al.  Characterization of scientific workflows , 2008, 2008 Third Workshop on Workflows in Support of Large-Scale Science.

[7]  Bryan Ng,et al.  Scheduling deadline constrained scientific workflows on dynamically provisioned cloud resources , 2017, Future Gener. Comput. Syst..

[8]  Shigang Chen,et al.  Using Integer Programming for Workflow Scheduling in the Cloud , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[9]  Jarek Nabrzyski,et al.  Cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[10]  Johan Tordsson,et al.  An adaptive hybrid elasticity controller for cloud infrastructures , 2012, 2012 IEEE Network Operations and Management Symposium.

[11]  Prashant Pandey,et al.  Cloud computing , 2010, ICWET.

[12]  Albert G. Greenberg,et al.  EyeQ: Practical Network Performance Isolation at the Edge , 2013, NSDI.

[13]  Salim Hariri,et al.  Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..

[14]  Daniel S. Katz,et al.  Montage: An Astronomical Image Mosaicking Toolkit , 2010 .

[15]  Jonathan Livny,et al.  Bioinformatic discovery of bacterial regulatory RNAs using SIPHT. , 2012, Methods in molecular biology.

[16]  Zhenyu Wen,et al.  Fog Orchestration for Internet of Things Services , 2017, IEEE Internet Computing.

[17]  Dick H. J. Epema,et al.  Deadline-constrained workflow scheduling algorithms for Infrastructure as a Service Clouds , 2013, Future Gener. Comput. Syst..

[18]  Jin-Soo Kim,et al.  Cost optimized provisioning of elastic resources for application workflows , 2011, Future Gener. Comput. Syst..

[19]  Adam Belloum,et al.  Execution Time Estimation for Workflow Scheduling , 2014, 2014 9th Workshop on Workflows in Support of Large-Scale Science.

[20]  Rajkumar Buyya,et al.  Budget-Driven Scheduling of Scientific Workflows in IaaS Clouds with Fine-Grained Billing Periods , 2017, ACM Trans. Auton. Adapt. Syst..

[21]  Prasanta K. Jana,et al.  A novel cost-efficient approach for deadline-constrained workflow scheduling by dynamic provisioning of resources , 2018, Future Gener. Comput. Syst..

[22]  Robert Ricci,et al.  Taming Performance Variability , 2018, OSDI.

[23]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[24]  Jin Sun,et al.  A Review of Cost and Makespan-Aware Workflow Scheduling in Clouds , 2019, J. Circuits Syst. Comput..

[25]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[26]  Marty Humphrey,et al.  Scaling and Scheduling to Maximize Application Performance within Budget Constraints in Cloud Workflows , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.