A virtual machine re-packing approach to the horizontal vs. vertical elasticity trade-off for cloud autoscaling

An automated solution to horizontal vs. vertical elasticity problem is central to make cloud autoscalers truly autonomous. Today's cloud autoscalers are typically varying the capacity allocated by increasing and decreasing the number of virtual machines (VMs) of a predefined size (horizontal elasticity), not taking into account that as load varies it may be advantageous not only to vary the number but also the size of VMs (vertical elasticity). We analyze the price/performance effects achieved by different strategies for selecting VM-sizes for handling increasing load and we propose a cost-benefit based approach to determine when to (partly) replace a current set of VMs with a different set. We evaluate our repacking approach in combination with different auto-scaling strategies. Our results show a range of 7% up to 60% cost saving in total resource utilization cost of our sample applications and workloads.

[1]  Jordi Torres,et al.  Characterizing Cloud Federation for Enhancing Providers' Profit , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[2]  Tharindu Patikirikorala,et al.  Feedback controllers in the cloud , 2010, APSEC 2010.

[3]  Chase Qishi Wu,et al.  A cost-effective scheduling algorithm for scientific workflows in clouds , 2012, 2012 IEEE 31st International Performance Computing and Communications Conference (IPCCC).

[4]  Johan Tordsson,et al.  An adaptive hybrid elasticity controller for cloud infrastructures , 2012, 2012 IEEE Network Operations and Management Symposium.

[5]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[6]  P. Shenoy,et al.  Kingfisher: A System for Elastic Cost-aware Provisioning in the Cloud , 2010 .

[7]  Johan Tordsson,et al.  Modeling for Dynamic Cloud Scheduling Via Migration of Virtual Machines , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[8]  Calton Pu,et al.  A Cost-Sensitive Adaptation Engine for Server Consolidation of Multitier Applications , 2009, Middleware.

[9]  Prashant J. Shenoy,et al.  Resource overbooking and application profiling in shared hosting platforms , 2002, OSDI '02.

[10]  Erik Elmroth,et al.  Unifying Cloud Management: Towards Overall Governance of Business Level Objectives , 2011, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[11]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[12]  Dmytro Dyachuk,et al.  Maximizing Cloud Providers' Revenues via Energy Aware Allocation Policies , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[13]  Rizos Sakellariou,et al.  Enacting SLAs in Clouds Using Rules , 2011, Euro-Par.

[14]  Alex Glikson,et al.  SLA-aware resource over-commit in an IaaS cloud , 2012, 2012 8th international conference on network and service management (cnsm) and 2012 workshop on systems virtualiztion management (svm).

[15]  Jesús Carretero,et al.  Multi-model prediction for enhancing content locality in elastic server infrastructures , 2011, 2011 18th International Conference on High Performance Computing.

[16]  Johan Tordsson,et al.  Virtual Machine Placement for Predictable and Time-Constrained Peak Loads , 2011, GECON.

[17]  José Antonio Lozano,et al.  A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments , 2014, Journal of Grid Computing.

[18]  Zhuzhong Qian,et al.  A game theoretical method for auto-scaling of multi-tiers web applications in cloud , 2012, Internetware.

[19]  Johan Tordsson,et al.  Cloud brokering mechanisms for optimized placement of virtual machines across multiple providers , 2012, Future Gener. Comput. Syst..

[20]  Kang G. Shin,et al.  Automated control of multiple virtualized resources , 2009, EuroSys '09.

[21]  Marin Litoiu,et al.  Optimal autoscaling in a IaaS cloud , 2012, ICAC '12.

[22]  Rajkumar Buyya,et al.  Adapting Market-Oriented Scheduling Policies for Cloud Computing , 2010, ICA3PP.

[23]  Wentong Cai,et al.  QoS-Aware Revenue-Cost Optimization for Latency-Sensitive Services in IaaS Clouds , 2012, 2012 IEEE/ACM 16th International Symposium on Distributed Simulation and Real Time Applications.