Improving cloud infrastructure utilization through overbooking

Despite the potential given by the combination of multi-tenancy and virtualization, resource utilization in today's data centers is still low. We identify three key characteristics of cloud services and infrastructure as-a-service management practices: burstiness in service workloads, fluctuations in virtual machine resource usage over time, and virtual machines being limited to pre-defined sizes only. Based on these characteristics, we propose scheduling and admission control algorithms that incorporate resource overbooking to improve utilization. A combination of modeling, monitoring, and prediction techniques is used to avoid overpassing the total infrastructure capacity. A performance evaluation using a mixture of workload traces demonstrates the potential for significant improvements in resource utilization while still avoiding overpassing the total capacity.

[1]  Rajkumar Buyya,et al.  Managing Cancellations and No-Shows of Reservations with Overbooking to Increase Resource Revenue , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).

[2]  Rajkumar Buyya,et al.  Managing Overloaded Hosts for Dynamic Consolidation of Virtual Machines in Cloud Data Centers under Quality of Service Constraints , 2013, IEEE Transactions on Parallel and Distributed Systems.

[3]  Hai Jin,et al.  An Adaptive Meta-scheduler for Data-Intensive Applications , 2003, GCC.

[4]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[5]  Alex Glikson,et al.  SLA-aware resource over-commit in an IaaS cloud , 2012, 2012 8th international conference on network and service management (cnsm) and 2012 workshop on systems virtualiztion management (svm).

[6]  André Brinkmann,et al.  The Gain of Overbooking , 2009, JSSPP.

[7]  Jerome A. Rolia,et al.  Selling T-shirts and Time Shares in the Cloud , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[8]  Robert D. van der Mei,et al.  A prediction method for job runtimes on shared processors: Survey, statistical analysis and new avenues , 2007, Perform. Evaluation.

[9]  María Blanca Caminero,et al.  Network-aware meta-scheduling in advance with autonomous self-tuning system , 2011, Future Gener. Comput. Syst..

[10]  Moustafa Ghanem,et al.  Improving Resource Utilisation in the Cloud Environment Using Multivariate Probabilistic Models , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[11]  Mohammad Kazem Akbari,et al.  Grid performance prediction using state‐space model , 2009, Concurr. Comput. Pract. Exp..

[12]  Timothy Roscoe,et al.  Resource overbooking and application profiling in shared hosting platforms , 2002, OSDI '02.

[13]  Antonio Corradi,et al.  VM consolidation: A real case based on OpenStack Cloud , 2014, Future Gener. Comput. Syst..

[14]  Eric Bouillet,et al.  Efficient resource provisioning in compute clouds via VM multiplexing , 2010, ICAC '10.

[15]  Janakiram Subramanian,et al.  Airline Yield Management with Overbooking, Cancellations, and No-Shows , 1999, Transp. Sci..

[16]  W. Cirne,et al.  A comprehensive model of the supercomputer workload , 2001, Proceedings of the Fourth Annual IEEE International Workshop on Workload Characterization. WWC-4 (Cat. No.01EX538).

[17]  Johan Tordsson,et al.  An adaptive hybrid elasticity controller for cloud infrastructures , 2012, 2012 IEEE Network Operations and Management Symposium.

[18]  Vijay K. Naik,et al.  Biting Off Safely More Than You Can Chew: Predictive Analytics for Resource Over-Commit in IaaS Cloud , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[19]  Charles Reiss,et al.  Towards understanding heterogeneous clouds at scale : Google trace analysis , 2012 .

[20]  Hamid Ahmadi,et al.  Equivalent Capacity and Its Application to Bandwidth Allocation in High-Speed Networks , 1991, IEEE J. Sel. Areas Commun..

[21]  María Blanca Caminero,et al.  Exponential Smoothing for Network-Aware Meta-scheduler in Advance in Grids , 2010, 2010 39th International Conference on Parallel Processing Workshops.

[22]  Albert Y. Zomaya,et al.  Energy efficient utilization of resources in cloud computing systems , 2010, The Journal of Supercomputing.