State-of-the-practice in data center virtualization: Toward a better understanding of VM usage

Hardware virtualization is the prevalent way to share data centers among different tenants. In this paper we present a large scale workload characterization study that aims to a better understanding of the state-of-the-practice, i.e., how data centers in the private cloud are used by their customers, how physical resources are shared among different tenants using virtualization, and how virtualization technologies are actually employed. Our study focuses on all corporate data centers of a major infrastructure provider that are geographically dispersed across the entire globe and reports on their observed usage across a 19-day period. We especially focus on how virtual machines are deployed across different physical resources with an emphasis on processors and memory, focusing on resource sharing and usage of physical resources, virtual machine life cycles, and migration patterns and frequencies. Our study illustrates that there is a huge tendency in over provisioning resources while being conservative to the several possibilities opened up by virtualization (e.g., migration and co-location), showing tremendous potential for the development of policies aiming to reduce data center operational costs.

[1]  Jerome A. Rolia,et al.  Workload Analysis and Demand Prediction of Enterprise Data Center Applications , 2007, 2007 IEEE 10th International Symposium on Workload Characterization.

[2]  Evgenia Smirni,et al.  Usage patterns in multi-tenant data centers: a temporal perspective , 2012, ICAC '12.

[3]  Evgenia Smirni,et al.  Data Centers in the Cloud: A Large Scale Performance Study , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[4]  Anand Sivasubramaniam,et al.  Xen and co.: communication-aware CPU scheduling for consolidated xen-based hosting platforms , 2007, VEE '07.

[5]  Jeffrey O. Kephart,et al.  Runtime Demand Estimation for effective dynamic resource management , 2010, 2010 IEEE Network Operations and Management Symposium - NOMS 2010.

[6]  Calton Pu,et al.  A Cost-Sensitive Adaptation Engine for Server Consolidation of Multitier Applications , 2009, Middleware.

[7]  Jerome A. Rolia,et al.  Selling T-shirts and Time Shares in the Cloud , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[8]  Eric Bouillet,et al.  Efficient resource provisioning in compute clouds via VM multiplexing , 2010, ICAC '10.

[9]  Gautam Kumar,et al.  CosMig: Modeling the Impact of Reconfiguration in a Cloud , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.

[10]  Kang G. Shin,et al.  Automated control of multiple virtualized resources , 2009, EuroSys '09.

[11]  Akshat Verma,et al.  pMapper: Power and Migration Cost Aware Application Placement in Virtualized Systems , 2008, Middleware.

[12]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[13]  Haifeng Chen,et al.  Untangling mixed information to calibrate resource utilization in virtual machines , 2011, ICAC '11.

[14]  Peter Desnoyers,et al.  Memory buddies: exploiting page sharing for smart colocation in virtualized data centers , 2009, VEE '09.

[15]  Prashant J. Shenoy,et al.  Profiling and Modeling Resource Usage of Virtualized Applications , 2008, Middleware.

[16]  Jerome A. Rolia,et al.  An integrated approach to resource pool management: Policies, efficiency and quality metrics , 2008, 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN).

[17]  Gargi Dasgupta,et al.  Workload management for power efficiency in virtualized data centers , 2011, CACM.

[18]  Evgenia Smirni,et al.  Model-driven consolidation of Java workloads on multicores , 2012, IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2012).