SOPRAN: Integrative Workload Modeling and Proactive Reoptimization for Virtual Machine Management

For a data center to operate effectively (i.e., meeting customers’ Service Level Agreements (SLAs)) and efficiently (i.e., minimizing resource consumption), the virtual machines (VMs) must be carefully managed. In particular, as the resource demands of VMs change, the assignment of VMs to physical machines becomes sub-optimal. VM replication and migration provide a solution for dealing with dynamic workloads. However, as migrations are costly, an effective control policy is critical to avoid frequent migrations. Moreover, an agile decision making component is also important to reduce feedback latencies. In this paper we propose SOPRAN, a virtual machine management system leveraging an integrative workload model for the data center, that can dynamically adapt the assignment of VMs to physical machines to minimize resource consumption without sacrificing the SLAs. Different from existing trace-based methods for this problem, SOPRAN characterizes the dynamic workloads in the system using an integrative risk cube model, and approximates the workload demands with a representative state set. The optimal plan for each representative state is incrementally generated, forming the switchable plan set. At runtime, a two-phase re-optimization strategy matches the current system demand to the closest representative state and actuates the corresponding plan in the switchable plan set. At the same time, online monitors profile the actual demands and refine the risk cube to guarantee the model’s accuracy. This modeling technique and optimization procedure based on it brings the great savings in optimization cost and migration opportunities, and enables the high scalability of SOPRAN. We evaluated SOPRAN against the state-of-the-art IBM MFR algorithm. The results show that, with comparable resource consumptions, SOPRAN can achieve more stable SLA violation rate of no more than 4%, 80% lower migration rate, and save up to 90% reoptimization overhead.

[1]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[2]  Jing Xu,et al.  On the Use of Fuzzy Modeling in Virtualized Data Center Management , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[3]  Jin-Soo Kim,et al.  Energy Reduction in Consolidated Servers through Memory-Aware Virtual Machine Scheduling , 2011, IEEE Transactions on Computers.

[4]  Gautam Kumar,et al.  The cost of reconfiguration in a cloud , 2010, Middleware Industrial Track '10.

[5]  Jordi Guitart,et al.  SLA-driven Elastic Cloud Hosting Provider , 2010, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing.

[6]  Gang Wang,et al.  Appliance-Based Autonomic Provisioning Framework for Virtualized Outsourcing Data Center , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[7]  Andrzej Kochut,et al.  Dynamic Placement of Virtual Machines for Managing SLA Violations , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[8]  Monica S. Lam,et al.  Optimizing the migration of virtual computers , 2002, OPSR.

[9]  Jordi Guitart Fernández,et al.  SLA-driven Elastic Cloud Hosting Provider , 2010, PDP 2010.

[10]  Feng Zhao,et al.  Virtual machine power metering and provisioning , 2010, SoCC '10.

[11]  Calton Pu,et al.  A Cost-Sensitive Adaptation Engine for Server Consolidation of Multitier Applications , 2009, Middleware.

[12]  Edward L. Haletky VMware ESX Server in the Enterprise: Planning and Securing Virtualization Servers , 2007 .

[13]  Heeseung Jo,et al.  Task-aware virtual machine scheduling for I/O performance. , 2009, VEE '09.

[14]  Prashant J. Shenoy,et al.  Resource overbooking and application profiling in shared hosting platforms , 2002, OSDI '02.

[15]  Jerome A. Rolia,et al.  An integrated approach to resource pool management: Policies, efficiency and quality metrics , 2008, 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN).

[16]  Ludmila Cherkasova,et al.  Measuring CPU Overhead for I/O Processing in the Xen Virtual Machine Monitor , 2005, USENIX ATC, General Track.

[17]  Alfons Kemper,et al.  AutoGlobe: An Automatic Administration Concept for Service-Oriented Database Applications , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[18]  Tal Garfinkel,et al.  Terra: a virtual machine-based platform for trusted computing , 2003, SOSP '03.

[19]  Alan L. Cox,et al.  Scheduling I/O in virtual machine monitors , 2008, VEE '08.

[20]  Ashraf Aboulnaga,et al.  Automatic virtual machine configuration for database workloads , 2008, SIGMOD Conference.

[21]  Asser N. Tantawi,et al.  Dynamic placement for clustered web applications , 2006, WWW '06.

[22]  Kang G. Shin,et al.  Automated control of multiple virtualized resources , 2009, EuroSys '09.

[23]  Arun Venkataramani,et al.  Black-box and Gray-box Strategies for Virtual Machine Migration , 2007, NSDI.

[24]  Jerome A. Rolia,et al.  A capacity management service for resource pools , 2005, WOSP '05.

[25]  Cheng-Zhong Xu,et al.  A Reinforcement Learning Approach to Online Web Systems Auto-configuration , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems.

[26]  Tim Kraska,et al.  Building a database on S3 , 2008, SIGMOD Conference.

[27]  Akshat Verma,et al.  pMapper: Power and Migration Cost Aware Application Placement in Virtualized Systems , 2008, Middleware.

[28]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[29]  Margo I. Seltzer,et al.  Virtual worlds: fast and strategyproof auctions for dynamic resource allocation , 2003, EC '03.

[30]  Baruch Schieber,et al.  Minimizing migrations in fair multiprocessor scheduling of persistent tasks , 2004, SODA '04.

[31]  Xiaoyun Zhu,et al.  Statistical service assurances for applications in utility grid environments , 2004, Perform. Evaluation.

[32]  Amin Vahdat,et al.  Usher: An Extensible Framework for Managing Clusters of Virtual Machines , 2007, LISA.

[33]  Peter J. Varman,et al.  Efficient and adaptive proportional share I/O scheduling , 2009, PERV.