SLA_Driven Adaptive Resource Allocation for Virtualized Servers

In order to reduce cost and improve efficiency, many data centers adopt virtualization solutions. The advent of virtualization allows multiple virtual machines hosted on a single physical server. However, this poses new challenges for resource management. Web workloads which are dominant in data centers are known to vary dynamically with time. In order to meet application’s service level agreement (SLA), how to allocate resources for virtual machines has become an important challenge in virtualized server environments, especially when dealing with fluctuating workloads and complex server applications. User experience is an important manifestation of SLA and attracts more attention. In this paper, the SLA is defined by server-side response time. Traditional resource allocation based on resource utilization has some drawbacks. We argue that dynamic resource allocation directly based on real-time user experience is more reasonable and also has practical significance. To address the problem, we propose a system architecture that combines response time measurements and analysis of user experience for resource allocation. An optimization model is introduced to dynamically allocate the resources among virtual machines. When resources are insufficient, we provide service differentiation and firstly guarantee resource requirements of applications that have higher priorities. We evaluate our proposal using TPC-W and Webbench. The experimental results show that our system can judiciously allocate system resources. The system helps stabilize applications’ user experience. It can reduce the mean deviation of user experience from desired targets. key words: resource allocation, virtualized servers, user experience, optimization theory

[1]  Prashant J. Shenoy,et al.  Profiling and Modeling Resource Usage of Virtualized Applications , 2008, Middleware.

[2]  Jing Xu,et al.  Autonomic resource management in virtualized data centers using fuzzy logic-based approaches , 2008, Cluster Computing.

[3]  Jerome A. Rolia,et al.  Configuring Workload Manager Control Parameters for Resource Pools , 2006, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006.

[4]  Kang G. Shin,et al.  Adaptive control of virtualized resources in utility computing environments , 2007, EuroSys '07.

[5]  Aman Kansal,et al.  Q-clouds: managing performance interference effects for QoS-aware clouds , 2010, EuroSys '10.

[6]  Aameek Singh,et al.  Shares and utilities based power consolidation in virtualized server environments , 2009, 2009 IFIP/IEEE International Symposium on Integrated Network Management.

[7]  Xue Liu,et al.  Optimal multivariate control for differentiated services on a shared hosting platform , 2007, 2007 46th IEEE Conference on Decision and Control.

[8]  Allan Kuchinsky,et al.  Integrating user-perceived quality into Web server design , 2000, Comput. Networks.

[9]  Xiaoyun Zhu,et al.  PARTIC: Power-Aware Response Time Control for Virtualized Web Servers , 2011, IEEE Transactions on Parallel and Distributed Systems.

[10]  Maria Kihl,et al.  Resource allocation and disturbance rejection in web servers using SLAs and virtualized servers , 2009, IEEE Transactions on Network and Service Management.

[11]  Ludmila Cherkasova,et al.  Measuring CPU Overhead for I/O Processing in the Xen Virtual Machine Monitor , 2005, USENIX ATC, General Track.

[12]  Xiaoyun Zhu,et al.  AppRAISE: application-level performance management in virtualized server environments , 2009, IEEE Transactions on Network and Service Management.

[13]  Kang G. Shin,et al.  Automated control of multiple virtualized resources , 2009, EuroSys '09.

[14]  Hyong S. Kim,et al.  How to tame your VMs: an automated control system for virtualized services , 2010 .

[15]  Abhishek Chandra,et al.  An observation-based approach towards self-managing Web servers , 2002, IEEE 2002 Tenth IEEE International Workshop on Quality of Service (Cat. No.02EX564).

[16]  Kevin Skadron,et al.  Enhancing Energy Efficiency in Multi-tier Web Server Clusters via Prioritization , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[17]  Hui Wang,et al.  A service-oriented priority-based resource scheduling scheme for virtualized utility computing , 2008, HiPC'08.

[18]  Yefu Wang,et al.  Co-Con: Coordinated control of power and application performance for virtualized server clusters , 2009, 2009 17th International Workshop on Quality of Service.

[19]  Xiaoyun Zhu,et al.  Adaptive entitlement control of resource containers on shared servers , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..

[20]  Raffaela Mirandola,et al.  Run-time resource management in SOA virtualized environments , 2009, QUASOSS '09.

[21]  Steven Hand,et al.  Self-adaptive and self-configured CPU resource provisioning for virtualized servers using Kalman filters , 2009, ICAC '09.

[22]  Xiaoyun Zhu,et al.  Memory overbooking and dynamic control of Xen virtual machines in consolidated environments , 2009, 2009 IFIP/IEEE International Symposium on Integrated Network Management.

[23]  Mikko H. Lipasti,et al.  An architectural evaluation of Java TPC-W , 2001, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture.