Automated management of virtualized data centers

Virtualized data centers enable hosted applications to share resources. However, these data centers are difficult to manage because application requirements change constantly. This thesis presents two tools, AutoControl and LiteGreen, that automatically adapt to dynamic changes to achieve various service level objectives (SLOs) while maintaining high resource utilization, high application performance, and low power consumption. The AutoControl resource manager is based on control theory and optimization techniques. It combines an online model estimator with a novel multi-input, multi-output (MIMO) resource controller. The model estimator captures the complex relationship between application performance and resource allocations, while the MIMO controller allocates the right amount of resources to achieve application SLOs. We also developed a power manager called LiteGreen, which saves desktop energy by virtualizing the user's desktop computing environment as a virtual machine (VM) and then migrating it between the user's physical desktop machine and a VM server, depending on whether the desktop computing environment is actively in use or idle. AutoControl and LiteGreen are built using the Xen and Hyper-V virtualization technologies, and several experimental testbeds, including one on Emulab, were built to evaluate different aspects of both tools.
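The estimator-plus-controller loop described above can be illustrated with a minimal sketch. It is not the thesis's actual estimator or controller: it assumes a single application whose performance is a linear function of two resource shares, uses a normalized least-mean-squares update as the online estimator, and uses a minimum-norm correction as the allocation step. All names (`update_model`, `next_allocation`, `run_loop`), the initial values, and the simulated application gains are hypothetical.

```python
from typing import List, Tuple


def dot(a: List[float], b: List[float]) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def update_model(gains: List[float], alloc: List[float],
                 measured: float, eps: float = 1e-9) -> List[float]:
    """Online estimator (assumed: normalized LMS).

    Adjusts the estimated gains so that the linear model
    performance ~= gains . alloc matches the latest measurement.
    """
    err = measured - dot(gains, alloc)
    norm = dot(alloc, alloc) + eps
    return [g + err * u / norm for g, u in zip(gains, alloc)]


def next_allocation(gains: List[float], alloc: List[float],
                    target: float, eps: float = 1e-9) -> List[float]:
    """Controller step (assumed: minimum-norm correction).

    Makes the smallest change to the allocation that the current model
    predicts will reach the target, then clamps each share to [0, 1].
    """
    gap = target - dot(gains, alloc)
    norm = dot(gains, gains) + eps
    return [min(1.0, max(0.0, u + gap * g / norm))
            for u, g in zip(alloc, gains)]


def run_loop(true_gains: List[float], target: float,
             steps: int = 20) -> Tuple[List[float], float]:
    """Closed loop against a simulated application (hypothetical plant)."""
    gains = [1.0, 1.0]   # initial model guess (assumption)
    alloc = [0.2, 0.2]   # initial CPU and disk shares (assumption)
    for _ in range(steps):
        measured = dot(true_gains, alloc)  # stand-in for a real measurement
        gains = update_model(gains, alloc, measured)
        alloc = next_allocation(gains, alloc, target)
    return alloc, dot(true_gains, alloc)


if __name__ == "__main__":
    alloc, perf = run_loop(true_gains=[0.5, 0.3], target=0.4)
    print("allocation:", alloc, "performance:", perf)
```

The sketch keeps the estimator and the controller as separate functions to mirror the split in the abstract: the model is refreshed from each new measurement, and the allocation is then chosen against the refreshed model. A real MIMO controller would handle several applications and outputs at once and would trade off competing SLOs under shared capacity limits.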
