HOPE: Enabling Efficient Service Orchestration in Software-Defined Data Centers

The functional scope of today's software-defined data centers (SDDC) has expanded to such an extent that servers face a growing amount of critical background operational tasks like load monitoring, logging, migration, and duplication, etc. These ancillary operations, which we refer to as management operations, often nibble the stringent data center power envelope and exert a tremendous amount of pressure on front-end user tasks. However, existing power capping, peak shaving, and time shifting mechanisms mainly focus on managing data center power demand at the "macro level" -- they do not distinguish ancillary background services from user tasks, and therefore often incur significant performance degradation and energy overhead. In this study we explore "micro-level" power management in SDDC: tuning a specific set of critical loads for the sake of overall system efficiency and performance. Specifically, we look at management operations that can often lead to resource contention and energy overhead in an IaaS SDDC. We assess the feasibility of this new power management paradigm by characterizing the resource and power impact of various management operations. We propose HOPE, a new system optimization framework for eliminating the potential efficiency bottleneck caused by the management operations in the SDDC. HOPE is implemented on a customized OpenStack cloud environment with heavily instrumented power infrastructure. We thoroughly validate HOPE models and optimization efficacy under various user workload scenarios. Our deployment experiences show that the proposed technique allows SDDC to reduce energy consumption by 19%, reduce management operation execution time by 25.4%, and in the meantime improve workload performance by 30%.

[1]  Svetozar Miuÿ,et al.  DejaVu: Accelerating Resource Allocation in Virtualized Environments , 2012 .

[2]  Christina Delimitrou,et al.  Quasar: resource-efficient and QoS-aware cluster management , 2014, ASPLOS.

[3]  Tao Li,et al.  Characterizing the efficiency of data deduplication for big data storage management , 2013, 2013 IEEE International Symposium on Workload Characterization (IISWC).

[4]  Thomas F. Wenisch,et al.  Power routing: dynamic power provisioning in the data center , 2010, ASPLOS XV.

[5]  Irene Zhang,et al.  Optimizing VM Checkpointing for Restore Performance in VMware ESXi , 2013, USENIX Annual Technical Conference.

[6]  Longjun Liu,et al.  Towards sustainable in-situ server systems in the big data era , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[7]  Anand Sivasubramaniam,et al.  Virtualizing power distribution in datacenters , 2013, ISCA.

[8]  Longjun Liu,et al.  Enabling datacenter servers to scale out economically and sustainably , 2013, 2013 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[9]  Nanning Zheng,et al.  HEB: Deploying and managing hybrid energy buffers for improving datacenter efficiency and economy , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[10]  Vijayaraghavan Soundararajan,et al.  The impact of management operations on the virtualized datacenter , 2010, ISCA '10.

[11]  Nanning Zheng,et al.  Leveraging Heterogeneous Power for Improving Datacenter Efficiency and Resiliency , 2015, IEEE Computer Architecture Letters.

[12]  Hsien-Hsin S. Lee,et al.  Migration energy-aware workload consolidation in enterprise clouds , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[13]  Ramesh Govindan,et al.  Scalable Rule Management for Data Centers , 2013, NSDI.

[14]  Xiao Zhang,et al.  Power containers: an OS facility for fine-grained power and energy management on multicore servers , 2013, ASPLOS '13.

[15]  David R. Kaeli,et al.  Quantifying load imbalance on virtualized enterprise servers , 2010, WOSP/SIPEW '10.

[16]  Sachin Katti,et al.  Copysets: Reducing the Frequency of Data Loss in Cloud Storage , 2013, USENIX Annual Technical Conference.

[17]  Depei Qian,et al.  Managing Green Datacenters Powered by Hybrid Renewable Energy Systems , 2014, ICAC.

[18]  Chao Li,et al.  iSwitch: Coordinating and optimizing renewable energy powered server clusters , 2012, 2012 39th Annual International Symposium on Computer Architecture (ISCA).

[19]  Vanish Talwar,et al.  VScope: Middleware for Troubleshooting Time-Sensitive Data Center Applications , 2012, Middleware.

[20]  Ricardo Bianchini,et al.  DeepDive: Transparently Identifying and Managing Performance Interference in Virtualized Environments , 2013, USENIX Annual Technical Conference.

[21]  Longjun Liu,et al.  Towards Automated Provisioning and Emergency Handling in Renewable Energy Powered Datacenters , 2014, Journal of Computer Science and Technology.

[22]  Calton Pu,et al.  An Analysis of Performance Interference Effects in Virtual Environments , 2007, 2007 IEEE International Symposium on Performance Analysis of Systems & Software.

[23]  Nanning Zheng,et al.  RE-UPS: an adaptive distributed energy storage system for dynamically managing solar energy in green datacenters , 2015, The Journal of Supercomputing.

[24]  Christina Delimitrou,et al.  Paragon: QoS-aware scheduling for heterogeneous datacenters , 2013, ASPLOS '13.

[25]  Jie Liu,et al.  Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines , 2011, SoCC.

[26]  Karsten Schwan,et al.  VirtualPower: coordinated power management in virtualized enterprise systems , 2007, SOSP.

[27]  Kristina Chodorow,et al.  MongoDB: The Definitive Guide , 2010 .

[28]  Nanning Zheng,et al.  Towards an Adaptive Multi-Power-Source Datacenter , 2016, ICS.

[29]  Thu D. Nguyen,et al.  Parasol and GreenSwitch: managing datacenters powered by renewable energy , 2013, ASPLOS '13.

[30]  Babak Falsafi,et al.  Clearing the clouds: a study of emerging scale-out workloads on modern hardware , 2012, ASPLOS XVII.