SHIP: A Scalable Hierarchical Power Control Architecture for Large-Scale Data Centers

In today's data centers, precisely controlling server power consumption is an essential way to avoid system failures caused by power capacity overload or overheating due to increasingly high server density. While various power control strategies have been recently proposed, existing solutions are not scalable to control the power consumption of an entire large-scale data center, because these solutions are designed only for a single server or a rack enclosure. In a modern data center, however, power control needs to be enforced at three levels: rack enclosure, power distribution unit, and the entire data center, due to the physical and contractual power limits at each level. This paper presents SHIP, a highly scalable hierarchical power control architecture for large-scale data centers. SHIP is designed based on well-established control theory for analytical assurance of control accuracy and system stability. Empirical results on a physical testbed show that our control solution can provide precise power control, as well as power differentiations for optimized system performance and desired server priorities. In addition, our extensive simulation results based on a real trace file demonstrate the efficacy of our control solution in large-scale data centers composed of 5,415 servers.

[1]  Kevin Skadron,et al.  Control-theoretic techniques and thermal-RC modeling for accurate and localized dynamic thermal management , 2002, Proceedings Eighth International Symposium on High Performance Computer Architecture.

[2]  Meeta Sharma Gupta,et al.  System level analysis of fast, per-core DVFS using on-chip switching regulators , 2008, 2008 IEEE 14th International Symposium on High Performance Computer Architecture.

[3]  Xiaorui Wang,et al.  Power capping: a prelude to power shifting , 2008, Cluster Computing.

[4]  Chenyang Lu,et al.  Feedback utilization control in distributed real-time systems with end-to-end tasks , 2005, IEEE Transactions on Parallel and Distributed Systems.

[5]  Xiaoyun Zhu,et al.  Power-Efficient Response Time Guarantees for Virtualized Enterprise Servers , 2008, 2008 Real-Time Systems Symposium.

[6]  Xiaorui Wang,et al.  Dynamic Thermal and Timeliness Guarantees for Distributed Real-Time Embedded Systems , 2009, 2009 15th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications.

[7]  Michael Kistler,et al.  The case for power management in web servers , 2002 .

[8]  Bishop Brock,et al.  Architecting for power management: The IBM® POWER7™ approach , 2010, HPCA - 16 2010 The Sixteenth International Symposium on High-Performance Computer Architecture.

[9]  Vanish Talwar,et al.  No "power" struggles: coordinated multi-level power management for the data center , 2008, ASPLOS.

[10]  Philip E. Gill,et al.  Practical optimization , 1981 .

[11]  Calton Pu,et al.  A feedback-driven proportion allocator for real-rate scheduling , 1999, OSDI '99.

[12]  Anand Sivasubramaniam,et al.  Managing server energy and operational costs in hosting centers , 2005, SIGMETRICS '05.

[13]  Jan M. Maciejowski,et al.  Predictive control : with constraints , 2002 .

[14]  Xiaorui Wang,et al.  MIMO Power Control for High-Density Servers in an Enclosure , 2010, IEEE Transactions on Parallel and Distributed Systems.

[15]  Kai Ma,et al.  Temperature-constrained power control for chip multiprocessors with online model estimation , 2009, ISCA '09.

[16]  Anand Sivasubramaniam,et al.  Statistical Profiling-based Techniques for Effective Provisioning of Power Infrastructure in Consolidated Data Centers , 2008 .

[17]  Luca Benini,et al.  Operating-system directed power reduction , 2000, ISLPED '00.

[18]  Yixin Diao,et al.  Feedback Control of Computing Systems , 2004 .

[19]  Xiaorui Wang,et al.  SHIP: Scalable Hierarchical Power Control for Large-Scale Data Centers , 2009, 2009 18th International Conference on Parallel Architectures and Compilation Techniques.

[20]  Karsten Schwan,et al.  VPM tokens: virtual machine-aware power budgeting in datacenters , 2009, Cluster Computing.

[21]  Vincent W. Freeh,et al.  Dynamic Power Management using Feedback , 2002 .

[22]  Karthick Rajamani,et al.  Energy Management for Commercial Servers , 2003, Computer.

[23]  Kevin Skadron,et al.  Power-aware QoS management in Web servers , 2003, RTSS 2003. 24th IEEE Real-Time Systems Symposium, 2003.

[24]  AbdelzaherTarek,et al.  Dynamic Voltage Scaling in Multitier Web Servers with End-to-End Delay Control , 2007 .

[25]  Xiaoyun Zhu,et al.  Triage: Performance differentiation for storage systems using adaptive control , 2005, TOS.

[26]  Anand Sivasubramaniam,et al.  Statistical profiling-based techniques for effective power provisioning in data centers , 2009, EuroSys '09.

[27]  Margaret Martonosi,et al.  Dynamic thermal management for high-performance microprocessors , 2001, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture.

[28]  Chenyang Lu,et al.  DEUCON: Decentralized End-to-End Utilization Control for Distributed Real-Time Systems , 2007, IEEE Transactions on Parallel and Distributed Systems.

[29]  Vincent W. Freeh,et al.  Boosting Data Center Performance Through Non-Uniform Power Allocation , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[30]  Srinivasan Keshav,et al.  A control-theoretic approach to flow control , 1991, SIGCOMM '91.

[31]  Chenyang Lu,et al.  Feedback performance control in software services , 2003 .

[32]  Thomas F. Wenisch,et al.  Power routing: dynamic power provisioning in the data center , 2010, ASPLOS 2010.

[33]  Amin Vahdat,et al.  ECOSystem: managing energy as a first class operating system resource , 2002, ASPLOS X.

[34]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[35]  Karsten Schwan,et al.  Vpm tokens: virtual machine-aware power budgeting in datacenters , 2008, HPDC '08.

[36]  David E. Irwin,et al.  Ensemble-level Power Management for Dense Blade Servers , 2006, 33rd International Symposium on Computer Architecture (ISCA'06).

[37]  Xue Liu,et al.  Dynamic Voltage Scaling in Multitier Web Servers with End-to-End Delay Control , 2007, IEEE Transactions on Computers.