The need for speed and stability in data center power capping

Data centers can lower costs significantly by pro-visioning expensive electrical equipment (such as UPS, diesel generators, and cooling capacity) for the actual peak power consumption rather than server nameplate power ratings. However, it is possible that this under-provisioned power level is exceeded due to software behaviors on rare occasions and could cause the entire data center infrastructure to breach the safety limits. A mechanism to cap servers to stay within the provisioned budget is needed, and processor frequency scaling based power capping methods are readily available for this purpose. We show that existing methods, when applied across a large number of servers, are not fast enough to operate correctly under rapid power dynamics observed in data centers. We also show that existing methods when applied to an open system (where demand is independent of service rate) can cause cascading failures in the software service hosted, causing the service performance to fall uncontrollably even when power capping is applied for only a small reduction in power consumption. We discuss the causes for both these short-comings and point out techniques that can yield a safe, fast, and stable power capping solution. Our techniques use admission control to limit power consumption and ensure stability, resulting in orders of magnitude improvement in performance. We also discuss why admission control cannot replace existing power capping methods but must be combined with them.

[1]  Thiemo Voigt,et al.  Adaptive resource-based Web server admission control , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[2]  Raphael Rom,et al.  Application-aware admission control and scheduling in Web servers , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[3]  Mor Harchol-Balter,et al.  Optimal power allocation in server farms , 2009, SIGMETRICS '09.

[4]  Lui Sha,et al.  Adaptive Control of Multi-Tiered Web Applications Using Queueing Predictor , 2006, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006.

[5]  Paul England,et al.  Feedback Driven QoS-Aware Power Budgeting for Virtualized Servers , 2009 .

[6]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[7]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition , 2013, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition.

[8]  Karthick Rajamani,et al.  Application-Aware Power Management , 2006, 2006 IEEE International Symposium on Workload Characterization.

[9]  Thomas F. Wenisch,et al.  Peak power modeling for data center servers with switched-mode power supplies , 2010, 2010 ACM/IEEE International Symposium on Low-Power Electronics and Design (ISLPED).

[10]  Maria Kihl,et al.  Sustainable Computing: Informatics and Systems , 2012 .

[11]  Xiaorui Wang,et al.  SHIP: A Scalable Hierarchical Power Control Architecture for Large-Scale Data Centers , 2012, IEEE Transactions on Parallel and Distributed Systems.

[12]  David E. Culler,et al.  USENIX Association Proceedings of USITS ’ 03 : 4 th USENIX Symposium on Internet Technologies and Systems , 2003 .

[13]  J Gettys,et al.  Bufferbloat: Dark Buffers in the Internet , 2011, IEEE Internet Computing.

[14]  Kevin Skadron,et al.  Power-aware QoS management in Web servers , 2003, RTSS 2003. 24th IEEE Real-Time Systems Symposium, 2003.

[15]  Vanish Talwar,et al.  No "power" struggles: coordinated multi-level power management for the data center , 2008, ASPLOS.

[16]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[17]  Anand Sivasubramaniam,et al.  Benefits and limitations of tapping into stored energy for datacenters , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[18]  Hong Liu,et al.  Energy proportional datacenter networks , 2010, ISCA.

[19]  David E. Irwin,et al.  Ensemble-level Power Management for Dense Blade Servers , 2006, 33rd International Symposium on Computer Architecture (ISCA'06).

[20]  Xiaorui Wang,et al.  Server-Level Power Control , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[21]  Yefu Wang,et al.  Coordinating Power Control and Performance Management for Virtualized Server Clusters , 2011, IEEE Transactions on Parallel and Distributed Systems.

[22]  Jie Liu,et al.  Power Budgeting for Virtualized Data Centers , 2011, USENIX Annual Technical Conference.

[23]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[24]  Maria Kihl,et al.  Analysis of admission control mechanisms using non-linear control theory , 2003, Proceedings of the Eighth IEEE Symposium on Computers and Communications. ISCC 2003.

[25]  Charles Poussot-Vassal,et al.  A Control-Theoretic Approach for the Combined Management of Quality-of-Service and Energy in Service Centers , 2010 .

[26]  Thomas F. Wenisch,et al.  Power routing: dynamic power provisioning in the data center , 2010, ASPLOS XV.

[27]  Vincent W. Freeh,et al.  Boosting Data Center Performance Through Non-Uniform Power Allocation , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[28]  Xue Liu,et al.  Integrating Adaptive Components: An Emerging Challenge in Performance-Adaptive Systems and a Server Farm Case-Study , 2007, 28th IEEE International Real-Time Systems Symposium (RTSS 2007).

[29]  Gargi Dasgupta,et al.  BrownMap: Enforcing Power Budget in Shared Data Centers , 2010, Middleware.

[30]  Xiaorui Wang,et al.  How much power oversubscription is safe and allowed in data centers , 2011, ICAC '11.