Global fan speed control considering non-ideal temperature measurements in enterprise servers

Time lag and quantization in temperature sensors in enterprise servers lead to stability concerns on existing variable fan speed control schemes. Stability challenges become further aggravated when multiple local controllers are running together with the fan control scheme. In this paper, we present a global control scheme which tackles the concerns on the stability of enterprise servers while reducing the performance degradation caused by the variable fan speed control scheme. We first present a stable fan speed control scheme based on the Proportional-Integral-Derivative (PID) controller by adaptively adjusting the PID parameters according to the operating fan speed and eliminating the fan speed oscillation caused by temperature quantization. Then, we present a global control scheme which coordinates control actions among multiple local controllers. In addition, it guarantees the server stability while minimizing the overall performance degradation. We validated the proposed control scheme using a presently shipping commercial enterprise server. Our experimental results show that the proposed fan control scheme is stable under the non-ideal temperature measurement system (10 sec in time lag and 1°C in quantization figures). Furthermore, the global control scheme enables to run multiple local controllers in a stable manner while reducing the performance degradation up to 19.2% compared to conventional coordination schemes with 19.1% savings in power consumption.

[1]  Y. Okuyama Robust stabilization and PID control for nonlinear discretized systems on a grid pattern , 2008, 2008 American Control Conference.

[2]  Tajana Simunic,et al.  Utilizing Predictors for Efficient Thermal Management in Multiprocessor SoCs , 2009, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[3]  Tajana Simunic,et al.  Temperature Aware Dynamic Workload Scheduling in Multisocket CPU Servers , 2011, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[4]  Chong-Min Kyung,et al.  Program Phase-Aware Dynamic Voltage Scaling Under Variable Computational Workload and Memory Stall Environment , 2011, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[5]  Massoud Pedram,et al.  Power and Performance Modeling in a Virtualized Server System , 2010, 2010 39th International Conference on Parallel Processing Workshops.

[6]  Tajana Simunic,et al.  JETC: Joint energy thermal and cooling management for memory and CPU subsystems in servers , 2012, IEEE International Symposium on High-Performance Comp Architecture.

[7]  Yixin Diao,et al.  Feedback Control of Computing Systems , 2004 .

[8]  Yefu Wang,et al.  Co-Con: Coordinated control of power and application performance for virtualized server clusters , 2009, 2009 17th International Workshop on Quality of Service.

[9]  Christos Kozyrakis,et al.  Full-System Power Analysis and Modeling for Server Environments , 2006 .

[10]  Tajana Simunic,et al.  Fan-speed-aware scheduling of data intensive jobs , 2012, ISLPED '12.

[11]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[12]  José Manuel Moya,et al.  Leakage and temperature aware server control for improving energy efficiency in data centers , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[13]  David Atienza,et al.  Correlation-aware virtual machine allocation for energy-efficient datacenters , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[14]  Duarte Valério,et al.  Tuning of fractional PID controllers with Ziegler-Nichols-type rules , 2006, Signal Process..

[15]  Kevin Skadron,et al.  HotSpot: a compact thermal modeling methodology for early-stage VLSI design , 2006, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[16]  Sriram Sankar,et al.  The need for speed and stability in data center power capping , 2012, 2012 International Green Computing Conference (IGCC).

[17]  Manish Marwah,et al.  Optimal Fan Speed Control for Thermal Management of Servers , 2009 .

[18]  Thomas F. Wenisch,et al.  Power management of online data-intensive services , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[19]  Naehyuck Chang,et al.  Energy-optimal dynamic thermal management for green computing , 2009, 2009 IEEE/ACM International Conference on Computer-Aided Design - Digest of Technical Papers.