Reliability-Aware Speed Control Policy for Energy Reduction in Server Farms

Server farms are playing an important role in large-scale computing infrastructure today. However, the increasing energy consumption of server farms makes them expensive to operate. There are many technologies aiming at reducing the power consumption of server farms, for example, dynamic server speed scaling is an effective way to make the energy consumption proportional to the work load. But most of them have negative effect on system reliability due to increased transient failure rates. The repeated transition cycles increase the wear-and-tear of server components. In this paper, we control the process of speed scaling of a single server based on the dynamic workload using Markov Decision Process (MDP), and propose two reliability-aware algorithms for energy-efficient dynamic server speed control. Furthermore, we extend the model to multi-servers, it could provide speed control policies with performance, reliability and energy consumption concern. We evaluate the proposed models and policies via numerical experiments, which could show the effectiveness of our algorithms.

[1]  Lisa Zhang,et al.  Routing for Energy Minimization in the Speed Scaling Model , 2010, 2010 Proceedings IEEE INFOCOM.

[2]  Yu Cai,et al.  Markov Model Based Power Management in Server Clusters , 2010, 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing.

[3]  Thomas D. Burd,et al.  Energy efficient CMOS microprocessor design , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[4]  Dakai Zhu,et al.  Global scheduling based reliability-aware power management for multiprocessor real-time systems , 2011, Real-Time Systems.

[5]  Kenneth J. Christensen,et al.  Reducing the Energy Consumption of Ethernet with Adaptive Link Rate (ALR) , 2008, IEEE Transactions on Computers.

[6]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[7]  Albert Y. Zomaya,et al.  A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems , 2010, Adv. Comput..

[8]  U. Rieder,et al.  Markov Decision Processes , 2010 .

[9]  Natarajan Gautam,et al.  Server Frequency Control Using Markov Decision Processes , 2009, IEEE INFOCOM 2009.

[10]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[11]  Luca Benini,et al.  Policy optimization for dynamic power management , 1999, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[12]  Thomas F. Wenisch,et al.  PowerNap: eliminating server idle power , 2009, ASPLOS.