Towards energy-aware scheduling in data centers using machine learning

As energy-related costs have become a major economical factor for IT infrastructures and data-centers, companies and the research community are being challenged to find better and more efficient power-aware resource management strategies. There is a growing interest in "Green" IT and there is still a big gap in this area to be covered. In order to obtain an energy-efficient data center, we propose a framework that provides an intelligent consolidation methodology using different techniques such as turning on/off machines, power-aware consolidation algorithms, and machine learning techniques to deal with uncertain information while maximizing performance. For the machine learning approach, we use models learned from previous system behaviors in order to predict power consumption levels, CPU loads, and SLA timings, and improve scheduling decisions. Our framework is vertical, because it considers from watt consumption to workload features, and cross-disciplinary, as it uses a wide variety of techniques. We evaluate these techniques with a framework that covers the whole control cycle of a real scenario, using a simulation with representative heterogeneous workloads, and we measure the quality of the results according to a set of metrics focused toward our goals, besides traditional policies. The results obtained indicate that our approach is close to the optimal placement and behaves better when the level of uncertainty increases.

[1]  Salim Hariri,et al.  Autonomic power and performance management for computing systems , 2006, 2006 IEEE International Conference on Autonomic Computing.

[2]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[3]  Rajarshi Das,et al.  Autonomic multi-agent management of power and performance in data centers , 2008, AAMAS.

[4]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[5]  Randy H. Katz,et al.  An energy case for hybrid datacenters , 2010, OPSR.

[6]  Daniel Mossé,et al.  A dynamic configuration model for power-efficient virtualized server clusters , 2009 .

[7]  Ramin Yahyapour,et al.  Design and evaluation of job scheduling strategies for grid computing , 2000, GRID.

[8]  Gargi Dasgupta,et al.  Server Workload Analysis for Power Minimization using Consolidation , 2009, USENIX Annual Technical Conference.

[9]  Carsten Franke,et al.  XtreemOS: A Vision for a Grid Operating System , 2008 .

[10]  Jordi Torres Elastic Management of Tasks in Virtualized Environments , 2009 .

[11]  Akshat Verma,et al.  Power-aware dynamic placement of HPC applications , 2008, ICS '08.

[12]  S. Ranjan,et al.  QoS-driven server migration for Internet data centers , 2002, IEEE 2002 Tenth IEEE International Workshop on Quality of Service (Cat. No.02EX564).

[13]  David Levine,et al.  Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning , 2007, NIPS.

[14]  Alexandru Iosup,et al.  The Grid Workloads Archive , 2008, Future Gener. Comput. Syst..

[15]  V. Petrucci,et al.  Dynamic configuration support for power-aware virtualized server clusters , 2009 .

[16]  Gerald Tesauro,et al.  Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies , 2007, IEEE Internet Computing.

[17]  Christoforos E. Kozyrakis,et al.  JouleSort: a balanced energy-efficiency benchmark , 2007, SIGMOD '07.

[18]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[19]  Jordi Torres,et al.  Autonomic QoS control in enterprise Grid environments using online simulation , 2009, J. Syst. Softw..

[20]  Enrique V. Carrera,et al.  Load balancing and unbalancing for power and performance in cluster-based systems , 2001 .

[21]  Rajarshi Das,et al.  Coordinating Multiple Autonomic Managers to Achieve Specified Power-Performance Tradeoffs , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[22]  Ricardo Bianchini,et al.  Power and energy management for server systems , 2004, Computer.

[23]  Liang Liu,et al.  GreenCloud: a new architecture for green data center , 2009, ICAC-INDST '09.

[24]  David Vengerov,et al.  A Reinforcement Learning Approach to Dynamic Resource Allocation ∗ , 2005 .

[25]  Karthick Rajamani,et al.  Energy Management for Commercial Servers , 2003, Computer.

[26]  David Filani Dynamic Data Center Power Management Trends, Issues, and Solutions , 2008 .

[27]  Tao Yang,et al.  Integrated resource management for cluster-based Internet services , 2002, OSDI.

[28]  Jordi Guitart Fernández,et al.  Elastic management of tasks in virtualized environments , 2009 .

[29]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[30]  Anand Sivasubramaniam,et al.  Managing server energy and operational costs in hosting centers , 2005, SIGMETRICS '05.