Understanding and Managing IT Power Consumption: A Measurement-Based Approach

The continuing, unsustainable increase in datacenter power consumption is causing researchers in industry and academia to be heavily invested in addressing power management challenges. This chapter presents the basic elements of a measurement-based approach toward managing distributed datacenter and cloud computing systems to meet both application and end-user needs and to obtain improved efficiency and sustainability in their operation. The main components of the approach presented include (1) continuous online monitoring, measurement and assessment of systems and applications behaviors and power consumption, including for online estimation of the power usage of virtual machines running application components in these virtualized systems; (2) the ability to perform these tasks efficiently at scale, so as to deal with the ever-increasing sizes and complexity of modern datacenter infrastructures; and (3) the importance of “coordinated” management methods that operate across multiple levels of abstraction and multiple layers of the management stack in an orchestrated manner.

[1]  Mark Crovella,et al.  Mining anomalies using traffic feature distributions , 2005, SIGCOMM '05.

[2]  Vanish Talwar,et al.  Online detection of utility cloud anomalies using metric distributions , 2010, 2010 IEEE Network Operations and Management Symposium - NOMS 2010.

[3]  Vipin Chaudhary,et al.  VMeter: Power modelling for virtualized clouds , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW).

[4]  Karsten Schwan,et al.  Vpm tokens: virtual machine-aware power budgeting in datacenters , 2008, HPDC '08.

[5]  Willy Zwaenepoel,et al.  Diagnosing performance overheads in the xen virtual machine environment , 2005, VEE '05.

[6]  Guruduth Banavar,et al.  Gryphon: An Information Flow Based Approach to Message Brokering , 1998, ArXiv.

[7]  George Candea,et al.  Microreboot - A Technique for Cheap Recovery , 2004, OSDI.

[8]  Karsten Schwan,et al.  E2EProf: Automated End-to-End Performance Management for Enterprise Systems , 2007, 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07).

[9]  Miguel Castro,et al.  SCRIBE: The Design of a Large-Scale Event Notification Infrastructure , 2001, Networked Group Communication.

[10]  Dejan S. Milojicic,et al.  Moara: Flexible and Scalable Group-Based Querying System , 2008, Middleware.

[11]  Andrea C. Arpaci-Dusseau,et al.  Information and control in gray-box systems , 2001, SOSP.

[12]  Ada Gavrilovska,et al.  VM power metering: feasibility and challenges , 2011, PERV.

[13]  Chandrakant D. Patel,et al.  Estimating Environmental Costs , 2010, SustainIT.

[14]  Andrea C. Arpaci-Dusseau,et al.  Controlling Your PLACE in the File System with Gray-box Techniques , 2003, USENIX Annual Technical Conference, General Track.

[15]  Christophe Diot,et al.  Diagnosing network-wide traffic anomalies , 2004, SIGCOMM.

[16]  Robbert van Renesse,et al.  Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining , 2003, TOCS.

[17]  Akshat Verma,et al.  WattApp: an application aware power meter for shared data centers , 2010, ICAC '10.

[18]  Karsten Schwan,et al.  Look Who's Talking: Discovering Dependencies between Virtual Machines Using CPU Utilization , 2010, HotCloud.

[19]  Indranil Gupta,et al.  MON: On-Demand Overlays for Distributed System Management , 2005, WORLDS.

[20]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[21]  Robert J. Fowler,et al.  SoftPower: fine-grain power estimations using performance counters , 2010, HPDC '10.

[22]  David E. Culler,et al.  The ganglia distributed monitoring system: design, implementation, and experience , 2004, Parallel Comput..

[23]  S. M. Shahidehpour,et al.  Transaction analysis in deregulated power systems using game theory , 1997 .

[24]  Eric A. Brewer,et al.  Pinpoint: problem determination in large, dynamic Internet services , 2002, Proceedings International Conference on Dependable Systems and Networks.

[25]  Feng Zhao,et al.  Virtual machine power metering and provisioning , 2010, SoCC '10.

[26]  Vanish Talwar,et al.  vManage: loosely coupled platform and virtualization management in data centers , 2009, ICAC '09.

[27]  Willy Zwaenepoel,et al.  Performance and scalability of EJB applications , 2002, OOPSLA '02.

[28]  Jeffrey S. Chase,et al.  Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.

[29]  Karsten Schwan,et al.  A state-space approach to SLA based management , 2008, NOMS 2008 - 2008 IEEE Network Operations and Management Symposium.

[30]  Rajarshi Das,et al.  Utility-function-driven energy-efficient cooling in data centers , 2010, ICAC '10.

[31]  Karsten Schwan,et al.  SysProf: Online Distributed Behavior Diagnosis through Fine-grain System Monitoring , 2006, 26th IEEE International Conference on Distributed Computing Systems (ICDCS'06).

[32]  Nagarajan Kandasamy,et al.  Power and Performance Management of Virtualized Computing Environments Via Lookahead Control , 2008, ICAC.

[33]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[34]  Paramvir Bahl,et al.  Towards highly reliable enterprise network services via inference of multi-level dependencies , 2007, SIGCOMM.

[35]  Karsten Schwan,et al.  CoolIT: coordinating facility and it management for efficient datacenters , 2008, CLUSTER 2008.

[36]  Marcos K. Aguilera,et al.  Performance debugging for distributed systems of black boxes , 2003, SOSP '03.

[37]  Richard Mortier,et al.  Using Magpie for Request Extraction and Workload Modelling , 2004, OSDI.

[38]  Karsten Schwan,et al.  Coordinated Optimization of Cooling and IT Power in Data Centers , 2010 .

[39]  Sandeep Uttamchandani,et al.  ChargeView: An integrated tool for implementing chargeback in IT systems , 2008, NOMS 2008 - 2008 IEEE Network Operations and Management Symposium.

[40]  Sujata Banerjee,et al.  A Power Benchmarking Framework for Network Devices , 2009, Networking.