EDR: An energy-aware runtime load distribution system for data-intensive applications in the cloud

Data centers account for a growing percentage of US power consumption. Energy efficiency is now a first-class design constraint for the data centers that support cloud services. Service providers must distribute their data efficiently across multiple data centers. This includes creation of data replicas that provide multiple copies of data for efficient access. However, selecting replicas to maximize performance while minimizing energy waste is an open problem. State of the art replica selection approaches either do not address energy, lack scalability and/or are vulnerable to crashes due to use of a centralized coordinator. Therefore, we propose, develop and evaluate a simple cost-oriented decentralized replica selection system named EDR (Energy-Aware Distributed Running system), implemented with two distributed optimization algorithms. We demonstrate experimentally the cost differences in various replica selection scenarios and show that our novel approach is as fast as the best available decentralized approach DONAR, while additionally considering dynamic energy costs. We show that an average of 12% savings on total system energy costs can be achieved by using EDR for several data intensive applications.

[1]  Nian-Feng Tzeng,et al.  Run-time Energy Consumption Estimation Based on Workload in Server Systems , 2008, HotPower.

[2]  Christian Belady,et al.  GREEN GRID DATA CENTER POWER EFFICIENCY METRICS: PUE AND DCIE , 2008 .

[3]  Shuaiwen Song,et al.  Iso-Energy-Efficiency: An Approach to Power-Constrained Parallel Computation , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.

[4]  Sujata Banerjee,et al.  Energy Aware Network Operations , 2009, IEEE INFOCOM Workshops 2009.

[5]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[6]  Karthick Rajamani,et al.  On evaluating request-distribution schemes for saving energy in server clusters , 2003, 2003 IEEE International Symposium on Performance Analysis of Systems and Software. ISPASS 2003..

[7]  Sergiu Nedevschi,et al.  Reducing Network Energy Consumption via Sleeping and Rate-Adaptation , 2008, NSDI.

[8]  Suman Nath,et al.  Energy-Aware Server Provisioning and Load Dispatching for Connection-Intensive Internet Services , 2008, NSDI.

[9]  Shuaiwen Song,et al.  System-level power-performance efficiency modeling for emergent GPU architectures , 2012, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT).

[10]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[11]  Rajkumar Buyya,et al.  Energy Efficient Resource Management in Virtualized Cloud Data Centers , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[12]  C. Mas Machuca,et al.  Energy Profile Aware Routing , 2009, 2009 IEEE International Conference on Communications Workshops.

[13]  Albert G. Greenberg,et al.  The cost of a cloud: research problems in data center networks , 2008, CCRV.

[14]  Robert Tappan Morris,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM '04.

[15]  Thu D. Nguyen,et al.  Cost-and Energy-Aware Load Distribution Across Data Centers , 2009 .

[16]  Ulas C. Kozat,et al.  Dynamic resource allocation and power management in virtualized data centers , 2010, 2010 IEEE Network Operations and Management Symposium - NOMS 2010.

[17]  L. Benini,et al.  Analysis of power consumption on switch fabrics in network routers , 2002, Proceedings 2002 Design Automation Conference (IEEE Cat. No.02CH37324).

[18]  Asfandyar Qureshi Plugging Into Energy Market Diversity , 2008, HotNets.

[19]  Xue Liu,et al.  Minimizing Electricity Cost: Optimization of Distributed Internet Data Centers in a Multi-Electricity-Market Environment , 2010, 2010 Proceedings IEEE INFOCOM.

[20]  Zongpeng Li,et al.  Youtube traffic characterization: a view from the edge , 2007, IMC '07.

[21]  Asuman E. Ozdaglar,et al.  Constrained Consensus and Optimization in Multi-Agent Networks , 2008, IEEE Transactions on Automatic Control.

[22]  Ricardo Bianchini,et al.  Power and energy management for server systems , 2004, Computer.

[23]  Marty Humphrey,et al.  An automated approach to cloud storage service selection , 2011, ScienceCloud '11.

[24]  Rajkumar Buyya,et al.  Energy-Efficient Management of Data Center Resources for Cloud Computing: A Vision, Architectural Elements, and Open Challenges , 2010, PDPTA.

[25]  Xiao Qin,et al.  An Energy-Efficient Framework for Large-Scale Parallel Storage Systems , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[26]  Lachlan L. H. Andrew,et al.  Greening geographical load balancing , 2011, PERV.

[27]  Hyeonsang Eom,et al.  Towards Energy Proportional Cloud for Data Processing Frameworks , 2010, SustainIT.

[28]  Ian Sommerville,et al.  Workload Classification & Software Energy Measurement for Efficient Scheduling on Private Cloud Platforms , 2011, ArXiv.

[29]  Srini Seetharaman Energy conservation in multi-tenant networks through power virtualization , 2010 .

[30]  Erol Gelenbe,et al.  Energy-Efficient Cloud Computing , 2010, Comput. J..

[31]  Klaus-Dieter Lange,et al.  ASSESSING TRENDS OVER TIME IN PERFORMANCE , COSTS , AND ENERGY USE FOR SERVERS , 2009 .

[32]  Patrick Wendell,et al.  DONAR: decentralized server selection for cloud services , 2010, SIGCOMM '10.

[33]  J. Koomey Worldwide electricity used in data centers , 2008 .