Hadoop in Flight: Migrating Live MapReduce Jobs for Power-Shifting Data Centers

Renewable energy sources such as wind and solar are unpredictable for power utilities, which must produce exactly as much power as is needed at any given time. To help manage the demand, some utilities have begun deploying real-time energy prices to their customers. Data centers, which often run Hadoop jobs on thousands of machines, have become some of the utilities' largest consumers. In fact, recent studies have shown that, when processing at full capacity, data centers can require as much power as a mid-sized U. S. city. By implementing a method in which data centers can offload their work to locations on different power grids, they can take advantage of the lower-priced energy and thereby minimize operational costs. To this end, we have designed and implemented a new mechanism directly within the Hadoop 2 codebase that allows users to pause, migrate, and resume a job at arbitrary points of execution. We have evaluated this scheme using popular applications and show that energy can be delayed and shifted to a different location with reasonable overheads. Our experiments justify the migration use-case, showing that it saves both energy and time over either restarting the job remotely or allowing it to complete locally.

[1]  Christopher Stewart,et al.  Some Joules Are More Precious Than Others: Managing Renewable Energy in the Datacenter∗ , 2009 .

[2]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[3]  Lachlan L. H. Andrew,et al.  Online algorithms for geographical load balancing , 2012, 2012 International Green Computing Conference (IGCC).

[4]  Yefu Wang,et al.  Electricity Bill Capping for Cloud-Scale Data Centers that Impact the Power Markets , 2012, 2012 41st International Conference on Parallel Processing.

[5]  Thu D. Nguyen,et al.  Reducing electricity cost through virtual machine placement in high performance computing clouds , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[6]  Christoforos E. Kozyrakis,et al.  On the energy (in)efficiency of Hadoop clusters , 2010, OPSR.

[7]  Prashant J. Shenoy,et al.  Rules of thumb in data engineering , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[8]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[9]  J. Koomey Worldwide electricity used in data centers , 2008 .

[10]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[11]  Erol Gelenbe,et al.  Energy-Efficient Cloud Computing , 2010, Comput. J..

[12]  Yang Li,et al.  Towards dynamic pricing-based collaborative optimizations for green data centers , 2013, 2013 IEEE 29th International Conference on Data Engineering Workshops (ICDEW).

[13]  Hamidreza Zareipour,et al.  Data centres in the ancillary services market , 2012, 2012 International Green Computing Conference (IGCC).

[14]  Xue Liu,et al.  Minimizing Electricity Cost: Optimization of Distributed Internet Data Centers in a Multi-Electricity-Market Environment , 2010, 2010 Proceedings IEEE INFOCOM.

[15]  Christopher Stewart,et al.  Electric grid balancing through lowcost workload migration , 2012, PERV.

[16]  Jordi Torres,et al.  GreenHadoop: leveraging green energy in data-processing frameworks , 2012, EuroSys '12.

[17]  Lachlan L. H. Andrew,et al.  Greening Geographical Load Balancing , 2015, IEEE/ACM Transactions on Networking.

[18]  Niv Buchbinder,et al.  Online Job-Migration for Reducing the Electricity Bill in the Cloud , 2011, Networking.

[19]  Bingsheng He,et al.  Green-aware workload scheduling in geographically distributed data centers , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[20]  Albert Y. Zomaya,et al.  Energy-efficient data replication in cloud computing datacenters , 2013, 2013 IEEE Globecom Workshops (GC Wkshps).

[21]  Liang Liu,et al.  GreenCloud: a new architecture for green data center , 2009, ICAC-INDST '09.

[22]  Gagan Agrawal,et al.  Time and Cost Sensitive Data-Intensive Computing on Hybrid Clouds , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[23]  Rajkumar Buyya,et al.  A taxonomy of Data Grids for distributed data sharing, management, and processing , 2005, CSUR.

[24]  Jordi Torres,et al.  GreenSlot: Scheduling energy consumption in green datacenters , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[25]  Bruce M. Maggs,et al.  Cutting the electric bill for internet-scale systems , 2009, SIGCOMM '09.

[26]  David E. Culler,et al.  The ganglia distributed monitoring system: design, implementation, and experience , 2004, Parallel Comput..

[27]  Karsten Schwan,et al.  VirtualPower: coordinated power management in virtualized enterprise systems , 2007, SOSP.

[28]  Rajkumar Buyya,et al.  Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing , 2012, Future Gener. Comput. Syst..

[29]  Andy Hopper,et al.  Free Lunch: Exploiting Renewable Energy for Computing , 2011, HotOS.