Green-aware workload scheduling in geographically distributed data centers

Renewable (or green) energy, such as solar or wind, has at least partially powered data centers to reduce the environmental impact of traditional energy sources (brown energy with high carbon footprint). In this paper, we propose a holistic workload scheduling algorithm to minimize the brown energy consumption across multiple geographically distributed data centers with renewable energy sources. While green energy supply for a single data center is intermittent due to daily/seasonal effects, our workload scheduling algorithm is aware of different amounts of green energy supply and dynamically schedules the workload across data centers. The scheduling decision adapts to workload and data center cooling dynamics. Our experiments with real workload traces demonstrate that our scheduling algorithm greatly reduces brown energy consumption by up to 40% in comparison with other scheduling policies.

[1]  Rishan Chen,et al.  Improving large graph processing on partitioned graphs in the cloud , 2012, SoCC '12.

[2]  Anand Sivasubramaniam,et al.  Benefits and limitations of tapping into stored energy for datacenters , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[3]  Prashant J. Shenoy,et al.  Cloudy Computing: Leveraging Weather Forecasts in Energy Harvesting Sensor Systems , 2010, 2010 7th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks (SECON).

[4]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[5]  Bingsheng He,et al.  Wave Computing in the Cloud , 2009, HotOS.

[6]  Mark D. Hill,et al.  Virtual hierarchies to support server consolidation , 2007, ISCA '07.

[7]  Hai Jin,et al.  Towards Pay-As-You-Consume Cloud Computing , 2011, 2011 IEEE International Conference on Services Computing.

[8]  Chao Li,et al.  iSwitch: Coordinating and optimizing renewable energy powered server clusters , 2012, 2012 39th Annual International Symposium on Computer Architecture (ISCA).

[9]  J. Koomey Worldwide electricity used in data centers , 2008 .

[10]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[11]  Albert G. Greenberg,et al.  The cost of a cloud: research problems in data center networks , 2008, CCRV.

[12]  Bingsheng He,et al.  Distributed Systems Meet Economics: Pricing in the Cloud , 2010, HotCloud.

[13]  Thu D. Nguyen,et al.  Reducing electricity cost through virtual machine placement in high performance computing clouds , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[14]  Christopher Stewart,et al.  Some Joules Are More Precious Than Others: Managing Renewable Energy in the Datacenter∗ , 2009 .

[15]  Thu D. Nguyen,et al.  Cost-and Energy-Aware Load Distribution Across Data Centers , 2009 .

[16]  Prashant J. Shenoy,et al.  CloudNet: dynamic pooling of cloud resources by live WAN migration of virtual machines , 2011, VEE.

[17]  Dick H. J. Epema,et al.  Cost-driven scheduling of grid workflows using Partial Critical Paths , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[18]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[19]  Bingsheng He,et al.  Comet: batched stream processing for data intensive distributed computing , 2010, SoCC '10.

[20]  Hai Jin,et al.  Maestro: Replica-Aware Map Scheduling for MapReduce , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[21]  Margaret Martonosi,et al.  Capping the brown energy consumption of Internet services at low cost , 2010, International Conference on Green Computing.

[22]  Tajana Rosing,et al.  Utilizing green energy prediction to schedule mixed batch and service jobs in data centers , 2011, OPSR.

[23]  Jordi Torres,et al.  GreenHadoop: leveraging green energy in data-processing frameworks , 2012, EuroSys '12.

[24]  Jordi Torres,et al.  GreenSlot: Scheduling energy consumption in green datacenters , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[25]  Chao Li,et al.  Characterizing and analyzing renewable energy driven data centers , 2011, PERV.

[26]  Yanpei Chen,et al.  Integrating Renewable Energy Using Data Analytics Systems: Challenges and Opportunities , 2011, IEEE Data Eng. Bull..

[27]  Marty Humphrey,et al.  Auto-scaling to minimize cost and meet application deadlines in cloud workflows , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[28]  Yefu Wang,et al.  GreenWare: Greening Cloud-Scale Data Centers to Maximize the Use of Renewable Energy , 2011, Middleware.