Energy Efficient Workflow Job Scheduling for Green Cloud

The elastic resource provision, no interfering resource sharing and flexible customized configuration provided by the Cloud infrastructure has shed light on efficient execution of many scientific applications modeled as Directed Acyclic Graph (DAG) structured workflows. However, the energy cost on running the increasingly deployed Cloud data centers around the globe together with the amount of CO2 emissions have skyrocketed. In order to maintain sustainable Cloud computing facing with ever-increasing problem complexity and big data size, we propose an energy-efficient scientific workflow scheduling algorithm to minimize energy consumption and CO2 emission while satisfying certain Quality of Service (QoS). Our multiple-step resource provision and allocation algorithm applies Dynamic Voltage and Frequency Scaling (DVFS) technology to reduce energy consumption within acceptable performance bounds, and minimize the Virtual Machine (VM) overhead for further reduced energy consumption and higher resource utilization rate. The candidacy of multiple data centers from the energy and performance efficiency perspectives is also evaluated. The simulation results show that our algorithm is able to achieve an average up to 30% of energy savings and increase the resource utilization rate for about 25% leading to higher profit and less CO2 emissions.

[1]  Ying Lu,et al.  Efficient Power Management of Heterogeneous Soft Real-Time Clusters , 2008, 2008 Real-Time Systems Symposium.

[2]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[3]  Xue Liu,et al.  Dynamic Voltage Scaling in Multitier Web Servers with End-to-End Delay Control , 2007, IEEE Transactions on Computers.

[4]  Rajkumar Buyya,et al.  Power Aware Scheduling of Bag-of-Tasks Applications with Deadline Constraints on DVS-enabled Clusters , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[5]  Richard Wolski,et al.  The Eucalyptus Open-Source Cloud-Computing System , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[6]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[7]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[8]  Ming Mao,et al.  A Performance Study on the VM Startup Time in the Cloud , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[9]  Jeffrey S. Chase,et al.  Making Scheduling "Cool": Temperature-Aware Workload Placement in Data Centers , 2005, USENIX Annual Technical Conference, General Track.

[10]  Janet L. Wiener,et al.  Cost-aware scheduling for heterogeneous enterprise machines (CASH’EM) , 2007, 2007 IEEE International Conference on Cluster Computing.

[11]  Xi He,et al.  Power-aware scheduling of virtual machines in DVFS-enabled clusters , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[12]  David Filani Dynamic Data Center Power Management Trends, Issues, and Solutions , 2008 .

[13]  Richard E. Brown,et al.  Report to Congress on Server and Data Center Energy Efficiency: Public Law 109-431 , 2008 .