A pareto-based GA for scheduling HPC applications on distributed cloud infrastructures

Reducing energy consumption is an increasingly important issue in cloud computing, more specifically when dealing with High Performance Computing (HPC). Minimizing energy consumption can significantly reduce the amount of energy bills and then increases the provider's profit. In addition, the reduction of energy decreases greenhouse gas emissions. Therefore, many researches are carried out to develop new methods in order to consume less energy. In this paper, we present a multi-objective genetic algorithm (MO-GA) that optimizes the energy consumption, CO2 emissions and the generated profit of a geographically distributed cloud computing infrastructure. We also propose a greedy heuristic that aims to maximize the number of scheduled applications in order to compare it with the MO-GA. The two approaches have been experimented using realistic workload traces from Feitelson's PWA Parallel Workload Archive. The results show that MO-GA outperforms the greedy heuristic by a significant margin in terms of energy consumption and CO2 emissions. In addition, MO-GA is also proved to be slightly better in terms of profit while scheduling more applications.

[1]  Rajkumar Buyya,et al.  Environment-conscious scheduling of HPC applications on distributed Cloud-oriented data centers , 2011, J. Parallel Distributed Comput..

[2]  Albert Y. Zomaya,et al.  Linear Combinations of DVFS-Enabled Processor Frequencies to Modify the Energy-Aware Scheduling Algorithms , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[3]  Albert Y. Zomaya,et al.  Energy efficient utilization of resources in cloud computing systems , 2010, The Journal of Supercomputing.

[4]  Laurent Lefèvre,et al.  Save Watts in Your Grid: Green Strategies for Energy-Aware Framework in Large Scale Distributed Systems , 2008, 2008 14th IEEE International Conference on Parallel and Distributed Systems.

[5]  Rajkumar Buyya,et al.  A Negotiation Mechanism for Advance Resource Reservations Using the Alternate Offers Protocol , 2008, 2008 16th Interntional Workshop on Quality of Service.

[6]  David Levine,et al.  Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning , 2007, NIPS.

[7]  Albert Y. Zomaya,et al.  Profit-Driven Service Request Scheduling in Clouds , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[8]  Albert Y. Zomaya,et al.  Minimizing Energy Consumption for Precedence-Constrained Applications Using Dynamic Voltage Scaling , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[9]  Rajkumar Buyya,et al.  A Linear Programming Driven Genetic Algorithm for Meta-Scheduling on Utility Grids , 2008, 2008 16th International Conference on Advanced Computing and Communications.

[10]  Janet L. Wiener,et al.  Cost-aware scheduling for heterogeneous enterprise machines (CASH’EM) , 2007, 2007 IEEE International Conference on Cluster Computing.

[11]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .