Reducing Energy Costs for IBM Blue Gene/P via Power-Aware Job Scheduling

Energy expense is becoming increasingly dominant in the operating costs of high-performance computing (HPC) systems. At the same time, electricity prices vary significantly at different times of the day. Furthermore, job power profiles also differ greatly, especially on HPC systems. In this paper, we propose a smart, power-aware job scheduling approach for HPC systems based on variable energy prices and job power profiles. In particular, we propose a 0-1 knapsack model and demonstrate its flexibility and effectiveness for scheduling jobs, with the goal of reducing energy cost and not degrading system utilization. We design scheduling strategies for Blue Gene/P, a typical partition-based system. Experiments with both synthetic data and real job traces from production systems show that our power-aware job scheduling approach can reduce the energy cost significantly, up to 25 %, with only slight impact on system utilization.

[1]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[2]  Jordi Torres,et al.  GreenSlot: Scheduling energy consumption in green datacenters , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[3]  Thomas F. Wenisch,et al.  Power management of online data-intensive services , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[4]  Dan Tsafrir,et al.  A Short Survey of Commercial Cluster Batch Schedulers , 2005 .

[5]  Zhiling Lan,et al.  Analyzing and adjusting user runtime estimates to improve job scheduling on the Blue Gene/P , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[6]  Mikko Majanen,et al.  Energy-aware job scheduler for high-performance computing , 2012, Computer Science - Research and Development.

[7]  Ibm Blue,et al.  Overview of the IBM Blue Gene/P Project , 2008, IBM J. Res. Dev..

[8]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[9]  Hong Zhu,et al.  A survey of practical algorithms for suffix tree construction in external memory , 2010 .

[10]  Steven Skiena,et al.  Computational Discrete Mathematics: Combinatorics and Graph Theory with Mathematica ® , 2009 .

[11]  P. Sadayappan,et al.  Job fairness in non-preemptive job scheduling , 2004 .

[12]  Enrique V. Carrera,et al.  Load balancing and unbalancing for power and performance in cluster-based systems , 2001 .

[13]  Bruce M. Maggs,et al.  Cutting the electric bill for internet-scale systems , 2009, SIGCOMM '09.

[14]  Zhiling Lan,et al.  Reducing Fragmentation on Torus-Connected Supercomputers , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.

[15]  Hiroshi Nakashima,et al.  Saving 200kW and $200 K/year by power-aware job/machine scheduling , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[16]  Zhiling Lan,et al.  Measuring Power Consumption on IBM Blue Gene/Q , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[17]  Dror G. Feitelson,et al.  Utilization and Predictability in Scheduling the IBM SP2 with Backfilling , 1998, Proceedings of the First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing.

[18]  Zhiling Lan,et al.  Fault-aware, utility-based job scheduling on Blue, Gene/P systems , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[19]  Chandrakant D. Patel,et al.  Energy Aware Grid: Global Workload Placement Based on Energy Efficiency , 2003 .

[20]  Zhiling Lan,et al.  Fault-Aware Runtime Strategies for High-Performance Computing , 2009, IEEE Transactions on Parallel and Distributed Systems.

[21]  Ivan Rodero,et al.  Evaluation of Coordinated Grid Scheduling Strategies , 2009, 2009 11th IEEE International Conference on High Performance Computing and Communications.

[22]  D. Sauer,et al.  Operation conditions of batteries in PV applications , 2004 .

[23]  Dan Tsafrir,et al.  Backfilling Using System-Generated Predictions Rather than User Runtime Estimates , 2007, IEEE Transactions on Parallel and Distributed Systems.

[24]  P. Sadayappan,et al.  Unfairness Metrics for Space-Sharing Parallel Job Schedulers , 2005, JSSPP.

[25]  Wolfgang Frings,et al.  Measuring power consumption on IBM Blue Gene/P , 2011, Computer Science - Research and Development.

[26]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[27]  Wu-chun Feng,et al.  The Bladed Beowulf: a cost-effective alternative to traditional Beowulfs , 2002, Proceedings. IEEE International Conference on Cluster Computing.

[28]  Zhiling Lan,et al.  Adaptive Metric-Aware Job Scheduling for Production Supercomputers , 2012, 2012 41st International Conference on Parallel Processing Workshops.

[29]  Dario Pompili,et al.  Proactive thermal management in green datacenters , 2012, The Journal of Supercomputing.

[30]  Zhiling Lan,et al.  Evaluating Performance Impacts of Delayed Failure Repairing on Large-Scale Systems , 2011, 2011 IEEE International Conference on Cluster Computing.