Node level power measurements on a petaflop system

The complexity and size of scientific and engineering challenges are continually increasing to the point where they are approaching exascale computing. One of the main challenges is to be able to run scientific software applications on these extremely large systems in an energy efficient manner, as power consumption will become one of the dominant cost factors that will govern the next generation of large high performance computing data centers. In this paper, we present the results of node level power measurements on a petaflop HPC system.

[1]  Efraim Rotem,et al.  Power-Management Architecture of the Intel Microarchitecture Code-Named Sandy Bridge , 2012, IEEE Micro.

[2]  Gerhard Wellein,et al.  Overhead Analysis of Performance Counter Measurements , 2014, 2014 43rd International Conference on Parallel Processing Workshops.

[3]  Wolfgang E. Nagel,et al.  Power measurement techniques on standard compute nodes: A quantitative comparison , 2013, 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[4]  Ananta Tiwari,et al.  Modeling Power and Energy Usage of HPC Kernels , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[5]  Joseph Manzano,et al.  Optimizing irregular applications for energy and performance on the Tilera many-core architecture , 2015, Conf. Computing Frontiers.

[6]  Frank B. Schmuck,et al.  GPFS: A Shared-Disk File System for Large Computing Clusters , 2002, FAST.

[7]  Karthikeyan Sankaralingam,et al.  Dark Silicon and the End of Multicore Scaling , 2012, IEEE Micro.

[8]  Mark Bohr,et al.  A 30 Year Retrospective on Dennard's MOSFET Scaling Paper , 2007, IEEE Solid-State Circuits Newsletter.

[9]  Torsten Wilde,et al.  A power-measurement methodology for large-scale, high-performance computing , 2014, ICPE.

[10]  Torsten Wilde,et al.  Predicting the Energy and Power Consumption of Strong and Weak Scaling HPC Applications , 2014, Supercomput. Front. Innov..

[11]  Steve R. Kleiman,et al.  SnapMirror: File-System-Based Asynchronous Mirroring for Disaster Recovery , 2002, FAST.

[12]  Arndt Bode,et al.  Extreme Scaling Workshop at the LRZ , 2013, PARCO.

[13]  Torsten Wilde,et al.  A Case Study of Energy Aware Scheduling on SuperMUC , 2014, ISC.

[14]  Thomas Ilsche,et al.  Power measurements for compute nodes: Improving sampling rates, granularity and accuracy , 2015, 2015 Sixth International Green and Sustainable Computing Conference (IGSC).

[15]  Rong Ge,et al.  Power and energy profiling of scientific applications on distributed systems , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[16]  John Shalf,et al.  Power efficiency in high performance computing , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[17]  Martin Schulz,et al.  Beyond DVFS: A First Look at Performance under a Hardware-Enforced Power Bound , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[18]  Carmen B. Navarrete,et al.  Autotuning the energy consumption , 2013, PARCO.

[19]  Carlo Cavazzoni,et al.  DWPE, a new data center energy-efficiency metric bridging the gap between infrastructure and workload , 2014, 2014 International Conference on High Performance Computing & Simulation (HPCS).

[20]  Thomas Schwenkler,et al.  Intelligent Platform Management Interface , 2006 .

[21]  Bernd Mohr,et al.  The Mont-Blanc Project: First Phase Successfully Finished , 2015, ArXiv.

[22]  Jack J. Dongarra,et al.  Power monitoring with PAPI for extreme scale architectures and dataflow-based programming models , 2014, 2014 IEEE International Conference on Cluster Computing (CLUSTER).

[23]  Shirley Moore,et al.  Measuring Energy and Power with PAPI , 2012, 2012 41st International Conference on Parallel Processing Workshops.

[24]  Torsten Wilde,et al.  Predicting energy consumption relevant indicators of strong scaling HPC applications for different compute resource configurations , 2015, SpringSim.

[25]  Robert Schöne,et al.  Introducing FIRESTARTER: A processor stress test utility , 2013, 2013 International Green Computing Conference Proceedings.