Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG

The last decade has seen power consumption move from an afterthought to the foremost design constraint of new supercomputers. Measuring the power of a supercomputer can be a daunting proposition, and as a result, many published measurements are extrapolated. This paper explores the validity of these extrapolations in the context of inter-node power variability and power variations over time within a run. We characterize power variability across nodes in systems at eight supercomputer centers across the globe. This characterization shows that the current requirement for measurements submitted to the Green500 and others is insufficient, allowing variations of up to 20% due to measurement timing and a further 10--15% due to insufficient sample sizes. This paper proposes new power and energy measurement requirements for supercomputers, some of which have been accepted for use by the Green500 and Top500, to ensure consistent accuracy.

[1]  Sunita Chandrasekaran,et al.  SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance , 2014, PMBS@SC.

[2]  Wu-chun Feng,et al.  Understanding Power Measurement Implications in the Green500 List , 2010, 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing.

[3]  Wolfgang E. Nagel,et al.  Power measurement techniques on standard compute nodes: A quantitative comparison , 2013, 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[4]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[5]  Kevin Skadron,et al.  Rodinia: A benchmark suite for heterogeneous computing , 2009, 2009 IEEE International Symposium on Workload Characterization (IISWC).

[6]  Torsten Wilde,et al.  A power-measurement methodology for large-scale, high-performance computing , 2014, ICPE.

[7]  John D. Davis,et al.  Including Variability in Large-Scale Cluster Power Models , 2012, IEEE Computer Architecture Letters.

[8]  Matthias S. Müller,et al.  Quantifying power consumption variations of HPC systems using SPEC MPI benchmarks , 2010, Computer Science - Research and Development.

[9]  Stephen W. Poole,et al.  Power measurement for high performance computing: State of the art , 2011, 2011 International Green Computing Conference and Workshops.

[10]  Volker Lindenstruth,et al.  Lattice-CSC: Optimizing and Building an Efficient Supercomputer for Lattice-QCD and to Achieve First Place in Green500 , 2015, ISC.

[11]  Rajesh K. Gupta,et al.  Accurate Characterization of the Variability in Power Consumption in Modern Mobile Processors , 2012, HotPower.

[12]  Martin Schulz,et al.  Beyond DVFS: A First Look at Performance under a Hardware-Enforced Power Bound , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[13]  Matthias S. Müller,et al.  SPEC OMP2012 - An Application Benchmark Suite for Parallel Systems Using OpenMP , 2012, IWOMP.

[14]  John D. Davis,et al.  Accounting for Variability in Large-Scale Cluster Power Models , 2011 .

[15]  John Shalf,et al.  Power efficiency in high performance computing , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[16]  Robert Schöne,et al.  Introducing FIRESTARTER: A processor stress test utility , 2013, 2013 International Green Computing Conference Proceedings.