Characterizing the Performance-Energy Tradeoff of Small ARM Cores in HPC Computation

Deploying large numbers of small, low-power cores has been gaining traction recently as a system design strategy in high performance computing (HPC). The ARM platform that dominates the embedded and mobile computing segments is now being considered as an alternative to high-end x86 processors that largely dominate HPC because peak performance per watt may be substantially improved using off-the-shelf commodity processors.

[1]  John Goodacre Technology Preview: The ARMv8 Architecture , 2011 .

[2]  Mitesh R. Meswani,et al.  Reducing Energy Usage with Memory and Computation-Aware Dynamic Frequency Scaling , 2011, Euro-Par.

[3]  John Goodacre The evolution of the ARM architecture towards big data and the data-centre (abstract only) , 2013, VHPC '13.

[4]  João Correia Lopes,et al.  High Performance Computing for Computational Science - VECPAR 2010 - 9th International conference, Berkeley, CA, USA, June 22-25, 2010, Revised Selected Papers , 2011, VECPAR.

[5]  William Gropp,et al.  Exascale Research: Preparing for the Post-Moore Era , 2011 .

[6]  John Shalf,et al.  Exascale Computing Technology Challenges , 2010, VECPAR.

[7]  Dieter Kranzlmüller,et al.  Towards Energy Efficient Parallel Computing on Consumer Electronic Devices , 2011, ICT-GLOW.

[8]  Antti Ylä-Jääski,et al.  Energy- and Cost-Efficiency Analysis of ARM-Based Clusters , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[9]  Urs Hölzle,et al.  Brawny cores still beat wimpy cores, most of the time , 2010 .

[10]  Xingfu Wu,et al.  SWAPP: A Framework for Performance Projections of HPC Applications Using Benchmarks , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.

[11]  Simon D. Hammond,et al.  Analysis of Cray XC30 Performance Using Trinity-NERSC-8 Benchmarks and Comparison with Cray XE6 and IBM BG/Q , 2013, PMBS@SC.

[12]  Michael Laurenzano,et al.  How well can simple metrics represent the performance of HPC applications? , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[13]  Laura Carrington,et al.  A performance prediction framework for scientific applications , 2003, Future Gener. Comput. Syst..

[14]  Alejandro Rico,et al.  Experiences with mobile processors for energy efficient HPC , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[15]  Samuel Williams,et al.  The Landscape of Parallel Computing Research: A View from Berkeley , 2006 .

[16]  Sandia Report,et al.  Improving Performance via Mini-applications , 2009 .

[17]  W. Paul Vogt,et al.  The SAGE Dictionary of Statistics & Methodology: A Nontechnical Guide for the Social Sciences , 2015 .

[18]  Darren J. Kerbyson,et al.  A Performance Model of the Parallel Ocean Program , 2005, Int. J. High Perform. Comput. Appl..

[19]  Emmanuel Jeannot,et al.  Euro-Par 2011 Parallel Processing , 2011, Lecture Notes in Computer Science.

[20]  Thomas Lippert,et al.  Trends in supercomputing: The European path to exascale , 2011, Comput. Phys. Commun..

[21]  Jesús Labarta,et al.  A Framework for Performance Modeling and Prediction , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[22]  Karthikeyan Sankaralingam,et al.  Power struggles: Revisiting the RISC vs. CISC debate on contemporary ARM and x86 architectures , 2013, 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA).

[23]  Michael Laurenzano,et al.  PEBIL: Efficient static binary instrumentation for Linux , 2010, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS).

[24]  Julien Langou,et al.  Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems , 2007, Int. J. High Perform. Comput. Appl..

[25]  A Min Tjoa,et al.  Information and Communication on Technology for the Fight against Global Warming - First International Conference, ICT-GLOW 2011, Toulouse, France, August 30-31, 2011. Proceedings , 2011, ICT-GLOW.

[26]  David H. Bailey,et al.  The NAS parallel benchmarks summary and preliminary results , 1991, Proceedings of the 1991 ACM/IEEE Conference on Supercomputing (Supercomputing '91).

[27]  Jack Dongarra,et al.  Computational Science — ICCS 2003 , 2003, Lecture Notes in Computer Science.

[28]  Fabrizio Petrini,et al.  Predictive Performance and Scalability Modeling of a Large-Scale Application , 2001, ACM/IEEE SC 2001 Conference (SC'01).