Evaluating Performance and Energy on ARM-based Clusters for High Performance Computing

The High-Performance Computing (HPC) community aimed for many years at increasing performance regardless to energy consumption. However, energy is limiting the scalability of next generation supercomputers. Current HPC systems already cost huge amounts of power, in the order of a few Mega Watts (MW). The future HPC systems intend to achieve 10 to 100 times more performance, but the accepted energy to power those machines must remain below 20 MW. Therefore, the scientific community is investigating ways to improve energy efficiency. This paper presents a study of the execution time, power consumption, maximum power and energy efficiency using developer boards with ARM processors. Our objective is to verify the feasibility of clusters using processors that target low power consumption. As a sub product of our research we built an unconventional cluster of Panda Boards each one featuring two ARM Cortex A9 cores. We believe that these unconventional solutions bring an alternative base to build HPC clusters that respect the limits of electric energy.

[1]  Arthur Musah,et al.  Power-Management Techniques for OMAP35x Applications Processors , 2008 .

[2]  Gernot Heiser,et al.  Slow Down or Sleep, That Is the Question , 2011, USENIX Annual Technical Conference.

[3]  Carlo Brandolese,et al.  Source-Level Estimation of Energy Consumption and Execution Time of Embedded Software , 2008, 2008 11th EUROMICRO Conference on Digital System Design Architectures, Methods and Tools.

[4]  Trevor Mudge,et al.  MiBench: A free, commercially representative embedded benchmark suite , 2001 .

[5]  Dieter Kranzlmüller,et al.  Towards Energy Efficient Parallel Computing on Consumer Electronic Devices , 2011, ICT-GLOW.

[6]  J. Shalf,et al.  A Real Cloud Computer , 2009, IEEE Spectrum.

[7]  G. Blake,et al.  A survey of multicore processors , 2009, IEEE Signal Processing Magazine.

[8]  Antti Ylä-Jääski,et al.  Energy- and Cost-Efficiency Analysis of ARM-Based Clusters , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[9]  Pranay Prabhat,et al.  Low power memory implementation for a GHz+ Dual Core ARM Cortex A9 processor on a high-K metal gate 32nm low power process , 2011, Proceedings of 2011 International Symposium on VLSI Design, Automation and Test.

[10]  Phillip Stanley-Marbell,et al.  Performance, Power, and Thermal Analysis of Low-Power Processors for Scale-Out Systems , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[11]  Luigi Carro,et al.  Multi-core Systems on Chip , 2010, Handbook of Signal Processing Systems.

[12]  Francieli Zanon Boito,et al.  Evaluating application performance and energy consumption on hybrid CPU+GPU architecture , 2012, Cluster Computing.

[13]  Katie Roberts-Hoffman,et al.  ARM Cortex-A 8 vs . Intel Atom : Architectural and Benchmark Comparisons , .

[14]  Brad Smith ARM and Intel Battle over the Mobile Chip's Future , 2008, Computer.

[15]  Gregor von Laszewski,et al.  Efficient resource management for Cloud computing environments , 2010, International Conference on Green Computing.

[16]  Michael Lang,et al.  Using Performance Modeling to Design Large-Scale Systems , 2009, Computer.

[17]  P Kogge,et al.  The tops in flops , 2011, IEEE Spectrum.

[18]  Rachel Courtland The high stakes of low power , 2012 .