Self-Tuned Software-Managed Energy Reduction in InfiniBand Links

One of the biggest challenges in high-performance computing is to reduce the power and energy consumption. Research in energy efficiency has focused mainly on energy consumption at the node level. Less attention has been given to the interconnect, which is becoming a significant source of energy-inefficiency. Although supercomputers undoubtedly require a high-performance interconnect, previous work has shown that network links have low average utilization. It is therefore possible to save energy using low-power modes, but link wake-up latencies must not lead to a loss in performance. This paper proposes the Self-tuned Pattern Prediction System (SPPS), a self-tuned algorithm for energy proportionality, which reduces interconnect energy consumption without needing any application-specific configuration parameters. The algorithm uses prediction to discover repetitive patterns in the application's communication, and it is implemented inside the MPI library, so that existing MPI programs do not need to be modified. We build on previous work, which showed how the application structure can be successfully exploited to predict the communication idle intervals. The previous work, however, required the manual adjustment of a critical idle interval length, whose value depends on the application and has a major effect on energy savings. The new technique automatically discovers the optimal value of this parameter, resulting in a self-tuned algorithm that obtains large interconnect energy savings at little performance cost. We study the effectiveness of our approach using ten real applications and benchmarks. Our simulations show average energy savings in the network links of up to 21%. Moreover, the link wake-up latencies and additional computation times have a negligible effect on performance, with an average penalty less than 1%.

[1]  German Rodriguez,et al.  Trace-driven co-simulation of high-performance computing systems using OMNeT++ , 2009, SIMUTools 2009.

[2]  Mahmut T. Kandemir,et al.  Energy optimization techniques in cluster interconnects , 2003, ISLPED '03.

[3]  John Shawe-Taylor,et al.  Fast string matching using an n‐gram algorithm , 1994, Softw. Pract. Exp..

[4]  Li Shang,et al.  Dynamic voltage scaling with links for power optimization of interconnection networks , 2003, The Ninth International Symposium on High-Performance Computer Architecture, 2003. HPCA-9 2003. Proceedings..

[5]  Kathleen M. Nichols Performance tools , 1990, IEEE Software.

[6]  Li-Shiuan Peh,et al.  Software-directed power-aware interconnection networks , 2005, CASES '05.

[7]  Torsten Hoefler Software and Hardware Techniques for Power-Efficient HPC Networking , 2010, Computing in Science & Engineering.

[8]  Jian Li,et al.  Power shifting in Thrifty Interconnection Network , 2011, 2011 IEEE 17th International Symposium on High Performance Computer Architecture.

[9]  Hong Liu,et al.  Energy proportional datacenter networks , 2010, ISCA.

[10]  Paul M. Carpenter,et al.  Software-Managed Power Reduction in Infiniband Links , 2014, 2014 43rd International Conference on Parallel Processing.

[11]  Cyriel Minkenberg,et al.  Trace-driven co-simulation of high-performance computing systems using OMNeT++ , 2009, SimuTools.

[12]  Chung-Ta King,et al.  Application-Driven End-to-End Traffic Predictions for Low Power NoC Design , 2013, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[13]  Mahmut T. Kandemir,et al.  Exploiting last idle periods of links for network power management , 2005, EMSOFT.

[14]  D.K. Lowenthal,et al.  Adaptive, Transparent Frequency and Voltage Scaling of Communication Phases in MPI Programs , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[15]  Jesús Labarta,et al.  DiP: A Parallel Program Development Environment , 1996, Euro-Par, Vol. II.

[16]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[17]  Karthikeyan P. Saravanan,et al.  A performance perspective on energy efficient HPC links , 2014, ICS '14.

[18]  Karthikeyan P. Saravanan,et al.  Power/performance evaluation of energy efficient Ethernet (EEE) for High Performance Computing , 2013, 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[19]  Pedro López,et al.  Power saving in regular interconnection networks , 2010, Parallel Comput..