Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors

In this paper, we analyze the power consumption of different GPU-accelerated iterative solver implementations enhanced with energy-saving techniques. Specifically, while kernels execute on the graphics accelerator, we manually put the host system into a power-efficient idle-wait state so that it can benefit from dynamic voltage and frequency control. Although iterative refinement combined with mixed-precision arithmetic often reduces the execution time of an iterative solver on a graphics processor, the same does not necessarily hold for power consumption. To analyze the trade-off between computation time and power consumption, we compare a plain GMRES solver and its preconditioned variant with the mixed-precision iterative refinement implementations based on the respective solvers. Benchmark experiments show that using idle-wait during GPU kernel calls effectively exploits the power-management features provided by the hardware and improves the energy efficiency of the algorithm.
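To make the mixed-precision iterative refinement idea concrete, the following is a minimal sketch and not the implementation evaluated in the paper. It assumes a small dense, diagonally dominant system, emulates single precision on the CPU by rounding through `struct`, and swaps in a fixed number of Jacobi sweeps for the GMRES inner solver to keep the example short. All function names (`to_f32`, `jacobi_f32`, `refine`) are hypothetical.

```python
import struct


def to_f32(x):
    """Round a Python float (double) to the nearest single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def jacobi_f32(A, r, sweeps=20):
    """Approximately solve A c = r in emulated single precision.

    Assumption: Jacobi sweeps stand in for the low-precision inner solver
    (the paper uses GMRES); A must be diagonally dominant for convergence.
    """
    n = len(r)
    A32 = [[to_f32(a) for a in row] for row in A]
    r32 = [to_f32(v) for v in r]
    c = [0.0] * n
    for _ in range(sweeps):
        new = []
        for i in range(n):
            s = r32[i]
            for j in range(n):
                if j != i:
                    # Every intermediate result is rounded to single precision.
                    s = to_f32(s - to_f32(A32[i][j] * c[j]))
            new.append(to_f32(s / A32[i][i]))
        c = new
    return c


def residual(A, x, b):
    """Residual r = b - A x, computed in double precision."""
    return [bi - sum(aij * xj for aij, xj in zip(row, x))
            for row, bi in zip(A, b)]


def norm2(v):
    """Euclidean norm of a vector."""
    return sum(t * t for t in v) ** 0.5


def refine(A, b, tol=1e-12, max_outer=50):
    """Outer refinement loop: double-precision residual and update,
    low-precision correction solve. Returns (x, outer_iterations)."""
    x = [0.0] * len(b)
    for k in range(max_outer):
        r = residual(A, x, b)
        if norm2(r) <= tol * norm2(b):
            return x, k
        c = jacobi_f32(A, r)                      # cheap, low-precision solve
        x = [xi + ci for xi, ci in zip(x, c)]     # high-precision update
    return x, max_outer


if __name__ == "__main__":
    A = [[4.0, -1.0, 0.0], [-1.0, 4.0, -1.0], [0.0, -1.0, 4.0]]
    b = [2.0, 4.0, 10.0]  # exact solution is [1, 2, 3]
    x, outer = refine(A, b)
    print(x, outer)
```

The design point the sketch illustrates is that only the residual and the solution update are kept in double precision, while the bulk of the arithmetic runs in the cheaper format. Whether that also lowers energy use, rather than just runtime, is the question the paper examines.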
