Krylov space iterative solvers on graphics processing units

CUDA architecture was introduced by Nvidia three years ago and since then there have been many promising publications demonstrating a huge potential of Graphics Processing Units (GPUs) in scientific computations. In this paper, we investigate the performance of iterative methods such as cg, minres, gmres, bicg that may be used to solve large sparse real and complex systems of equations arising in computational electromagnetics.

[1]  M. Okoniewski,et al.  GPU Accelerated Krylov Subspace Methods for Computational Electromagnetics , 2008, 2008 38th European Microwave Conference.

[2]  Michal Mrozowski Guided Electromagnetic Waves: Properties and Analysis , 1997 .

[3]  S. Adams,et al.  Finite Difference Time Domain (FDTD) Simulations Using Graphics Processors , 2007, 2007 DoD High Performance Computing Modernization Program Users Group Conference.

[4]  Jack Dongarra,et al.  Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects , 2009 .

[5]  Zhaoming Zhu,et al.  Full-vectorial finite-difference analysis of microstructured optical fibers. , 2002, Optics express.

[6]  L. Kulas,et al.  Implementation of matrix-type FDTD algorithm on a graphics accelerator , 2008, MIKON 2008 - 17th International Conference on Microwaves, Radar and Wireless Communications.

[7]  Michael Garland,et al.  Efficient Sparse Matrix-Vector Multiplication on CUDA , 2008 .

[8]  Guillaume Caumon,et al.  Concurrent number cruncher: a GPU implementation of a general sparse linear solver , 2009, Int. J. Parallel Emergent Distributed Syst..

[9]  M.M. Okoniewski,et al.  Acceleration of finite-difference time-domain (FDTD) using graphics processor units (GPU) , 2004, 2004 IEEE MTT-S International Microwave Symposium Digest (IEEE Cat. No.04CH37535).

[10]  Rajesh Bordawekar,et al.  Optimizing Sparse Matrix-Vector Multiplication on GPUs using Compile-time and Run-time Strategies , 2008 .

[11]  Michael Garland,et al.  Implementing sparse matrix-vector multiplication on throughput-oriented processors , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[12]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .