Efficient Parallel Preconditioned Conjugate Gradient Solver on GPU for FE Modeling of Electromagnetic Fields in Highly Dissipative Media

We present a performance analysis of a parallel implementation of preconditioned conjugate gradient solvers using graphic processing units with compute unified device architecture programming model. The solvers were optimized for the solution of sparse systems of equations arising from finite-element analysis of electromagnetic phenomena involved in the diffusion of underground currents in both steady state and under time-harmonic current excitation. We used both shifted incomplete Cholesky factorization and incomplete LU factorization as preconditioners. The results show a significant speedup using the graphics processing unit compared with a serial CPU implementation.

[1]  Michael Garland,et al.  Efficient Sparse Matrix-Vector Multiplication on CUDA , 2008 .

[2]  Jonas Koko,et al.  Parallel preconditioned conjugate gradient algorithm on GPU , 2012, J. Comput. Appl. Math..

[3]  Yousef Saad,et al.  GPU-accelerated preconditioned iterative linear solvers , 2013, The Journal of Supercomputing.

[4]  K. Fujiwara,et al.  Acceleration of Convergence Characteristic of Iccg Method , 1992, Digest of the Fifth Biennial IEEE Conference on Electromagnetic Field Computation.

[5]  K. Watanabe,et al.  Iterative Solver for Linear System Obtained by Edge Element: Variable Preconditioned Method With Mixed Precision on GPU , 2012, IEEE Transactions on Magnetics.

[6]  Maxim Naumov,et al.  Incomplete-LU and Cholesky Preconditioned Iterative Methods Using CUSPARSE and CUBLAS , 2012 .

[7]  Wolfgang Straßer,et al.  A Parallel Preconditioned Conjugate Gradient Solver for the Poisson Problem on a Multi-GPU Platform , 2010, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing.

[8]  Additional properties of the incomplete gauge formulation for 3-D nodal finite-element magnetostatics , 1994 .

[9]  Determination of Frequency-Dependent Characteristics of Substation Grounding Systems by Vector Finite-Element Analysis , 2006, IEEE Transactions on Magnetics.

[10]  D. Lowther,et al.  Understanding the efficiency of parallel incomplete Cholesky preconditioners on the performance of ICCG solvers for multi-core and GPU systems , 2010, Digests of the 2010 14th Biennial IEEE Conference on Electromagnetic Field Computation.

[11]  V. C. Silva,et al.  Efficient Modeling of Thin Wires in a Lossy Medium by Finite Elements Applied to Grounding Systems , 2011, IEEE Transactions on Magnetics.

[12]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .