Parallelized ICCG method using block-multicolor orderings in real symmetric linear system derived from voltage-driven FEM in time domain

Purpose – The purpose of this paper is to improve the performance of block-multicolor (BMC) ordering for the parallelized incomplete-Cholesky-preconditioned conjugate gradient (ICCG) method. Then, the BMC ordering based on level structure arising in reverse Cuthill-McKee RCM ordering is newly proposed. The name of proposed method is abbreviated as “RBMC”. This paper shows the validity of proposed method by comparison with greedy-based multicolor (MC) and conventional BMC on the real symmetric linear system derived from the voltage-driven finite element method in time domain. Design/methodology/approach – In RBMC, the blocking and coloring is performed level by level. The number of synchronizations in forward and backward substitution is reduced so that all blocks can be colored with two colors. However, the load-balance in forward and backward substitution might deteriorate because the irregular block matrices are distributed around diagonal. To uniformize load-balance in forward and backward substitution...

[1]  Christian Vollaire,et al.  Preconditioning techniques for the conjugate gradient solver on a parallel distributed memory computer , 1998 .

[2]  Takeshi Iwashita,et al.  Algebraic block red-black ordering method for parallelized ICCG solver with fast convergence and low communication costs , 2003 .

[3]  Hiroshi Nakashima,et al.  Algebraic Block Multi-Color Ordering Method for Parallel Multi-Threaded Sparse Triangular Solver in ICCG Method , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[4]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[5]  Takeshi Iwashita,et al.  Comparison Criteria for Parallel Orderings in ILU Preconditioning , 2005, SIAM J. Sci. Comput..

[6]  I. Duff,et al.  The effect of ordering on preconditioned conjugate gradients , 1989 .

[7]  K. Fujiwara,et al.  Acceleration of Convergence Characteristic of Iccg Method , 1992, Digest of the Fifth Biennial IEEE Conference on Electromagnetic Field Computation.

[8]  Hiroshi Nakashima,et al.  Parallel Performance of Multithreaded ICCG Solver Based on Algebraic Block Multicolor Ordering in Finite Element Electromagnetic Field Analyses , 2013, IEEE Transactions on Magnetics.

[9]  J. Meijerink,et al.  An iterative solution method for linear systems of which the coefficient matrix is a symmetric -matrix , 1977 .

[10]  Yoshifumi Okamoto,et al.  Parallelization of Preconditioned MRTR Method With Eisenstat’s Technique by Means of Algebraic Multicolor Ordering , 2015, IEEE Transactions on Magnetics.

[11]  Koji Fujiwara,et al.  Effectiveness of Higher Order Time Integration in Time-Domain Finite-Element Analysis , 2010, IEEE Transactions on Magnetics.

[12]  H. F. Jordan,et al.  Is SOR Color-Blind? , 1986 .

[13]  M. L. Barton Three-dimensional magnetic field computation on a distributed memory parallel processor , 1990 .

[14]  J. A. George Computer implementation of the finite element method , 1971 .

[15]  K. Fujiwara,et al.  Exact Minimization of Energy Functional for NR Method With Line-Search Technique , 2009, IEEE Transactions on Magnetics.

[16]  Takeshi Iwashita,et al.  Algebraic multicolor ordering for parallelized ICCG solver in finite-element analyses , 2002 .

[17]  J. S. Kowalik,et al.  An efficient parallel block conjugate method for linear equations , 1982, ICPP.