Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures

Based on the premise that preconditioners needed for scientific computing are not only required to be robust in the numerical sense, but also scalable for up to thousands of light-weight cores, we argue that this two-fold goal is achieved for the recently developed self-adaptive multi-elimination preconditioner. For this purpose, we revise the underlying idea and analyze the performance of implementations realized in the PARALUTION and MAGMA open-source software libraries on GPU architectures (using either CUDA or OpenCL), Intel’s Many Integrated Core Architecture, and Intel’s Sandy Bridge processor. The comparison with other well-established preconditioners like multi-coloured Gauss-Seidel, ILU(0) and multi-colored ILU(0), shows that the twofold goal of a numerically stable cross-platform performant algorithm is achieved.

[1]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[2]  Yousef Saad,et al.  ILUM: A Multi-Elimination ILU Preconditioner for General Sparse Matrices , 1996, SIAM J. Sci. Comput..

[3]  Jun Zhang,et al.  BILUM: Block Versions of Multielimination and Multilevel ILU Preconditioner for General Sparse Linear Systems , 1999, SIAM J. Sci. Comput..

[4]  Michael Luby,et al.  A simple parallel algorithm for the maximal independent set problem , 1985, STOC '85.

[5]  John Michael Robson,et al.  Algorithms for Maximum Independent Sets , 1986, J. Algorithms.

[6]  John E. Stone,et al.  OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems , 2010, Computing in Science & Engineering.

[7]  Y. Saad BILUM : Block versions of multi-elimination ILU preconditioner for general sparse linear systems , 1999 .

[8]  Wei Cao,et al.  An Improved Independent Set Ordering Algorithm for Solving Large-Scale Sparse Linear Systems , 2010, 2010 Second International Conference on Intelligent Human-Machine Systems and Cybernetics.

[9]  Jack Dongarra,et al.  Multi-Elimination ILU Preconditioners on GPUs , 2014 .

[10]  Michael R. Leuze,et al.  Independent set orderings for parallel matrix factorization by Gaussian elimination , 1989, Parallel Comput..

[11]  Dimitar Lukarski,et al.  Parallel Sparse Linear Algebra for Multi-core and Many-core Platforms: Parallel Solvers and Preconditioners , 2012 .

[12]  Jack J. Dongarra,et al.  Hybrid Multi-elimination ILU Preconditioners on GPUs , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[13]  Santa Clara,et al.  Parallel Solution of Sparse Triangular Linear Systems in the Preconditioned Iterative Methods on the GPU , 2011 .