WEIGHTED BLOCK-ASYNCHRONOUS RELAXATION FOR GPU-ACCELERATED SYSTEMS

In this paper, we analyze the potential of using weights for block-asynchronous re- laxation methods on GPUs. For this purpose, we introduce dierent weighting techniques similar to those applied in block-smoothers for multigrid methods. Having proven a sucient convergence condition for the weighted block-asynchronous iteration, we analyze the performance of the algo- rithms implemented using CUDA and compare them with weighted synchronous relaxation schemes like SOR. For test matrices taken from the University of Florida Matrix Collection we report the convergence behavior and the total runtime for the dierent weighting techniques. Analyzing the results, we observe that using weights may accelerate the convergence rate of block-asynchronous iteration considerably. This shows the high potential of using weights in block-asynchronous iter- ation for numerically solving linear systems of equations fullling certain convergence conditions. While component-wise relaxation methods are seldom directly applied to linear equation systems, using them as smoother in a multigrid framework they often provide an important contribution to nite element solvers. Since the parallelization potential of the classical smoothers like SOR and Gauss-Seidel is usually very limited, replacing them with block-asynchronous smoothers may have a considerable impact on the overall multigrid performance. Due to the explosion of parallelism in to- day's architecture designs, the signicance and the need for highly parallel asynchronous smoothers, as the ones described in this work, is expected to grow.

[1]  D. Szyld,et al.  Asynchronous two-stage iterative methods , 1994 .

[2]  L. Elsner,et al.  On the convergence of asynchronous paracontractions with application to tomographic reconstruction from incomplete data , 1990 .

[3]  D. E. Baz,et al.  Asynchronous iterations with flexible communication: contracting operators , 2005 .

[4]  Gérard M. Baudet,et al.  Asynchronous Iterative Methods for Multiprocessors , 1978, JACM.

[5]  Michel Dubois,et al.  The Run-Time Efficiency of Parallel Asynchronous Algorithms , 1991, IEEE Trans. Computers.

[6]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[7]  Michel Dubois,et al.  Generalized Asynchronous Iterations , 1986, CONPAR.

[8]  Debasis Mitra,et al.  A chaotic asynchronous algorithm for computing the fixed point of a nonnegative matrix of unit spectral radius , 1986, JACM.

[9]  L. Eisner Convergence of sequential and asynchronous nonlinear paracontractions , 2005 .

[10]  A. Frommer Generalized nonlinear diagonal dominance and applications to asynchronous iterative methods , 1991 .

[11]  Michel Dubois,et al.  Sufficient conditions for the convergence of asynchronous iterations , 1989, Parallel Computing.

[12]  Todd Gamblin,et al.  Scaling Algebraic Multigrid Solvers: On the Road to Exascale , 2010, CHPC.

[13]  Eric W. Weisstein Hundred-Dollar, Hundred-Digit Challenge Problems , 2002 .

[14]  Robert D. Falgout,et al.  Multigrid Smoothers for Ultra-Parallel Computing , 2011 .

[15]  D. Szyld Different Models Of Parallel Asynchronous Iterations With Overlapping Blocks , 1998 .

[16]  Roberto Bagnara,et al.  A Unified Proof for the Convergence of Jacobi and Gauss-Seidel Methods , 1995, SIAM Rev..

[17]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[18]  A. Frommer,et al.  Asynchronous parallel methods for enclosing solutions of nonlinear equations , 1995 .

[19]  M. Tarazi Some convergence results for asynchronous algorithms , 1982 .

[20]  J. Gillis,et al.  Matrix Iterative Analysis , 1961 .

[21]  Ulrike Meier Yang,et al.  On the use of relaxation parameters in hybrid smoothers , 2004, Numer. Linear Algebra Appl..

[22]  Vincent Heuveline,et al.  A Block-Asynchronous Relaxation Method for Graphics Processing Units , 2011 .

[23]  Daniel B. Szyld,et al.  Block and asynchronous two-stage methods for mildly nonlinear systems , 1999, Numerische Mathematik.

[24]  D. Szyld,et al.  On asynchronous iterations , 2000 .

[25]  J. Strikwerda A convergence theorem for chaotic asynchronous relaxation , 1997 .