Block-asynchronous Multigrid Smoothers for GPU-accelerated Systems

Abstract This paper explores the need for asynchronous iteration algorithms as smoothers in multigrid methods. The hardware target for the new algorithms is top-of-the-line, highly parallel hybrid architectures – multicore-based systems enhanced with GPGPUs. These architectures are the most likely candidates for future high-end supercomputers. To pave the road for their effcient use, we must resolve challenges related to the fact that data movement, not floatingpoint operations, is the bottleneck to performance. Our work is in this direction — we designed block-asynchronous multigrid smoothers that perform more flops in order to reduce synchronization, and hence data movement. We show that the extra flops are done for “free,” while synchronization is reduced and the convergence properties of multigrid with classical smoothers like Gauss-Seidel can be preserved.

[1]  William L. Briggs,et al.  A multigrid tutorial , 1987 .

[2]  Aslak Tveito,et al.  Numerical solution of partial differential equations on parallel computers , 2006 .

[3]  Ulrike Meier Yang,et al.  On the use of relaxation parameters in hybrid smoothers , 2004, Numer. Linear Algebra Appl..

[4]  Daniel B. Szyld,et al.  Asynchronous Iterations , 2011, Encyclopedia of Parallel Computing.

[5]  Todd Gamblin,et al.  Scaling Algebraic Multigrid Solvers: On the Road to Exascale , 2010, CHPC.

[6]  Robert D. Falgout,et al.  Multigrid Smoothers for Ultra-Parallel Computing , 2011 .

[7]  R. Varga Geršgorin And His Circles , 2004 .

[8]  Daniel B. Szyld,et al.  Block and asynchronous two-stage methods for mildly nonlinear systems , 1999, Numerische Mathematik.

[9]  C. Kelley Iterative Methods for Linear and Nonlinear Equations , 1987 .

[10]  Long Chen INTRODUCTION TO MULTIGRID METHODS , 2005 .

[11]  Michel Dubois,et al.  Generalized Asynchronous Iterations , 1986, CONPAR.

[12]  Dimitri P. Bertsekas,et al.  Distributed Asynchronous Relaxation Methods for Linear Network Flow Problems , 1987 .

[13]  J. Strikwerda A convergence theorem for chaotic asynchronous relaxation , 1997 .

[14]  Michel Dubois,et al.  Sufficient conditions for the convergence of asynchronous iterations , 1989, Parallel Computing.

[15]  V. E. Henson,et al.  BoomerAMG: a parallel algebraic multigrid solver and preconditioner , 2002 .

[16]  Jack J. Dongarra,et al.  A Block-Asynchronous Relaxation Method for Graphics Processing Units , 2011, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.