Challenges of Scaling Algebraic Multigrid Across Modern Multicore Architectures

Algebraic multigrid (AMG) is a popular solver for large-scale scientific computing and an essential component of many simulation codes. AMG has shown to be extremely efficient on distributed-memory architectures. However, when executed on modern multicore architectures, we face new challenges that can significantly deteriorate AMG's performance. We examine its performance and scalability on three disparate multicore architectures: a cluster with four AMD Opteron Quad-core processors per node (Hera), a Cray XT5 with two AMD Opteron Hex-core processors per node (Jaguar), and an IBM Blue Gene/P system with a single Quad-core processor (Intrepid). We discuss our experiences on these platforms and present results using both an MPI-only and a hybrid MPI/OpenMP model. We also discuss a set of techniques that helped to overcome the associated problems, including thread and process pinning and correct memory associations.

[1]  Aslak Tveito,et al.  Numerical solution of partial differential equations on parallel computers , 2006 .

[2]  W. Marsden I and J , 2012 .

[3]  Robert D. Falgout,et al.  The Design and Implementation of hypre, a Library of Parallel High Performance Preconditioners , 2006 .

[4]  R.D. Falgout,et al.  An Introduction to Algebraic Multigrid Computing , 2006, Computing in Science & Engineering.

[5]  Robert D. Falgout,et al.  Pursuing scalability for hypre's conceptual interfaces , 2004, TOMS.

[6]  Edmond Chow,et al.  A Survey of Parallelization Techniques for Multigrid Solvers , 2006, Parallel Processing for Scientific Computing.

[7]  U. Yang,et al.  Distance-two interpolation for parallel algebraic multigrid , 2007 .

[8]  Hans De Sterck,et al.  Reducing Complexity in Parallel Algebraic Multigrid Preconditioners , 2004, SIAM J. Matrix Anal. Appl..

[9]  Robert D. Falgout,et al.  hypre: A Library of High Performance Preconditioners , 2002, International Conference on Computational Science.

[10]  Ulrike Meier Yang,et al.  On long‐range interpolation operators for aggressive coarsening , 2009, Numer. Linear Algebra Appl..

[11]  Ulrike Meier Yang,et al.  Parallel Algebraic Multigrid Methods — High Performance Preconditioners , 2006 .

[12]  Martin Schulz,et al.  On the Performance of an Algebraic Multigrid Solver on Multicore Clusters , 2010, VECPAR.

[13]  Roland Masson,et al.  Parallel Preconditioning for Sedimentary Basin Simulations , 2003, LSSC.

[14]  Xing Qiu,et al.  Hierarchical Parallelization of Gene Differential Association Analysis , 2011, BMC Bioinformatics.

[15]  StübenKlaus Algebraic multigrid (AMG) , 1983 .

[16]  D FalgoutRobert An Introduction to Algebraic Multigrid , 2006 .

[17]  G. Berti,et al.  A FINITE ELEMENT BASED TOOL CHAIN FOR THE PLANNING AND SIMULATION OF MAXILLOFACIAL SURGERY , 2004 .