Hardware-Based Efficiency Advances in the EXA-DUNE Project

We present advances concerning efficient finite element assembly and linear solvers on current and upcoming HPC architectures obtained in the frame of the Exa-Dune project, part of the DFG priority program 1648 Software for Exascale Computing (SPPEXA). In this project, we aim at the development of both flexible and efficient hardware-aware software components for the solution of PDEs based on the DUNE platform and the FEAST library. In this contribution, we focus on node-level performance and accelerator integration, which will complement the proven MPI-level scalability of the framework. The higher-level aspects of the Exa-Dune project, in particular multiscale methods and uncertainty quantification, are detailed in the companion paper (Bastian et al., Advances concerning multiscale methods and uncertainty quantification in Exa-Dune. In: Proceedings of the SPPEXA Symposium, 2016).

[1]  Oliver Bröker,et al.  Sparse approximate inverse smoothers for geometric and algebraic multigrid , 2002 .

[2]  Richard W. Vuduc,et al.  Model-driven autotuning of sparse matrix-vector multiply on GPUs , 2010, PPoPP '10.

[3]  Katharina Kormann,et al.  A generic interface for parallel cell-based finite element operator application , 2012 .

[4]  Gerhard Wellein,et al.  A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units , 2013, SIAM J. Sci. Comput..

[5]  Volker Lindenstruth,et al.  Vc: A C++ library for explicit vectorization , 2012, Softw. Pract. Exp..

[6]  Stefan Turek,et al.  Advances Concerning Multiscale Methods and Uncertainty Quantification in EXA-DUNE , 2016, Software for Exascale Computing.

[7]  Andreas Dedner,et al.  A generic grid interface for parallel and adaptive scientific computing. Part II: implementation and tests in DUNE , 2008, Computing.

[8]  Gilles Fourestey,et al.  SPAI Preconditioners for HPC Applications , 2012 .

[9]  A. Ern,et al.  A discontinuous Galerkin method with weighted averages for advection–diffusion equations with locally small and anisotropic diffusivity , 2008 .

[10]  Stefan Turek,et al.  EXA-DUNE: Flexible PDE Solvers, Numerical Methods and Applications , 2014, Euro-Par Workshops.

[11]  Stefan Turek,et al.  FEAST—realization of hardware-oriented numerics for HPC simulations with finite elements , 2010, ISC 2010.

[12]  Jens Markus Melenk,et al.  Fully discrete hp-finite elements: fast quadrature , 2001 .

[13]  Steffen Müthing,et al.  Integrating Multi-threading and Accelerators into DUNE-ISTL , 2013, ENUMATH.

[14]  Marcus J. Grote,et al.  Parallel Preconditioning with Sparse Approximate Inverses , 1997, SIAM J. Sci. Comput..

[15]  Stefan Turek,et al.  Towards a complete FEM-based simulation toolkit on GPUs: Unstructured grid finite element geometric multigrid solvers with strong smoothers based on sparse approximate inverses , 2013 .

[16]  Gerhard Wellein,et al.  A unified sparse matrix data format for modern processors with wide SIMD units , 2013, ArXiv.

[17]  Christian Engwer,et al.  Scalable Hybrid Parallelization Strategies for the DUNE Grid Interface , 2013, ENUMATH.

[18]  Andreas Dedner,et al.  A generic grid interface for parallel and adaptive scientific computing. Part I: abstract framework , 2008, Computing.