A Volume Integral Equation Stokes Solver for Problems with Variable Coefficients

We present a novel numerical scheme for solving the Stokes equation with variable coefficients in the unit box. Our scheme is based on a volume integral equation formulation. Compared to finite element methods, our formulation decouples the velocity and pressure, generates velocity fields that are by construction divergence free to high accuracy and its performance does not depend on the order of the basis used for discretization. In addition, we employ a novel adaptive fast multipole method for volume integrals to obtain a scheme that is algorithmically optimal. Our scheme supports non-uniform discretizations and is spectrally accurate. To increase per node performance, we have integrated our code with both NVIDIA and Intel accelerators. In our largest scalability test, we solved a problem with 20 billion unknowns, using a 14-order approximation for the velocity, on 2048 nodes of the Stampede system at the Texas Advanced Computing Center. We achieved 0.656 peta FLOPS for the overall code (23% efficiency) and one peta FLOPS for the volume integrals (33% efficiency). As an application example, we simulate Stokes ow in a porous medium with highly complex pore structure using a penalty formulation to enforce the no slip condition.

[1]  Jonathan J. Hu,et al.  ML 3.1 smoothed aggregation user's guide. , 2004 .

[2]  Leslie Greengard,et al.  An Integral Equation Approach to the Incompressible Navier-Stokes Equations in Two Dimensions , 1998, SIAM J. Sci. Comput..

[3]  Georg Stadler,et al.  Large-scale adaptive mantle convection simulation , 2013 .

[4]  Olof B. Widlund,et al.  Domain Decomposition Algorithms for Indefinite Elliptic Problems , 2017, SIAM J. Sci. Comput..

[5]  Jed Brown,et al.  Efficient Nonlinear Solvers for Nodal High-Order Finite Elements in 3D , 2010, J. Sci. Comput..

[6]  Timothy C. Warburton,et al.  Extreme-Scale AMR , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[7]  Richard W. Vuduc,et al.  A massively parallel adaptive fast-multipole method on heterogeneous architectures , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[8]  Gene H. Golub,et al.  Numerical solution of saddle point problems , 2005, Acta Numerica.

[9]  R. Kress,et al.  Inverse Acoustic and Electromagnetic Scattering Theory , 1992 .

[10]  Dimitri J. Mavriplis,et al.  A GPU-based incompressible Navier–Stokes solver on moving overset grids , 2013 .

[11]  Georg Stadler,et al.  Parallel scalable adjoint-based adaptive solution of variable-viscosity Stokes flow problems , 2009 .

[12]  C. Pozrikidis Boundary Integral and Singularity Methods for Linearized Viscous Flow: Index , 1992 .

[13]  Leslie Greengard,et al.  A free-space adaptive fmm-based pde solver in three dimensions , 2011 .

[14]  Leslie Greengard,et al.  A New Fast-Multipole Accelerated Poisson Solver in Two Dimensions , 2001, SIAM J. Sci. Comput..

[15]  Luca Heltai,et al.  The deal.II Library, Version 8.1 , 2013, ArXiv.

[16]  Barry V. Hess,et al.  Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis , 2010, HiPC 2010.

[17]  Richard W. Vuduc,et al.  Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[18]  Jonathan J. Hu,et al.  ML 5.0 Smoothed Aggregation Users's Guide , 2006 .

[19]  L. Trefethen Spectral Methods in MATLAB , 2000 .

[20]  Piotr K. Smolarkiewicz,et al.  Pores resolving simulation of Darcy flows , 2010, J. Comput. Phys..

[21]  Martin Kronbichler,et al.  High accuracy mantle convection simulation through modern numerical methods , 2012 .

[22]  W. Bangerth,et al.  deal.II—A general-purpose object-oriented finite element library , 2007, TOMS.

[23]  D. Zorin,et al.  A kernel-independent adaptive fast multipole algorithm in two and three dimensions , 2004 .

[24]  Philippe Angot,et al.  A penalization method to take into account obstacles in incompressible viscous flows , 1999, Numerische Mathematik.

[25]  DHAIRYA MALHOTRA A A Distributed Memory Fast Multipole Method for Volume Potentials , 2014 .

[26]  J. CARRIERt,et al.  A FAST ADAPTIVE MULTIPOLE ALGORITHM FOR PARTICLE SIMULATIONS * , 2022 .

[27]  Rainer Kress,et al.  Newton’s method for inverse obstacle scattering meets the method of least squares , 2003 .