A multi-GPU method for ADI-based fractional-step integration of incompressible Navier-Stokes equations

Abstract A computational method for GPU-accelerated fractional-step integration of incompressible Navier-Stokes equations based on the Alternating Direction Implicit (ADI) method is presented. Non-iterative, direct solution methods used in the semi-implicit fractional-step method take advantage of tridiagonal systems and Fourier transform whose solution can be computed using fast algorithms on a single GPU. However, when data is distributed to multiple GPUs, all-to-all matrix transposition is required, which increases computational cost significantly. In this work, a new strategy that does not require all-to-all transposition is proposed. The computational domain is divided in the wall-normal direction, and decoupled tridiagonal systems are obtained using Parallel Diagonal Dominant (PDD) and Parallel Partition (PPT) methods. An optimal batch size is determined to maximize the performance of PDD and PPT methods within a given amount of GPU memory. Strengths and weaknesses of this type of domain decomposition are investigated in comparison to conventional ways of dividing the domain along streamwise or spanwise directions. Using 8 NVIDIA Tesla P100 GPUs, the utility of the present method is demonstrated in a direct numerical simulation (DNS) of a canonical zero-pressure-gradient turbulent boundary layer and a DNS of a K-type boundary-layer transition on 1.4 billion grid cells.

[1]  Parviz Moin,et al.  An Efficient Method for Temporal Integration of the Navier-Stokes Equations in Confined Axisymmetric Geometries , 1996 .

[2]  P. Moin,et al.  Direct numerical simulation of turbulence in a nominally zero-pressure-gradient flat-plate boundary layer , 2009, Journal of Fluid Mechanics.

[3]  Cornelis Vuik,et al.  Toward a GPU-aware comparison of explicit and implicit CFD simulations on structured meshes , 2017, Comput. Math. Appl..

[4]  Javier Jiménez,et al.  A code for direct numerical simulation of turbulent boundary layers at high Reynolds numbers in BG/P supercomputers , 2013 .

[5]  I. Duff,et al.  Direct Methods for Sparse Matrices , 1987 .

[6]  H. Sung,et al.  Influence of large-scale motions on the frictional drag in a turbulent boundary layer , 2017, Journal of Fluid Mechanics.

[7]  Eng Leong Tan,et al.  Fundamental Locally One-Dimensional Method for 3-D Thermal Simulation , 2014, IEICE Trans. Electron..

[8]  Donghyun You,et al.  A GPU-accelerated semi-implicit fractional-step method for numerical solutions of incompressible Navier-Stokes equations , 2017, J. Comput. Phys..

[9]  Matteo Bernardini,et al.  STREAmS: A high-fidelity accelerated solver for direct numerical simulation of compressible turbulent flows , 2020, Comput. Phys. Commun..

[10]  Application of vortex identification schemes to direct numerical simulation data of a transitional boundary layer , 2013 .

[11]  E. Sparrow,et al.  Handbook of Numerical Heat Transfer , 1988 .

[12]  Jung-Il Choi,et al.  PaScaL_TDMA: A library of parallel and scalable solvers for massive tridiagonal systems , 2021, Comput. Phys. Commun..

[13]  Javier Jiménez,et al.  Near-wall turbulence , 2013 .

[14]  P. Moin,et al.  Transitional–turbulent spots and turbulent–turbulent spots in boundary layers , 2017, Proceedings of the National Academy of Sciences.

[15]  Hyung Jin Sung,et al.  Direct numerical simulation of a turbulent boundary layer up to Reθ = 2500 , 2011 .

[16]  Dan S. Henningson,et al.  Turbulent boundary layers up to Reθ=2500 studied through simulation and experiment , 2009 .

[17]  Yuichi Matsuo,et al.  Direct Numerical Simulation of a Fully Developed Turbulent Channel Flow With Respect to the Reynolds Number Dependence , 2001 .

[18]  Massimiliano Fatica,et al.  AFiD-GPU: A versatile Navier-Stokes solver for wall-bounded turbulent flows on GPU clusters , 2017, Comput. Phys. Commun..

[19]  Lionel M. Ni,et al.  Parallel algorithms for solution of tridiagonal systems on multicomputers , 1989, ICS '89.

[20]  P. Moin,et al.  Direct numerical simulation of complete H-type and K-type transitions with implications for the dynamics of turbulent boundary layers , 2013, Journal of Fluid Mechanics.

[21]  T. Lund,et al.  Generation of Turbulent Inflow Data for Spatially-Developing Boundary Layer Simulations , 1998 .

[22]  P. Moin,et al.  Application of a Fractional-Step Method to Incompressible Navier-Stokes Equations , 1984 .

[23]  Myoungkyu Lee,et al.  Petascale direct numerical simulation of turbulent channel flow on up to 786K cores , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[24]  J. Douglas,et al.  A general formulation of alternating direction methods , 1964 .

[25]  Massimiliano Fatica,et al.  GPU acceleration of CaNS for massively-parallel direct numerical simulations of canonical fluid flows , 2021, Comput. Math. Appl..

[26]  Ramis Örlü,et al.  Assessment of direct numerical simulation data of turbulent boundary layers , 2010, Journal of Fluid Mechanics.

[27]  Javier Jiménez,et al.  A high-resolution code for turbulent boundary layers , 2009, J. Comput. Phys..

[28]  Eng Leong Tan,et al.  Fundamental Schemes for Efficient Unconditionally Stable Implicit Finite-Difference Time-Domain Methods , 2008, IEEE Transactions on Antennas and Propagation.

[29]  R. Moser,et al.  One-point statistics for turbulent wall-bounded flows at Reynolds numbers up to δ+ ≈ 2000 , 2013 .

[30]  Seonghyeon Hahn,et al.  Direct numerical simulation of turbulent channel flow with permeable walls , 2002, Journal of Fluid Mechanics.

[31]  D Joslin Ronald,et al.  Large-Eddy Simulation of Transition to Turbulence in Boundary Layers , 1997 .

[32]  Parviz Moin,et al.  An improvement of fractional step methods for the incompressible Navier-Stokes equations , 1991 .

[33]  Javier Urzay,et al.  HTR solver: An open-source exascale-oriented task-based multi-GPU high-order code for hypersonic aerothermodynamics , 2020, Comput. Phys. Commun..

[34]  Pedro Costa,et al.  A FFT-based finite-difference solver for massively-parallel direct numerical simulations of turbulent flows , 2018, Comput. Math. Appl..

[35]  U. Piomelli,et al.  Large-Eddy Simulation of Transition to Turbulence in Boundary Layers , 1997 .

[36]  Anthony Randriamampianina,et al.  Higher-order compact scheme for high-performance computing of stratified rotating flows , 2018, Computers & Fluids.

[37]  Dominik Obrist,et al.  High-order accurate simulation of incompressible turbulent flows on many parallel GPUs of a hybrid-node supercomputer , 2019, Comput. Phys. Commun..

[38]  Hermann F. Fasel,et al.  Non-parallel stability of a flat-plate boundary layer using the complete Navier-Stokes equations , 1990, Journal of Fluid Mechanics.

[39]  Xian-He Sun Application and Accuracy of the Parallel Diagonal Dominant Algorithm , 1995, Parallel Comput..

[40]  Haecheon Choi,et al.  Unsteady Simulation of Jets in a Cross Flow , 1997 .