论文信息 - GPU Parallelization of a High Order Immersed Boundary Method Fluid Solver

GPU Parallelization of a High Order Immersed Boundary Method Fluid Solver

A GPU parallelized high order immersed boundary method fluid solver is developed. Memory management, asynchronous, and algorithm optimization are required to have the highest GPU speed-up potential. Task parallelization must also be implemented through asynchronous and host parallelization (OpenMP). The Poisson solver is the speed-up bottle neck for high convergence iteration count. For small Poisson solver iteration count, the 5th order WENO scheme restricts speed-up. An overall speed-up of ∼4.9 is obtained for a single time step. Speed-up increases with grid size. Multi GPU parallelization requires OpenMP to decrease the GPUs’ idle time. With two GPUs, the increase in speed-up is ∼84.5%, with respect to single GPU, for the largest grid size currently examined.

Thomas L. Jackson | Ju Zhang | Antoine M.D. Jost

[1] Rajat Mittal,et al. A versatile sharp interface immersed boundary method for incompressible flows with complex boundaries , 2008, J. Comput. Phys..

[2] Seyong Lee,et al. Early evaluation of directive-based GPU programming models for productive exascale computing , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[3] John Shalf,et al. The International Exascale Software Project roadmap , 2011, Int. J. High Perform. Comput. Appl..

[4] Kyle E. Niemeyer,et al. Accelerating moderately stiff chemical kinetics in reactive-flow simulations using GPUs , 2013, J. Comput. Phys..

[5] P. Moin,et al. Application of a Fractional-Step Method to Incompressible Navier-Stokes Equations , 1984 .

[6] Gianluca Iaccarino,et al. IMMERSED BOUNDARY METHODS , 2005 .

[7] Thomas L. Jackson,et al. A high-order incompressible flow solver with WENO , 2009, J. Comput. Phys..

[8] Chi-Wang Shu,et al. Efficient Implementation of Weighted ENO Schemes , 1995 .

[9] M.Y. Hussaini,et al. Low-Dissipation and Low-Dispersion Runge-Kutta Schemes for Computational Acoustics , 1994 .

[10] William J. Dally,et al. GPUs and the Future of Parallel Computing , 2011, IEEE Micro.