Parallel Implementation of the Finite Element Method on Graphics Processors for the Solution of Incompressible Flows

