Accuracy, Memory, and Speed Strategies in GPU-Based Finite-Element Matrix-Generation

This letter presents strategies for optimizing graphics processing unit (GPU)-based generation of the finite-element matrices that arise in the finite element method (FEM) with higher-order curvilinear elements. The goal is to speed up the evaluation and assembly of large finite-element matrices on a single GPU while keeping the accuracy of numerical integration at the desired level. To this end, we discuss the choice of optimal Gaussian quadratures for curvilinear finite elements with respect to the accuracy, memory usage, and runtime of numerical integration. We also show how to exploit the symmetry of the local mass and stiffness matrices efficiently on a GPU in the numerical integration step. Performance results, obtained on a workstation equipped with a single Tesla C2075 GPU, indicate that the proposed strategies retain the accuracy of the computations, allow larger sparse linear systems to be generated, and provide a 2.5-fold acceleration of GPU-based finite-element matrix generation.
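The symmetry idea mentioned in the abstract can be illustrated with a minimal CPU-side sketch. It is not the letter's GPU implementation: it assumes first-order (linear) basis functions on the reference triangle rather than higher-order curvilinear elements, uses a standard 3-point symmetric Gaussian rule, and all function names (`basis`, `local_mass_full`, `local_mass_symmetric`, `unpack_symmetric`) are hypothetical. It only shows that integrating the upper triangle of a local mass matrix and mirroring it reproduces the full matrix with fewer stored entries.

```python
# Illustrative sketch only (assumptions: linear basis functions on the
# reference triangle, 3-point symmetric Gauss rule exact for degree 2).
# The names below are hypothetical and do not come from the letter.

# Quadrature points and weights; the weights sum to the triangle area 1/2.
QUAD_POINTS = [(1 / 6, 1 / 6), (2 / 3, 1 / 6), (1 / 6, 2 / 3)]
QUAD_WEIGHTS = [1 / 6, 1 / 6, 1 / 6]


def basis(x, y):
    """Linear nodal basis functions evaluated at (x, y)."""
    return (1.0 - x - y, x, y)


def local_mass_full(det_j=1.0):
    """Integrate all n*n entries of the local mass matrix."""
    n = 3
    m = [[0.0] * n for _ in range(n)]
    for (x, y), w in zip(QUAD_POINTS, QUAD_WEIGHTS):
        phi = basis(x, y)
        for i in range(n):
            for j in range(n):
                m[i][j] += w * det_j * phi[i] * phi[j]
    return m


def local_mass_symmetric(det_j=1.0):
    """Integrate only the upper triangle, stored row by row in packed form."""
    n = 3
    packed = [0.0] * (n * (n + 1) // 2)
    for (x, y), w in zip(QUAD_POINTS, QUAD_WEIGHTS):
        phi = basis(x, y)
        k = 0
        for i in range(n):
            for j in range(i, n):  # j >= i: skip the mirrored entries
                packed[k] += w * det_j * phi[i] * phi[j]
                k += 1
    return packed


def unpack_symmetric(packed, n=3):
    """Expand packed upper-triangular storage into a full symmetric matrix."""
    m = [[0.0] * n for _ in range(n)]
    k = 0
    for i in range(n):
        for j in range(i, n):
            m[i][j] = packed[k]
            m[j][i] = packed[k]
            k += 1
    return m
```

For an n-by-n local matrix the packed form holds n(n+1)/2 entries instead of n², so both the storage and the number of accumulated products per quadrature point shrink accordingly; how this maps onto GPU threads and memory is the subject of the letter and is not modeled here.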
