A Hybrid Approach for Solving the 3D Helmholtz Equation on Heterogeneous Platforms

We are interested in the resolution of the 3D Helmholtz equation for real applications. Solving this problem numerically is a computational challenge due to the large memory requirements of the matrices and vectors involved.For these cases, the massive parallelism of GPU architectures and the high performance at lower energy of the multicores can be exploited. To do a fair comparison between the benefit of accelerating the three-dimensional Helmholtz equation using GPU architectures and multicore platforms, this paper describes three different parallelization schemes on a multi-GPU cluster and also includes an evaluation of their performance. The three parallel schemes consist of:(1) using the multicore processors (CPU version), (2) using the GPU devices (GPU version); and (3) using a hybrid implementation which combines CPU cores and GPU devices simultaneously (hybrid version).Experimental results show that our hybrid implementation outperforms the other approaches in terms of performance.

[1]  C. DeWitt-Morette,et al.  Mathematical Analysis and Numerical Methods for Science and Technology , 1990 .

[2]  D. Feit,et al.  Sound, structures, and their interaction (2nd edition) , 1986 .

[3]  Inanc Senocak,et al.  An MPI-CUDA Implementation for Massively Parallel Incompressible Flow Computations on Multi-GPU Clusters , 2010 .

[4]  Matthew N. O. Sadiku,et al.  Numerical Techniques in Electromagnetics , 2000 .

[5]  Ivo Babuška,et al.  Solution of Helmholtz problems by knowledge-based FEM , 1997 .

[6]  C. Lanczos An iteration method for the solution of the eigenvalue problem of linear differential and integral operators , 1950 .

[7]  R. Duraiswami,et al.  Fast Multipole Methods for the Helmholtz Equation in Three Dimensions , 2005 .

[8]  Cornelis Vuik,et al.  GPU implementation of a Helmholtz Krylov solver preconditioned by a shifted Laplace multigrid method , 2011, J. Comput. Appl. Math..

[9]  Shan Zhao,et al.  Numerical solution of the Helmholtz equation with high wavenumbers , 2004 .

[10]  Stefan A. Sauter,et al.  Is the Pollution Effect of the FEM Avoidable for the Helmholtz Equation Considering High Wave Numbers? , 1997, SIAM Rev..

[11]  Ramani Duraiswami,et al.  CHAPTER 5 – Fast Multipole Methods , 2004 .

[12]  Inmaculada García,et al.  High performance computing for Optical Diffraction Tomography , 2012, 2012 International Conference on High Performance Computing & Simulation (HPCS).

[13]  Alexey L. Lastovetsky Special issue of Journal of Parallel and Distributed Computing: Heterogeneity in parallel and distributed computing , 2012, J. Parallel Distributed Comput..

[14]  Francisco Vázquez,et al.  The BiConjugate gradient method on GPUs , 2012, The Journal of Supercomputing.