Acceleration study for the FDTD method using SSE and AVX instructions

In this paper, we discuss the acceleration effect for the FDTD method using SSE and AVX instructions. We developed three types of code: code developed by C language, code accelerated by SSE instructions and code accelerated by AVX instructions. We present the results of a performance study of the FDTD algorithm on Intel i5 Sandy Bridge. The results show that we can improve the performance of FDTD algorithm using SSE and AVX instructions.

[1]  Wenhua. Wenhua Yu ... . Yu,et al.  Parallel Finite-Difference Time-Domain Method , 2006 .

[2]  K. Yee Numerical solution of initial boundary value problems involving maxwell's equations in isotropic media , 1966 .

[3]  G. Mur Absorbing Boundary Conditions for the Finite-Difference Approximation of the Time-Domain Electromagnetic-Field Equations , 1981, IEEE Transactions on Electromagnetic Compatibility.

[4]  Atef Z. Elsherbeni,et al.  Compute unified device architecture (CUDA) based finite-difference time-domain (FDTD) implementation , 2010 .

[5]  Wei Shao,et al.  Parallelized ADI-FDTD algorithm for attenuation constant extraction by using OpenMP library , 2010, 2010 International Conference on Microwave and Millimeter Wave Technology.

[6]  Tomoyoshi Ito,et al.  Improved performance of FDTD computation using a thread block constructed as a two-dimensional array with CUDA , 2010 .

[7]  David A. Bader,et al.  A novel FDTD application featuring OpenMP-MPI hybrid parallelization , 2004 .

[8]  Stephen D. Gedney,et al.  Convolution PML (CPML): An efficient FDTD implementation of the CFS–PML for arbitrary media , 2000 .

[9]  Haider R. Khaleel,et al.  Enhanced Parallel FDTD Method Using SSE Instruction Sets , 2012, The Applied Computational Electromagnetics Society Journal (ACES).

[10]  Allen Taflove,et al.  Computational Electrodynamics the Finite-Difference Time-Domain Method , 1995 .

[11]  Y. Zhang,et al.  Study on the Optimum Virtual Topology for MPI based Parallel Conformal FDTD Algorithm on PC Clusters , 2005 .