High-speed FDTD simulation algorithm for GPU with compute unified device architecture

We have proposed an FDTD algorithm for GPUs using CUDA. The GPU-FDTD algorithm performs high-speed FDTD simulation while maintaining single-precision floating-point accuracy. As the computational domain grows, however, the speedup factor decreases, which suggests that memory bandwidth is the bottleneck of the FDTD simulation. The algorithm can also be applied to 3-D FDTD simulation, and in future work we plan to extend our implementation to the 3-D case.
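The memory-bandwidth observation can be illustrated with a rough roofline-style estimate. The sketch below is not taken from our implementation: it assumes a 2-D TM-mode Yee update (fields Ez, Hx, Hy) in single precision with no on-chip data reuse, and the per-cell operation counts and the hardware figures (`PEAK_GFLOPS`, `BANDWIDTH_GB_S`) are hypothetical values chosen only for illustration.

```python
# Roofline-style estimate for one 2-D TM-mode Yee cell update in single precision.
# All counts and hardware figures are illustrative assumptions, not measured values.

BYTES_PER_FLOAT = 4  # single precision


def arithmetic_intensity(flops_per_cell, floats_moved_per_cell):
    """FLOPs performed per byte of global-memory traffic."""
    return flops_per_cell / (floats_moved_per_cell * BYTES_PER_FLOAT)


def attainable_gflops(intensity, peak_gflops, bandwidth_gb_s):
    """Roofline bound: the lower of the compute peak and bandwidth * intensity."""
    return min(peak_gflops, bandwidth_gb_s * intensity)


# Assumed per-cell cost when every operand comes from global memory:
#   Hx update: read Hx, Ez(i,j+1), Ez(i,j); write Hx        -> 4 floats, 3 FLOPs
#   Hy update: read Hy, Ez(i+1,j), Ez(i,j); write Hy        -> 4 floats, 3 FLOPs
#   Ez update: read Ez and four H neighbours; write Ez      -> 6 floats, 5 FLOPs
FLOPS_PER_CELL = 3 + 3 + 5
FLOATS_MOVED_PER_CELL = 4 + 4 + 6

# Hypothetical GPU figures (assumptions for illustration only).
PEAK_GFLOPS = 500.0
BANDWIDTH_GB_S = 100.0

intensity = arithmetic_intensity(FLOPS_PER_CELL, FLOATS_MOVED_PER_CELL)
machine_balance = PEAK_GFLOPS / BANDWIDTH_GB_S  # FLOPs per byte needed to reach peak
bound = attainable_gflops(intensity, PEAK_GFLOPS, BANDWIDTH_GB_S)
memory_bound = intensity < machine_balance

print(f"arithmetic intensity: {intensity:.3f} FLOP/byte")
print(f"machine balance:      {machine_balance:.3f} FLOP/byte")
print(f"attainable rate:      {bound:.1f} GFLOP/s (memory bound: {memory_bound})")
```

Under these assumptions the update performs far fewer FLOPs per byte than the hypothetical device needs to reach its compute peak, so the attainable rate is set by bandwidth rather than arithmetic throughput. Reusing neighbouring field values from on-chip memory would reduce the traffic per cell and raise the intensity, but the counts above do not model that.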
