论文信息 - Performance Evaluation of the Three-Dimensional Finite-Difference Time-Domain(FDTD) Method on Fermi Architecture GPUs

Performance Evaluation of the Three-Dimensional Finite-Difference Time-Domain(FDTD) Method on Fermi Architecture GPUs

GPUs excel at solving many parallel problems and hence dramatically increase the computation performance. In electrodynamics and many other fields, FDTD method is widely used due to its simplicity, accuracy, and practicability. In this paper, we applied the FDTD method on the Fermi Architecture GPUs, the latest product of NVidia, for a better understanding of Fermi's new features, such as the double precision support and improved memory hierarchy. Then we make a comparison between the strategies using the shared memory, the traditional optimization method on GPUs, and using L1 cache. Next, the paper provides insights into the disparity of these two strategies. We demonstrate that parallel computations only using L1 cache can reach the similar or even better performance as the traditional optimization method using the shared memory does when the dataset is not too large or the frequency of repeated use of the related data is low.

Ying Zhao | Lingjie Zhang | Kaixi Hou | Jiumei Huang

[1] David A. Bader,et al. A novel FDTD application featuring OpenMP-MPI hybrid parallelization , 2004, International Conference on Parallel Processing, 2004. ICPP 2004..

[2] Philippas Tsigas,et al. The Synchronization Power of Coalesced Memory Accesses , 2010, IEEE Transactions on Parallel and Distributed Systems.

[3] Yu Tian,et al. Analysis of the Electromagnetic Characteristics of Coplanar Waveguide by FDTD Method , 2009, 2009 IEEE Circuits and Systems International Conference on Testing and Diagnosis.

[4] Philippas Tsigas,et al. The Synchronization Power of Coalesced Memory Accesses , 2010, IEEE Trans. Parallel Distributed Syst..

[5] Kevin Skadron,et al. Scalable parallel programming , 2008, 2008 IEEE Hot Chips 20 Symposium (HCS).

[6] Allen Taflove,et al. Computational Electrodynamics the Finite-Difference Time-Domain Method , 1995 .

[7] W. Yu,et al. Electromagnetic Simulation Techniques Based on the Fdtd Method , 2009 .

[8] Hyesoon Kim,et al. An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness , 2009, ISCA '09.

[9] K. Yee. Numerical solution of initial boundary value problems involving maxwell's equations in isotropic media , 1966 .

[10] David A. Bader,et al. A novel FDTD application featuring OpenMP-MPI hybrid parallelization , 2004 .