An Implementation of Parallel 3-D FFT with 2-D Decomposition on a Massively Parallel Cluster of Multi-core Processors
暂无分享,去创建一个
[1] Ramesh C. Agarwal,et al. An efficient parallel algorithm for the 3-D FFT NAS parallel benchmark , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.
[2] Steven G. Johnson,et al. The Design and Implementation of FFTW3 , 2005, Proceedings of the IEEE.
[3] Bin Fang,et al. Performance of the 3D FFT on the 6D network torus QCDOC parallel supercomputer , 2007, Comput. Phys. Commun..
[4] J. Tukey,et al. An algorithm for the machine calculation of complex Fourier series , 1965 .
[5] Andy Brass,et al. Two and three dimensional FFTs on highly parallel computers , 1986, Parallel Comput..
[6] Daisuke Takahashi. Efficient implementation of parallel three-dimensional FFT on clusters of PCs , 2003 .
[7] C. Loan. Computational Frameworks for the Fast Fourier Transform , 1992 .
[8] Daisuke Takahashi. A Hybrid MPI/OpenMP Implementation of a Parallel 3-D FFT on SMP Clusters , 2005, PPAM.
[9] Robert S. Germain,et al. Scalable framework for 3D FFTs on the Blue Gene/L supercomputer: Implementation and early performance measurements , 2005, IBM J. Res. Dev..