Impacts of Multi-GPU MPI Collective Communications on Large FFT Computation
暂无分享,去创建一个
Stanimire Tomov | Azzam Haidar | Jack Dongarra | George Bosilca | Xi Luo | Alan Ayala | Hejer Shaeik | J. Dongarra | G. Bosilca | A. Haidar | S. Tomov | Alan Ayala | Xi Luo | Hejer Shaeik
[1] Jack Dongarra,et al. GPUDirect MPI Communications and Optimizations to Accelerate FFTs on Exascale Systems , 2019 .
[2] Jack J. Dongarra,et al. Towards dense linear algebra for hybrid GPU accelerated manycore systems , 2009, Parallel Comput..
[3] Hal Finkel,et al. HACC , 2016, Commun. ACM.
[4] Mei Han An,et al. accuracy and stability of numerical algorithms , 1991 .
[5] James Demmel,et al. Communication-avoiding algorithms for linear algebra and beyond , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.
[6] Jack Dongarra,et al. Evaluation and Design of FFT for Distributed Accelerated Systems , 2018 .
[7] Steven G. Johnson,et al. The Design and Implementation of FFTW3 , 2005, Proceedings of the IEEE.
[8] Jack Dongarra,et al. Design and Implementation for FFT-ECP on Distributed Accelerated Systems , 2019 .
[9] J. Dongarra,et al. ECP Milestone Report FFT-ECP Implementation Optimizations and Features Phase WBS 2 . 3 . 3 . 09 , Milestone FFT-ECP ST-MS-10-1440 Stanimire , 2019 .