An Investigation into the Performance of Reduction Algorithms under Load Imbalance
暂无分享,去创建一个
[1] Rajeev Thakur,et al. Optimization of Collective Communication Operations in MPICH , 2005, Int. J. High Perform. Comput. Appl..
[2] Jack Dongarra,et al. Recent Advances in Parallel Virtual Machine and Message Passing Interface, 15th European PVM/MPI Users' Group Meeting, Dublin, Ireland, September 7-10, 2008. Proceedings , 2008, PVM/MPI.
[3] Rolf Rabenseifner,et al. Optimization of Collective Reduction Operations , 2004, International Conference on Computational Science.
[4] George Karypis,et al. Introduction to Parallel Computing , 1994 .
[5] Torsten Hoefler,et al. Accurately measuring overhead, communication time and progression of blocking and nonblocking collective operations at massive scale , 2010, Int. J. Parallel Emergent Distributed Syst..
[6] Jesper Larsson Träff,et al. More Efficient Reduction Algorithms for Non-Power-of-Two Number of Processors in Message-Passing Parallel Systems , 2004, PVM/MPI.
[7] Stephen Gilmore,et al. Evaluating the Performance of Skeleton-Based High Level Parallel Programs , 2004, International Conference on Computational Science.
[8] Werner Augustin,et al. On Benchmarking Collective MPI Operations , 2002, PVM/MPI.