Bandwidth Efficient All-reduce Operation on Tree Topologies
[1] Rajeev Thakur, et al. Optimization of Collective Communication Operations in MPICH, 2005, Int. J. High Perform. Comput. Appl.
[2] Yves Robert, et al. Broadcast trees for heterogeneous platforms, 2005, 19th IEEE International Parallel and Distributed Processing Symposium.
[3] Philip Heidelberger, et al. Optimization of MPI collective communication on BlueGene/L systems, 2005, ICS '05.
[4] Dhabaleswar K. Panda, et al. Fast collective operations using shared and remote memory access protocols on clusters, 2003, Proceedings International Parallel and Distributed Processing Symposium.
[5] D. Panda, et al. Efficient Barrier and Allreduce on InfiniBand Clusters using Hardware Multicast and Adaptive Algorithms, 2004.
[6] William Gropp, et al. Users guide for mpich, a portable implementation of MPI, 1996.
[7] Alex Rapaport, et al. MPI-2: extensions to the message-passing interface, 1997.
[8] Rolf Rabenseifner, et al. Optimization of Collective Reduction Operations, 2004, International Conference on Computational Science.
[9] Xin Yuan, et al. Bandwidth Efficient All-to-All Broadcast on Switched Clusters, 2005, 2005 IEEE International Conference on Cluster Computing.
[10] John Markus Bjørndalen, et al. Extending collective operations with application semantics for improving multi-cluster performance, 2004, Third International Symposium on Parallel and Distributed Computing/Third International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks.
[11] Dhabaleswar K. Panda, et al. Efficient collective operations using remote memory operations on VIA-based clusters, 2003, Proceedings International Parallel and Distributed Processing Symposium.
[12] Amith R. Mamidala, et al. Efficient Barrier and Allreduce on Infiniband clusters using multicast and adaptive algorithms, 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).
[13] Rajeev Thakur, et al. Improving the Performance of Collective Operations in MPICH, 2003, PVM/MPI.