Optimizing MPI Alltoall Communication of Large Messages in Multicore Clusters
暂无分享,去创建一个
[1] Xin Yuan,et al. Automatic generation and tuning of MPI collective communication routines , 2005, ICS '05.
[2] Paul D. Coddington. Analysis of Algorithm Selection for Optimizing Collective Communication with MPICH for Ethernet and Myrinet Networks , 2007, Eighth International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2007).
[3] Paul D. Coddington,et al. Analysis of Algorithm Selection for Optimizing Collective Communication with MPICH for Ethernet and Myrinet Networks , 2007 .
[4] Dhabaleswar K. Panda,et al. High Performance RDMA-Based MPI Implementation over InfiniBand , 2003, ICS '03.
[5] Sathish S. Vadhiyar,et al. Automatically Tuned Collective Communications , 2000, ACM/IEEE SC 2000 Conference (SC'00).
[6] Dhabaleswar K. Panda,et al. Scalable and high performance collective communication for next generation multicore infiniband clusters , 2008 .
[7] Sayantan Sur,et al. Can memory-less network adapters benefit next-generation infiniband systems? , 2005, 13th Symposium on High Performance Interconnects (HOTI'05).
[8] Dhabaleswar K. Panda,et al. High performance RDMA-based MPI implementation over InfiniBand , 2003, ICS.
[9] F. Petrini,et al. The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q , 2003, ACM/IEEE SC 2003 Conference (SC'03).