A decomposition approach for optimizing the performance of MPI libraries
暂无分享,去创建一个
[1] Robert A. van de Geijn,et al. On optimizing collective communication , 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).
[2] Sathish S. Vadhiyar,et al. Automatically Tuned Collective Communications , 2000, ACM/IEEE SC 2000 Conference (SC'00).
[3] Jack J. Dongarra,et al. Performance analysis of MPI collective operations , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.
[4] Qing Huang,et al. A Comparison of MPICH Allgather Algorithms on Switched Networks , 2003, PVM/MPI.
[5] William Gropp,et al. Users guide for mpich, a portable implementation of MPI , 1996 .
[6] Jack Dongarra,et al. Performance Modeling for Self Adapting Collective Communications for MPI , 2001 .
[7] P. J. van der Houwen,et al. Parallel Adams methods , 1999 .
[8] Thomas Rauber,et al. Execution Schemes for Parallel Adams Methods , 2004, Euro-Par.