Performance analysis of MPI collective operations
Jack J. Dongarra | George Bosilca | Edgar Gabriel | Jelena Pjesivac-Grbovic | Graham E. Fagg | Thara Angskun
[1] Eli Upfal,et al. Efficient Algorithms for All-to-All Communications in Multiport Message-Passing Systems , 1997, IEEE Trans. Parallel Distributed Syst..
[2] Jason Duell,et al. An evaluation of current high-performance networks , 2003, Proceedings International Parallel and Distributed Processing Symposium.
[3] Roger W. Hockney,et al. The Communication Challenge for MPP: Intel Paragon and Meiko CS-2 , 1994, Parallel Computing.
[4] Sathish S. Vadhiyar,et al. Automatically Tuned Collective Communications , 2000, ACM/IEEE SC 2000 Conference (SC'00).
[5] Kees Verstoep,et al. Fast Measurement of LogP Parameters for Message Passing Platforms , 2000, IPDPS Workshops.
[6] William Gropp,et al. Reproducible Measurements of MPI Performance Characteristics , 1999, PVM/MPI.
[7] Jack Dongarra,et al. Recent Advances in Parallel Virtual Machine (PVM) and Message Passing Interface (MPI) - 10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29 - October 2, 2003, Proceedings , 2003 .
[8] Henri E. Bal,et al. MagPIe: MPI's collective communication operations for clustered wide area systems , 1999, PPoPP '99.
[9] Roger W. Hockney. The communication challenge for MPP , 1994 .
[10] Mario Lauria,et al. Efficient implementation of reduce-scatter in MPI , 2003, J. Syst. Archit..
[11] Anthony Skjellum,et al. A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..
[12] Richard P. Martin,et al. Assessing Fast Network Interfaces , 1996, IEEE Micro.
[13] Kees Verstoep,et al. Network performance-aware collective communication for clustered wide-area systems , 2001, Parallel Comput..
[14] Jack Dongarra,et al. Fault Tolerant Communication Library and Applications for High Performance Computing , 2003 .
[15] Rajeev Thakur,et al. Improving the Performance of Collective Operations in MPICH , 2003, PVM/MPI.
[16] Ramesh Subramonian,et al. LogP: towards a realistic model of parallel computation , 1993, PPOPP '93.
[17] Chris J. Scheiman,et al. LogGP: incorporating long messages into the LogP model—one step closer towards a realistic model for parallel computation , 1995, SPAA '95.
[18] R. Rabenseifner,et al. Automatic MPI Counter Profiling of All Users: First Results on a CRAY T3E 900-512 , 2004 .
[19] Robert A. van de Geijn,et al. On optimizing collective communication , 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).
[20] Jesper Larsson Träff,et al. More Efficient Reduction Algorithms for Non-Power-of-Two Number of Processors in Message-Passing Parallel Systems , 2004, PVM/MPI.
[21] Luiz Angelo Steffenel,et al. Fast Tuning of Intra-cluster Collective Communications , 2004, PVM/MPI.
[22] J. van Leeuwen,et al. Recent Advances in Parallel Virtual Machine and Message Passing Interface , 2002, Lecture Notes in Computer Science.
[23] George Bosilca,et al. Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation , 2004, PVM/MPI.
[24] Anthony Skjellum,et al. A high-performance, portable implementation of the MPI message passing interface standard , 1996 .