A New Approach to MPI Collective Communication Implementations
暂无分享,去创建一个
George Bosilca | Torsten Hoefler | Wolfgang Rehm | Jeffrey M. Squyres | Andrew Lumsdaine | Graham E. Fagg | A. Lumsdaine | G. Fagg | T. Hoefler | G. Bosilca | J. Squyres | W. Rehm
[1] Ramesh Subramonian,et al. LogP: towards a realistic model of parallel computation , 1993, PPOPP '93.
[2] George Bosilca,et al. Analysis of the Component Architecture Overhead in Open MPI , 2005, PVM/MPI.
[3] Chris J. Scheiman,et al. LogGP: incorporating long messages into the LogP model—one step closer towards a realistic model for parallel computation , 1995, SPAA '95.
[4] R. Rabenseifner,et al. Automatic MPI Counter Profiling of All Users: First Results on a CRAY T3E 900-512 , 2004 .
[5] Jack J. Dongarra,et al. Performance analysis of MPI collective operations , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.
[6] Robert A. van de Geijn,et al. On optimizing collective communication , 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).
[7] Greg Burns,et al. LAM: An Open Cluster Environment for MPI , 2002 .
[8] Lars Paul Huse. Collective Communication on Dedicated Clusters of Workstations , 1999, PVM/MPI.
[9] Jeffrey M. Squyres,et al. The Component Architecture of Open MPI: Enabling Third-Party Collective Algorithms* , 2005 .
[10] Jack Dongarra,et al. Extending the MPI Specification for Process Fault Tolerance on High Performance Computing Systems , 2004 .
[11] Anthony Skjellum,et al. A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..
[12] Robert A. van de Geijn,et al. Fast Collective Communication Libraries, Please , 1995 .
[13] Torsten Hoefler,et al. A practical approach to the rating of barrier algorithms using the LogP model and Open MPI , 2005, 2005 International Conference on Parallel Processing Workshops (ICPPW'05).
[14] George Bosilca,et al. Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation , 2004, PVM/MPI.
[15] Sathish S. Vadhiyar,et al. Automatically Tuned Collective Communications , 2000, ACM/IEEE SC 2000 Conference (SC'00).
[16] Thomas Rauber,et al. A decomposition approach for optimizing the performance of MPI libraries , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.