Communication characteristics of large-scale scientific applications for contemporary cluster architectures

[1]  A.M. Wissink,et al.  Large Scale Parallel Structured AMR Calculations Using the SAMRAI Framework , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[2]  José E. Moreira,et al.  Demonstrating the scalability of a molecular dynamics application on a Petaflop computer , 2001, ICS '01.

[3]  John M. May,et al.  MPX: Software for multiplexing hardware performance counters in multithreaded programs , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[4]  William Gropp,et al.  Performance Modeling and Tuning of an Unstructured Mesh CFD Application , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[5]  Franck Cappello,et al.  MPI versus MPI+OpenMP on the IBM SP for the NAS Benchmarks , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[6]  Fabrizio Petrini,et al.  A general predictive performance model for wavefront algorithms on clusters of SMPs , 2000, Proceedings 2000 International Conference on Parallel Processing.

[7]  Patrick H. Worley,et al.  Performance evaluation of the IBM SP and the Compaq AlphaServer SC , 2000, ICS '00.

[8]  Richard E. Kessler,et al.  Performance analysis of the Alpha 21264-based Compaq ES40 system , 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).

[9]  Leonid Oliker,et al.  A Comparison of Three Programming Models for Adaptive Applications on the Origin2000 , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[10]  Robert D. Falgout,et al.  Semicoarsening Multigrid on Distributed Memory Machines , 1999, SIAM J. Sci. Comput..

[11]  Anthony Skjellum,et al.  Using MPI: portable parallel programming with the message-passing interface, 2nd Edition , 1999, Scientific and engineering computation series.

[12]  B. C. Curtis,et al.  Very High Resolution Simulation of Compressible Turbulence on the IBM-SP System , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[13]  Remzi H. Arpaci-Dusseau,et al.  Architectural Requirements and Scalability of the NAS Parallel Benchmarks , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[14]  Anoop Gupta,et al.  Parallel computer architecture - a hardware / software approach , 1998 .

[15]  Allen D. Malony,et al.  Portable profiling and tracing for parallel, scientific applications using C++ , 1998, SPDT '98.

[16]  B. Bershad,et al.  Execution characteristics of desktop applications on Windows NT , 1998, Proceedings. 25th Annual International Symposium on Computer Architecture (Cat. No.98CB36235).

[17]  Jesús Labarta,et al.  DiP: A Parallel Program Development Environment , 1996, Euro-Par, Vol. II.

[18]  Jack Dongarra,et al.  MPI: The Complete Reference , 1996 .

[19]  K. M. Decker,et al.  HPF and MPI Implementation of the NAS Parallel Benchmarks Supported by Integrated Program Engineering Tools , 1996 .

[20]  J. Singh,et al.  The SPLASH-2 programs: characterization and methodological considerations , 1995, Proceedings 22nd Annual International Symposium on Computer Architecture.

[21]  George Karypis,et al.  Introduction to Parallel Computing , 1994 .

[22]  Anthony Skjellum,et al.  Using MPI - portable parallel programming with the message-parsing interface , 1994 .

[23]  Paul Messina,et al.  Architectural Requirements Of Parallel Scientific Applications With Explicit Communication , 1993, Proceedings of the 20th Annual International Symposium on Computer Architecture.

[24]  David H. Bailey,et al.  NAS parallel benchmark results , 1993, IEEE Parallel & Distributed Technology: Systems & Applications.

[25]  M.,et al.  An Overview of the Pablo Performance Analysis , 1992 .

[26]  Robert A. van de Geijn,et al.  LAPACK for Distributed Memory Architectures: Progress Report , 1991, SIAM Conference on Parallel Processing for Scientific Computing.

[27]  G. A. Geist,et al.  A user's guide to PICL a portable instrumented communication library , 1990 .