Comparison of MPI benchmark programs on an SGI Altix ccNUMA shared memory machine
暂无分享,去创建一个
The results produced by five different MPI benchmark programs on an SGI Altix 3700 are analyzed and compared. There are significant differences in the results for some MPI operations. We investigate the reasons for these discrepancies, which are due to differences in the measurement techniques, implementation details and default configurations of the different benchmarks. The variation in results on the Altix are generally much greater than on a distributed memory machine, due primarily to the ccNUMA architecture and the importance of cache effects, as well as some implementation details of the SGI MPI libraries
[1] Werner Augustin,et al. On Benchmarking Collective MPI Operations , 2002, PVM/MPI.
[2] Ralf H. Reussner,et al. SKaMPI: A Detailed, Accurate MPI Benchmark , 1998, PVM/MPI.
[3] William Gropp,et al. Reproducible Measurements of MPI Performance Characteristics , 1999, PVM/MPI.
[4] Hermann Mierendorff,et al. Working with MPI Benchmarking Suites on ccNUMA Architectures , 2000, PVM/MPI.
[5] Duncan A. Grove,et al. Precise MPI Performance Measurement Using MPIBench , 2001 .