Summary form only given. We describe a new suite of computational benchmarks that models applications featuring multiple levels of parallelism. Such parallelism is often available in realistic flow computations on systems of meshes, but had not previously been captured in benchmarks. The new suite, named NPB (NAS Parallel Benchmarks) Multi-Zone, extends the NPB suite and involves solving the application benchmarks LU, BT, and SP on collections of loosely coupled discretization meshes. The solutions on the meshes are updated independently, but after each time step they exchange boundary value information. This strategy provides coarse-grain parallelism between meshes that is relatively easy to exploit. Three reference implementations are available: one serial, one hybrid using the Message Passing Interface (MPI) and OpenMP, and another hybrid using a shared-memory multilevel programming model (SMP+OpenMP). We examine the effectiveness of hybrid parallelization paradigms in these implementations on three different parallel computers. We also use an empirical formula to investigate the performance characteristics of the hybrid parallel codes.
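The update-then-exchange strategy described above can be illustrated with a minimal sketch. This is not the NPB Multi-Zone reference code: the 1-D zones, the three-point smoothing "solver", and the names `update_zone`, `exchange_boundaries`, and `run` are all hypothetical choices made for illustration. A thread pool stands in for the coarse-grain level of parallelism between zones; in CPython it shows the structure only and should not be expected to give a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def update_zone(zone):
    """Advance one zone by a time step (hypothetical 3-point smoothing).

    The first and last entries are ghost cells holding boundary values
    received from neighboring zones; only interior cells are updated.
    """
    new = zone[:]
    for i in range(1, len(zone) - 1):
        new[i] = (zone[i - 1] + zone[i] + zone[i + 1]) / 3.0
    return new


def exchange_boundaries(zones):
    """Copy each zone's outermost interior value into its neighbor's ghost cell."""
    for left, right in zip(zones, zones[1:]):
        left[-1] = right[1]   # left zone's right ghost cell
        right[0] = left[-2]   # right zone's left ghost cell


def run(zones, steps, workers=2):
    """Time-step all zones: independent updates, then a boundary exchange."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for _ in range(steps):
            # Coarse-grain level: zones are updated independently of each other.
            zones = list(pool.map(update_zone, zones))
            # Coupling happens only here, once per time step.
            exchange_boundaries(zones)
    return zones
```

For example, `run([[0.0] * 6, [3.0] * 6], steps=1)` updates the two zones independently and then copies the neighboring edge values into the ghost cells. A fine-grain level (loop parallelism inside `update_zone`) would correspond to the OpenMP level of the hybrid implementations.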