This paper presents a multi-GPU (Graphics Processing Unit) implementation of a 2D shallow water equations solver that can exploit the computational power of modern HPC clusters equipped with several GPUs on different nodes. The domain is discretized with a Block Uniform Quadtree (BUQ) grid, which allows variable resolution to be introduced efficiently in a GPU-accelerated finite volume code. In the present work the BUQ grid is decomposed into partitions, and each partition is assigned to a dedicated GPU. Communication between partitions is handled through the Message Passing Interface (MPI). Computation and communication are overlapped to reduce the overhead of the multi-GPU implementation. The strong scalability test shows that efficiency drops more slowly than linearly with the number of GPUs used in the simulation, and the weak scalability test shows that the network overhead caused by border communication can be completely masked by GPU computation.
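For readers unfamiliar with the two scalability tests, the quantities involved can be sketched as follows. This is a minimal illustration using the standard textbook definitions of strong and weak scaling efficiency, not the paper's own code; the function names and the simple `max(0, t_comm - t_interior)` overlap model are assumptions introduced here for illustration.

```python
def strong_scaling_efficiency(t1, tn, n):
    """Strong scaling: the total problem size is fixed.

    t1 is the runtime on one GPU, tn the runtime on n GPUs.
    The ideal runtime on n GPUs is t1 / n, so efficiency is t1 / (n * tn).
    """
    return t1 / (n * tn)


def weak_scaling_efficiency(t1, tn):
    """Weak scaling: the problem size per GPU is fixed.

    The ideal runtime stays equal to t1 as GPUs are added,
    so efficiency is t1 / tn.
    """
    return t1 / tn


def exposed_communication_time(t_comm, t_interior):
    """Assumed overlap model (hypothetical, for illustration only).

    If border exchange runs concurrently with the update of interior
    cells, only the part of the communication time that exceeds the
    interior computation time remains visible in the total runtime.
    """
    return max(0.0, t_comm - t_interior)
```

Under this simplified model, communication is fully masked whenever the interior computation on each GPU takes at least as long as the border exchange, which is the situation the weak scalability test describes.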