Paper information - PGAS implementation of SpMVM and LBM using GPI

GPI is a PGAS-model-based library that aims to provide low-latency, highly efficient communication routines for large-scale systems. We compare and analyse the performance of two algorithms implemented with both GPI and MPI: a sparse matrix-vector multiplication (SpMVM) and a fluid flow solver based on a lattice Boltzmann method (LBM). Both algorithms are purely memory-bound on a single node, whereas at large scale the communication between processes becomes more significant. GPI is, in principle, fully capable of performing communication alongside computation, and both algorithms are modified to leverage this feature. In addition to the naïve approach with blocking calls in MPI, the algorithms are also evaluated using non-blocking calls and explicit asynchronous progress via an external library. We conclude that the GPI implementations handle non-blocking asynchronous communication very effectively and thereby hide communication costs.
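The idea of overlapping communication with computation in SpMVM can be illustrated with a small sketch. This is not the paper's GPI or MPI code: it assumes a matrix in compressed row storage (CRS) split into a local part (columns owned by this process) and a remote part (columns owned elsewhere), and it uses a Python background thread as a stand-in for an asynchronous transfer. All function and variable names (`spmv_crs`, `fetch_halo`, `overlapped_spmv`) are hypothetical, and the thread only shows the control flow; it does not demonstrate real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def spmv_crs(row_ptr, col_idx, val, x, y):
    """Accumulate y += A @ x for a matrix A stored in CRS format."""
    for i in range(len(row_ptr) - 1):
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += val[k] * x[col_idx[k]]
        y[i] += acc
    return y


def fetch_halo(remote_x, needed):
    """Stand-in for communication: gather the remote vector entries we need."""
    return [remote_x[j] for j in needed]


def overlapped_spmv(local_part, remote_part, x_local, remote_x, needed):
    """Compute y = A_local @ x_local + A_remote @ halo with overlap.

    local_part and remote_part are (row_ptr, col_idx, val) tuples; the
    column indices of remote_part refer to positions in the fetched halo.
    """
    n_rows = len(local_part[0]) - 1
    y = [0.0] * n_rows
    with ThreadPoolExecutor(max_workers=1) as pool:
        # Start the (simulated) transfer of remote vector entries.
        halo_future = pool.submit(fetch_halo, remote_x, needed)
        # Do the local part of the product while the transfer is in flight.
        spmv_crs(*local_part, x_local, y)
        # Block only when the remote entries are actually required.
        halo = halo_future.result()
    # Finish with the part of the product that needs remote entries.
    spmv_crs(*remote_part, halo, y)
    return y


# Two rows owned by this process: A = [[2, 0 | 1, 0], [0, 3 | 0, 4]]
local_part = ([0, 1, 2], [0, 1], [2.0, 3.0])
remote_part = ([0, 1, 2], [0, 1], [1.0, 4.0])
y = overlapped_spmv(local_part, remote_part, [1.0, 2.0], [10.0, 20.0, 30.0], [0, 2])
```

In a blocking variant, the halo would be fetched completely before any computation starts. The split into local and remote parts is what lets the local work proceed while the transfer is outstanding.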
