Enabling the UCD-SPH code on the Xeon Phi

This white paper reports on our efforts to enable a Fortran code based on smoothed particle hydrodynamics (SPH) on the Intel Xeon Phi. As a result of the work described here, the two most computationally intensive subroutines of the UCD-SPH code (rates and shepard_beta) were refactored and parallelised with OpenMP for the first time, enabling the code to run on multi-core and many-core shared-memory systems. This parallelisation achieved speedups of up to 4.3x for the rates subroutine and 6.0x for the shepard_beta subroutine, giving overall speedups of up to 4.2x on a two-processor Sandy Bridge Xeon E5 machine. The code was subsequently enabled and refactored to execute in different modes on the Intel Xeon Phi co-processor, achieving speedups of up to 2.8x for the rates subroutine and up to 3.8x for the shepard_beta subroutine, and overall speedups of up to 2.7x compared to the original unoptimised code. To explore the capabilities of auto-vectorisation, the shepard_beta subroutine was also refactored, which resulted in speedups of up to 6.4x relative to its original unoptimised version. The development and testing phases of the project were carried out on the PRACE EURORA machine.
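The kind of loop-level OpenMP parallelisation summarised above can be illustrated with a minimal sketch. It is written in C rather than Fortran and is not the UCD-SPH shepard_beta routine: the 1D hat kernel, the all-pairs neighbour loop, and the names `hat_kernel` and `shepard_sums` are assumptions made purely for illustration. The outer particle loop is shared among threads, and the inner accumulation is marked as a candidate for vectorisation.

```c
#include <assert.h>
#include <math.h>
#include <stddef.h>

/* Hypothetical 1D "hat" kernel with compact support [-h, h];
 * it integrates to 1 over its support. */
static double hat_kernel(double r, double h)
{
    double q = fabs(r) / h;
    return q < 1.0 ? (1.0 - q) / h : 0.0;
}

/* Shepard-style normalisation sums (illustrative, all-pairs search):
 *   s[i] = sum_j vol[j] * W(x[i] - x[j], h)
 * Each iteration of the outer loop writes only s[i], so the iterations
 * are independent and can be distributed across threads. */
void shepard_sums(size_t n, const double *x, const double *vol,
                  double h, double *s)
{
    assert(h > 0.0);

    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n; ++i) {
        double sum = 0.0;

        /* Ask the compiler to vectorise the reduction over neighbours. */
        #pragma omp simd reduction(+:sum)
        for (long j = 0; j < (long)n; ++j) {
            sum += vol[j] * hat_kernel(x[i] - x[j], h);
        }
        s[i] = sum;
    }
}
```

With GCC, building with `-fopenmp` enables the pragmas; without it they are ignored and the routine runs serially with the same result. For evenly spaced particles the sum is close to 1 in the interior and drops near the boundary, where the kernel support is truncated; that deficit is what a Shepard-type correction compensates for.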

Denys Dutykh | Frédéric Dias | Christian Lalanne | Michael Lysaght | Ashkan Rafiee
