Transforming the multifluid PPM algorithm to run on GPUs

[1]  Hee-Seok Kim,et al.  Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures , 2015, 2015 IEEE/ACM International Symposium on Code Generation and Optimization (CGO).

[2]  Paul R. Woodward,et al.  A Study of the Performance of Multifluid PPM Gas Dynamics on CPUs and GPUs , 2011, 2011 Symposium on Application Accelerators in High-Performance Computing.

[3]  Paul R. Woodward,et al.  Simulating Rayleigh-Taylor (RT) instability using PPM hydrodynamics @scale on Roadrunner (u) , 2011 .

[4]  Satoshi Matsuoka,et al.  An 80-Fold Speedup, 15.0 TFlops Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[5]  Paul R. Woodward,et al.  mPPM, Viewed as a Co-Design Effort , 2014, 2014 Hardware-Software Co-Design for High Performance Computing.

[6]  Paul R. Woodward,et al.  A Study of Performance Portability Using Piecewise-Parabolic Method (PPM) Gas Dynamics Applications , 2012, ICCS.

[7]  Apan Qasem,et al.  Automatic Restructuring of GPU Kernels for Exploiting Inter-thread Data Locality , 2012, CC.

[8]  Tobias Gysi,et al.  Towards a performance portable, architecture agnostic implementation strategy for weather and climate models , 2014, Supercomput. Front. Innov..

[9]  Michael R. Knox,et al.  GLOBAL NON-SPHERICAL OSCILLATIONS IN THREE-DIMENSIONAL 4π SIMULATIONS OF THE H-INGESTION FLASH , 2013, 1310.4584.

[10]  Adrian Sandu,et al.  Automatic Generation of Multicore Chemical Kernels , 2011, IEEE Transactions on Parallel and Distributed Systems.

[11]  Naoyuki Onodera,et al.  High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[12]  Paul R. Woodward,et al.  CFD Builder: A Library Builder for Computational Fluid Dynamics , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[13]  Bormin Huang,et al.  Improved GPU/CUDA Based Parallel Weather and Research Forecast (WRF) Single Moment 5-Class (WSM5) Cloud Microphysics , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[14]  Alexander Aiken,et al.  Singe: leveraging warp specialization for high performance on GPUs , 2014, PPoPP '14.

[15]  Paul R. Woodward,et al.  Simulating Turbulent Mixing from Richtmyer-Meshkov and Rayleigh-Taylor Instabilities in Converging Geometries using Moving Cartesian Grids , 2013 .

[16]  Liwen Chang,et al.  Optimization and architecture effects on GPU computing workload performance , 2012, 2012 Innovative Parallel Computing (InPar).

[17]  Rolf Krause,et al.  A stencil-based implementation of Parareal in the C++ domain specific embedded language STELLA , 2014, Appl. Math. Comput..

[18]  Manish Vachharajani,et al.  GPU acceleration of numerical weather prediction , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.