Transforming the multifluid PPM algorithm to run on GPUs
暂无分享,去创建一个
[1] Hee-Seok Kim,et al. Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures , 2015, 2015 IEEE/ACM International Symposium on Code Generation and Optimization (CGO).
[2] Paul R. Woodward,et al. A Study of the Performance of Multifluid PPM Gas Dynamics on CPUs and GPUs , 2011, 2011 Symposium on Application Accelerators in High-Performance Computing.
[3] Paul R. Woodward,et al. Simulating Rayleigh-Taylor (RT) instability using PPM hydrodynamics @scale on Roadrunner (u) , 2011 .
[4] Satoshi Matsuoka,et al. An 80-Fold Speedup, 15.0 TFlops Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.
[5] Paul R. Woodward,et al. mPPM, Viewed as a Co-Design Effort , 2014, 2014 Hardware-Software Co-Design for High Performance Computing.
[6] Paul R. Woodward,et al. A Study of Performance Portability Using Piecewise-Parabolic Method (PPM) Gas Dynamics Applications , 2012, ICCS.
[7] Apan Qasem,et al. Automatic Restructuring of GPU Kernels for Exploiting Inter-thread Data Locality , 2012, CC.
[8] Tobias Gysi,et al. Towards a performance portable, architecture agnostic implementation strategy for weather and climate models , 2014, Supercomput. Front. Innov..
[9] Michael R. Knox,et al. GLOBAL NON-SPHERICAL OSCILLATIONS IN THREE-DIMENSIONAL 4π SIMULATIONS OF THE H-INGESTION FLASH , 2013, 1310.4584.
[10] Adrian Sandu,et al. Automatic Generation of Multicore Chemical Kernels , 2011, IEEE Transactions on Parallel and Distributed Systems.
[11] Naoyuki Onodera,et al. High-Productivity Framework on GPU-Rich Supercomputers for Operational Weather Prediction Code ASUCA , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.
[12] Paul R. Woodward,et al. CFD Builder: A Library Builder for Computational Fluid Dynamics , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.
[13] Bormin Huang,et al. Improved GPU/CUDA Based Parallel Weather and Research Forecast (WRF) Single Moment 5-Class (WSM5) Cloud Microphysics , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.
[14] Alexander Aiken,et al. Singe: leveraging warp specialization for high performance on GPUs , 2014, PPoPP '14.
[15] Paul R. Woodward,et al. Simulating Turbulent Mixing from Richtmyer-Meshkov and Rayleigh-Taylor Instabilities in Converging Geometries using Moving Cartesian Grids , 2013 .
[16] Liwen Chang,et al. Optimization and architecture effects on GPU computing workload performance , 2012, 2012 Innovative Parallel Computing (InPar).
[17] Rolf Krause,et al. A stencil-based implementation of Parareal in the C++ domain specific embedded language STELLA , 2014, Appl. Math. Comput..
[18] Manish Vachharajani,et al. GPU acceleration of numerical weather prediction , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.