Performance analysis of OpenMP on a GPU using a CORAL proxy application
暂无分享,去创建一个
Kevin O'Brien | Zehra Sura | Tong Chen | Carlo Bertolli | Alexandre E. Eichenberger | Arpith C. Jacob | Gheorghe-Teodor Bercea | Georgios Rokos | Hyojin Sung | Samuel Antão | David Appelhans | S. Antão | A. Eichenberger | Hyojin Sung | Gheorghe-Teodor Bercea | A. Jacob | Zehra Sura | Tong Chen | C. Bertolli | K. O'Brien | G. Rokos | D. Appelhans
[1] Martin Schulz,et al. Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.
[2] Kevin O'Brien,et al. Integrating GPU support for OpenMP offloading directives into Clang , 2015, LLVM '15.
[3] Yi Yang,et al. CUDA-NP: Realizing Nested Thread-Level Parallelism in GPGPU Applications , 2015, Journal of Computer Science and Technology.
[4] Kevin O'Brien,et al. Coordinating GPU Threads for OpenMP 4.0 in LLVM , 2014, 2014 LLVM Compiler Infrastructure in HPC.
[5] Ian Karlin,et al. LULESH Programming Model and Performance Ports Overview , 2012 .
[6] Eduard Ayguadé,et al. On the Roles of the Programmer, the Compiler and the Runtime System When Programming Accelerators in OpenMP , 2014, IWOMP.