Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model
[1] Alejandro Duran, et al. The Design of OpenMP Tasks, 2009, IEEE Transactions on Parallel and Distributed Systems.
[2] Alan Gray, et al. Porting and scaling OpenACC applications on massively-parallel, GPU-accelerated supercomputers, 2012.
[3] Francisco de Sande, et al. Optimization strategies in different CUDA architectures using llCoMP, 2012, Microprocess. Microsystems.
[4] Wen-mei W. Hwu, et al. CUDA-Lite: Reducing GPU Programming Complexity, 2008, LCPC.
[5] Kevin Skadron, et al. Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs, 2009, ICS.
[6] Scott Klasky, et al. Terascale direct numerical simulations of turbulent combustion using S3D, 2008.
[7] Christian Terboven, et al. OpenACC - First Experiences with Real-World Applications, 2012, Euro-Par.
[8] Wenguang Chen, et al. OpenUH: an optimizing, portable OpenMP compiler, 2007, Concurr. Comput. Pract. Exp.
[9] Rudolf Eigenmann, et al. OpenMPC: Extended OpenMP Programming and Tuning for GPUs, 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.
[10] Barbara M. Chapman, et al. Experiences with High-Level Programming Directives for Porting Applications to GPUs, 2011, Facing the Multicore-Challenge.
[11] D. K. Arvind, et al. Languages and Compilers for Parallel Computing, 2014, Lecture Notes in Computer Science.
[12] Tarek S. Abdelrahman, et al. hiCUDA: a high-level directive-based language for GPU programming, 2009, GPGPU-2.
[13] Seyong Lee, et al. Early evaluation of directive-based GPU programming models for productive exascale computing, 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.
[14] Francisco de Sande, et al. accULL: An OpenACC Implementation with CUDA and OpenCL Support, 2012, Euro-Par.