OpenMP as a High-Level Specification Language for Parallelism - And its use in Evaluating Parallel Programming Systems
[1] Vivek Sarkar, et al. Habanero-Java: the new adventures of old X10, 2011, PPPJ.
[2] Nathan R. Tallent, et al. HPCTOOLKIT: tools for performance analysis of optimized parallel programs, 2010, Concurr. Comput. Pract. Exp.
[3] Richard D. Hornung, et al. The RAJA Portability Layer: Overview and Status, 2014.
[4] Vivek Sarkar, et al. X10: an object-oriented approach to non-uniform cluster computing, 2005, OOPSLA '05.
[5] Robert Dietrich, et al. OMPT: An OpenMP Tools Application Programming Interface for Performance Analysis, 2013, IWOMP.
[6] Vivek Sarkar, et al. Integrating Asynchronous Task Parallelism with MPI, 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.
[7] Rudolf Eigenmann, et al. OpenMP to GPGPU: a compiler framework for automatic translation and optimization, 2009, PPoPP '09.
[8] Tamara G. Kolda, et al. An overview of the Trilinos project, 2005, TOMS.
[9] Bradley C. Kuszmaul, et al. Cilk: an efficient multithreaded runtime system, 1995, PPoPP '95.
[10] James Reinders, et al. Intel threading building blocks - outfitting C++ for multi-core processor parallelism, 2007.
[11] J. Ramanujam, et al. Automatic C-to-CUDA Code Generation for Affine Programs, 2010, CC.
[12] Hiroki Honda, et al. OMPCUDA: OpenMP Execution Framework for CUDA Based on Omni OpenMP Compiler, 2010, IWOMP.
[13] Daniel Sunderland, et al. Kokkos: Enabling manycore performance portability through polymorphic memory access patterns, 2014, J. Parallel Distributed Comput.
[14] Anne Marsden, et al. International Organization for Standardization, 2014.