accULL: An OpenACC Implementation with CUDA and OpenCL Support
暂无分享,去创建一个
Francisco de Sande | Ruymán Reyes | Iván López-Rodríguez | Juan J. Fumero | Ruymán Reyes | J. Fumero | F. D. Sande | I. López-Rodríguez
[1] Jesper Larsson Träff,et al. Euro-Par 2010 Parallel Processing Workshops - HeteroPar, HPCC, HiBB, CoreGrid, UCHPC, HPCF, PROPER, CCPI, VHPC, Ischia, Italy, August 31-September 3, 2010, Revised Selected Papers , 2011, Euro-Par Workshops.
[2] Kevin Skadron,et al. A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads , 2010, IEEE International Symposium on Workload Characterization (IISWC'10).
[3] Francisco de Sande,et al. Optimization strategies in different CUDA architectures using llCoMP , 2012, Microprocess. Microsystems.
[4] François Bodin,et al. Heterogeneous multicore parallel programming for graphics processing units , 2009 .
[5] Michael Wolfe,et al. Implementing the PGI Accelerator model , 2010, GPGPU-3.
[6] François Bodin,et al. Heterogeneous multicore parallel programming for graphics processing units , 2009, Sci. Program..
[7] Bernd Mohr,et al. Guided Performance Analysis Combining Profile and Trace Tools , 2010, Euro-Par Workshops.