A Practical Performance Model for Compute and Memory Bound GPU Kernels
暂无分享,去创建一个
[1] K. Srinathan,et al. A performance prediction model for the CUDA GPGPU platform , 2009, 2009 International Conference on High Performance Computing (HiPC).
[2] James Demmel,et al. Benchmarking GPUs to tune dense linear algebra , 2008, HiPC 2008.
[3] Yannis Cotronis,et al. Graphics processing unit acceleration of the red/black SOR method , 2013, Concurr. Comput. Pract. Exp..
[4] Samuel Williams,et al. Roofline: an insightful visual performance model for multicore architectures , 2009, CACM.
[5] Lin Ma,et al. A Memory Access Model for Highly-threaded Many-core Architectures , 2012, 2012 IEEE 18th International Conference on Parallel and Distributed Systems.
[6] Aaftab Munshi,et al. The OpenCL specification , 2009, 2009 IEEE Hot Chips 21 Symposium (HCS).
[7] William Gropp,et al. An adaptive performance modeling tool for GPU architectures , 2010, PPoPP '10.
[8] Yannis Cotronis,et al. A comparison of CPU and GPU implementations for solving the Convection Diffusion equation using the local Modified SOR method , 2014, Parallel Comput..
[9] Yiannis Cotronis,et al. Accelerating the Red/Black SOR Method Using GPUs with CUDA , 2011, PPAM.
[10] Yannis Cotronis,et al. A GPU Implementation for Solving the Convection Diffusion Equation Using the Local Modified SOR Method , 2014, Numerical Computations with GPUs.
[11] Hyesoon Kim,et al. An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness , 2009, ISCA '09.
[12] Jiayuan Meng,et al. Improving GPU Performance Prediction with Data Transfer Modeling , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.
[13] Tao Li,et al. Exploring GPGPU workloads: Characterization methodology, analysis and microarchitecture evaluation implications , 2010, IEEE International Symposium on Workload Characterization (IISWC'10).
[14] Yao Zhang,et al. A quantitative performance analysis model for GPU architectures , 2011, 2011 IEEE 17th International Symposium on High Performance Computer Architecture.
[15] Richard W. Vuduc,et al. A performance analysis framework for identifying potential benefits in GPGPU applications , 2012, PPoPP '12.
[16] Ali Karami,et al. A statistical performance prediction model for OpenCL kernels on NVIDIA GPUs , 2013, The 17th CSI International Symposium on Computer Architecture & Digital Systems (CADS 2013).
[17] Weiguo Liu,et al. Performance Predictions for General-Purpose Computation on GPUs , 2007, 2007 International Conference on Parallel Processing (ICPP 2007).