Auto-tuning Dense Matrix Multiplication for GPGPU with Cache
暂无分享,去创建一个
Yifeng Chen | Changyou Zhang | Hong Mei | Xiang Cui | Hong Mei | Changyou Zhang | Yifeng Chen | Xiang Cui
[1] Yifeng Chen,et al. Improving Performance of Matrix Multiplication and FFT on GPU , 2009, 2009 15th International Conference on Parallel and Distributed Systems.
[2] James Demmel,et al. Benchmarking GPUs to tune dense linear algebra , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[3] Jack J. Dongarra,et al. A Note on Auto-tuning GEMM for GPUs , 2009, ICCS.