CUDA-enabled Sparse Matrix-Vector Multiplication on GPUs using atomic operations
[1] Endong Wang, et al. Intel Math Kernel Library, 2014.
[2] Guillaume Caumon, et al. Concurrent number cruncher: a GPU implementation of a general sparse linear solver, 2009, Int. J. Parallel Emergent Distributed Syst.
[3] Ester M. Garzón, et al. Improving the Performance of the Sparse Matrix Vector Product with GPUs, 2010, 2010 10th IEEE International Conference on Computer and Information Technology.
[4] Kiran Kumar Matam, et al. Accelerating Sparse Matrix Vector Multiplication in Iterative Methods Using GPU, 2011, 2011 International Conference on Parallel Processing.
[5] Kevin Skadron, et al. Scalable parallel programming, 2008, 2008 IEEE Hot Chips 20 Symposium (HCS).
[6] Michael Garland, et al. Implementing sparse matrix-vector multiplication on throughput-oriented processors, 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[7] Richard W. Vuduc, et al. Model-driven autotuning of sparse matrix-vector multiply on GPUs, 2010, PPoPP '10.
[8] Yousef Saad, et al. Iterative methods for sparse linear systems, 2003.
[9] Andrew S. Grimshaw, et al. Scalable GPU graph traversal, 2012, PPoPP '12.
[10] Arutyun Avetisyan, et al. Automatically Tuning Sparse Matrix-Vector Multiplication for GPU Architectures, 2010, HiPEAC.
[11] Gerhard Wellein, et al. Sparse Matrix-vector Multiplication on GPGPU Clusters: A New Storage Format and a Scalable Implementation, 2011, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.
[12] Rajesh Bordawekar, et al. Optimizing Sparse Matrix-Vector Multiplication on GPUs, 2009.
[13] John G. Lewis, et al. Sparse matrix test problems, 1982, SGNM.
[14] Bertil Schmidt, et al. The Sliced COO Format for Sparse Matrix-Vector Multiplication on CUDA-enabled GPUs, 2012, ICCS.
[15] Bertil Schmidt, et al. Iterative Sparse Matrix-Vector Multiplication for Integer Factorization on GPUs, 2011, Euro-Par.