Exploring the effect of block shapes on the performance of sparse kernels
Nectarios Koziris | Vasileios Karakasis | Georgios I. Goumas
[1] Katherine A. Yelick,et al. Optimizing Sparse Matrix Vector Multiplication on SMP , 1999, SIAM Conference on Parallel Processing for Scientific Computing.
[2] Richard Barrett,et al. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods , 1994, Other Titles in Applied Mathematics.
[3] Michael T. Heath,et al. Improving Performance of Sparse Matrix-Vector Multiplication , 1999, ACM/IEEE SC 1999 Conference (SC'99).
[4] Olivier Temam,et al. Characterizing the behavior of sparse algorithms on caches , 1992, Proceedings Supercomputing '92.
[5] E. Im,et al. Optimizing Sparse Matrix Vector Multiplication on SMP , 1999, PPSC.
[6] Nectarios Koziris,et al. Understanding the Performance of Sparse Matrix-Vector Multiplication , 2008, 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2008).
[7] David E. Keyes,et al. Towards Realistic Performance Bounds for Implicit CFD Codes , 2000 .
[8] David Moloney,et al. Streaming Sparse Matrix Compression/Decompression , 2005, HiPEAC.
[9] P. Sadayappan,et al. On improving the performance of sparse matrix-vector multiplication , 1997, Proceedings Fourth International Conference on High-Performance Computing.
[10] Nectarios Koziris,et al. Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication Using Index and Value Compression , 2008, 2008 37th International Conference on Parallel Processing.
[11] Katherine A. Yelick,et al. Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY , 2001, International Conference on Computational Science.
[12] Hyun Jin Moon,et al. Fast Sparse Matrix-Vector Multiplication by Exploiting Variable Block Structure , 2005, HPCC.
[13] Katherine Yelick,et al. OSKI: A library of automatically tuned sparse matrix kernels , 2005 .
[14] Samuel Williams,et al. Optimization of sparse matrix-vector multiplication on emerging multicore platforms , 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing (SC '07).
[15] Andrew Lumsdaine,et al. Accelerating sparse matrix computations via data compression , 2006, ICS '06.