Benchmarking SpMV Methods on Many-Core Platforms
暂无分享,去创建一个
Zhen Jia | Yungang Bao | Biwei Xie | Yungang Bao | Zhen Jia | Biwei Xie
[1] Yun Liang,et al. Optimizing and auto-tuning scale-free sparse matrix-vector multiplication on Intel Xeon Phi , 2015, 2015 IEEE/ACM International Symposium on Code Generation and Optimization (CGO).
[2] Yen-Chen Liu,et al. Knights Landing: Second-Generation Intel Xeon Phi Product , 2016, IEEE Micro.
[3] Endong Wang,et al. Intel Math Kernel Library , 2014 .
[4] J. Ramanujam,et al. Distributed memory code generation for mixed Irregular/Regular computations , 2015, PPoPP.
[5] Yuqing Zhu,et al. BigDataBench: A big data benchmark suite from internet services , 2014, 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA).
[6] Walid A. Abu-Sufah,et al. Auto-tuning of Sparse Matrix-Vector Multiplication on Graphics Processors , 2013, ISC.
[7] Srinivasan Parthasarathy,et al. Automatic Selection of Sparse Matrix Representation on GPUs , 2015, ICS.
[8] P. Sadayappan,et al. An efficient two-dimensional blocking strategy for sparse matrix-vector multiplication on GPUs , 2014, ICS '14.
[9] Joseph L. Greathouse,et al. Efficient Sparse Matrix-Vector Multiplication on GPUs Using the CSR Storage Format , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.
[10] Ninghui Sun,et al. SMAT: an input adaptive auto-tuner for sparse matrix-vector multiplication , 2013, PLDI.
[11] Chunjie Luo,et al. Characterizing data analysis workloads in data centers , 2013, 2013 IEEE International Symposium on Workload Characterization (IISWC).
[12] Xing Liu,et al. Efficient sparse matrix-vector multiplication on x86-based many-core processors , 2013, ICS '13.
[13] Michael Garland,et al. Merge-Based Parallel Sparse Matrix-Vector Multiplication , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.
[14] Xin Liu,et al. Towards Efficient SpMV on Sunway Manycore Architectures , 2018, ICS.
[15] Timothy A. Davis,et al. The university of Florida sparse matrix collection , 2011, TOMS.
[16] Shengen Yan,et al. yaSpMV: yet another SpMV framework on GPUs , 2014, PPoPP.
[17] ZhaoYue,et al. Bridging the gap between deep learning and sparse matrix format selection , 2018 .
[18] Michael Garland,et al. Implementing sparse matrix-vector multiplication on throughput-oriented processors , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[19] Mary W. Hall,et al. Loop and data transformations for sparse matrix code , 2015, PLDI.
[20] Nectarios Koziris,et al. A lightweight optimization selection method for Sparse Matrix-Vector Multiplication , 2015, ArXiv.
[21] A. Pinar,et al. Improving Performance of Sparse Matrix-Vector Multiplication , 1999, ACM/IEEE SC 1999 Conference (SC'99).
[22] Yue Zhao,et al. Bridging the gap between deep learning and sparse matrix format selection , 2018, PPoPP.
[23] John J. Cannon,et al. The Magma Algebra System I: The User Language , 1997, J. Symb. Comput..
[24] Zhen Jia,et al. CVR: efficient vectorization of SpMV on x86 processors , 2018, CGO.
[25] Brian Vinter,et al. CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication , 2015, ICS.
[26] Fabio Checconi,et al. Optimizing Sparse Matrix-Vector Multiplication for Large-Scale Data Analytics , 2016, ICS.
[27] Ran Ginosar,et al. Accelerator for Sparse Machine Learning , 2018, IEEE Computer Architecture Letters.