Cache Oblivious Dense and Sparse Matrix Multiplication Based on Peano Curves
[1] Fred G. Gustavson,et al. Recursion leads to automatic variable blocking for dense linear-algebra algorithms , 1997, IBM J. Res. Dev..
[2] Jack J. Dongarra,et al. Automated empirical optimizations of software and the ATLAS project , 2001, Parallel Comput..
[3] Richard W. Vuduc,et al. Sparsity: Optimization Framework for Sparse Matrix Kernels , 2004, Int. J. High Perform. Comput. Appl..
[4] Erik Elmroth,et al. Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software , 2004, SIAM Rev..
[5] Juan J. Navarro,et al. Adapting Linear Algebra Codes to the Memory Hierarchy Using a Hypermatrix Scheme , 2005, PPAM.
[6] Timo O. Reiss,et al. Optimal control of coupled spin dynamics: design of NMR pulse sequences by gradient ascent algorithms. , 2005, Journal of magnetic resonance.
[7] Michael Bader,et al. Cache oblivious matrix multiplication using an element ordering based on the Peano curve , 2006 .
[8] Michael Bader,et al. Hardware-Oriented Implementation of Cache Oblivious Matrix Operations Based on Space-Filling Curves , 2007, PPAM.
[9] Robert A. van de Geijn,et al. Anatomy of high-performance matrix multiplication , 2008, TOMS.
[10] Alexander Heinecke,et al. Parallel matrix multiplication based on space-filling curves on shared memory multicore platforms , 2008, MAW '08.