A Compiler-Blockable Algorithm for QR Decomposition
[1] Ken Kennedy, et al. Parallel Programming Support in ParaScope, 1988, Parallel Computing in Science and Engineering.
[2] Ken Kennedy, et al. Analysis of interprocedural side effects in a parallel programming environment, 1988, J. Parallel Distributed Comput.
[3] Jack J. Dongarra, et al. Solving linear systems on vector and shared memory computers, 1990.
[4] Steven Mark Carr, et al. Memory-hierarchy management, 1993.
[5] Michael Wolfe, et al. Iteration Space Tiling for Memory Hierarchies, 1987, PPSC.
[6] Monica S. Lam, et al. The cache performance and optimizations of blocked algorithms, 1991, ASPLOS IV.
[7] Ken Kennedy, et al. Software methods for improvement of cache performance on supercomputer applications, 1989.
[8] Ken Kennedy, et al. An Implementation of Interprocedural Bounded Regular Section Analysis, 1991, IEEE Trans. Parallel Distributed Syst.
[9] John B. Shoven, et al. I, Edinburgh Medical and Surgical Journal.
[10] Richard B. Lehoucq, et al. Implementing Efficient and Portable Dense Matrix Factorizations, 1991, SIAM Conference on Parallel Processing for Scientific Computing.
[11] Ken Kennedy, et al. Blocking Linear Algebra Codes for Memory Hierarchies, 1989, PPSC.