Analysis of dynamically scheduled tile algorithms for dense linear algebra on multicore architectures
Jack J. Dongarra | Hatem Ltaief | Azzam Haidar | Asim YarKhan
[1] Charles L. Lawson,et al. Basic Linear Algebra Subprograms for Fortran Usage , 1979, TOMS.
[2] C. Loan,et al. A Storage-Efficient $WY$ Representation for Products of Householder Transformations , 1989 .
[3] Robert A. van de Geijn,et al. The libflame Library for Dense Matrix Computations , 2009, Computing in Science & Engineering.
[4] Julien Langou,et al. Parallel tiled QR factorization for multicore architectures , 2007, Concurr. Comput. Pract. Exp..
[5] Jesús Labarta,et al. A dependency-aware task-based programming environment for multi-core architectures , 2008, 2008 IEEE International Conference on Cluster Computing.
[6] Jack Dongarra,et al. QR Factorization for the CELL Processor , 2008 .
[7] Robert A. van de Geijn,et al. Supermatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures , 2007, SPAA '07.
[8] Robert A. van de Geijn,et al. Parallel out-of-core computation and updating of the QR factorization , 2005, TOMS.
[9] Christian H. Bischof,et al. The WY representation for products of Householder matrices , 1985, PPSC.
[10] Jack J. Dongarra,et al. Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization , 2008, IEEE Transactions on Parallel and Distributed Systems.
[11] Jack Dongarra,et al. QUARK Users' Guide: QUeueing And Runtime for Kernels , 2011 .
[12] Bradley C. Kuszmaul,et al. Cilk: an efficient multithreaded runtime system , 1995, PPOPP '95.
[13] Emmanuel Agullo,et al. Comparative study of one-sided factorizations with multiple software packages on multi-core hardware , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[14] Jack Dongarra,et al. Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization , 2008 .
[15] Jack Dongarra,et al. Fully Dynamic Scheduler for Numerical Computing on Multicore Processors , 2009 .
[16] Jack Dongarra,et al. QR factorization for the Cell Broadband Engine , 2009, HiPC 2009.
[17] Jack J. Dongarra,et al. A set of level 3 basic linear algebra subprograms , 1990, TOMS.
[18] Julien Langou,et al. A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures , 2007, Parallel Comput..
[19] Jack Dongarra,et al. Parallel tiled QR factorization for multicore architectures , 2008 .
[20] Robert A. van de Geijn,et al. Updating an LU Factorization with Pivoting , 2008, TOMS.
[22] Jack Dongarra,et al. Numerical Linear Algebra for High-Performance Computers , 1998 .
[23] James Demmel,et al. LAPACK Users' Guide, Third Edition , 1999, Software, Environments and Tools.
[24] James Demmel,et al. Applied Numerical Linear Algebra , 1997 .