Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster
Sivasankaran Rajamanickam | Stanimire Tomov | Ichitaro Yamazaki | Erik G. Boman | Michael A. Heroux | Mark Hoemmen
[1] Y. Saad,et al. Practical Use of Polynomial Preconditionings for the Conjugate Gradient Method , 1985 .
[2] John Van Rosendale. Minimizing Inner Product Data Dependencies in Conjugate Gradient Iteration , 1983, ICPP.
[3] Jack J. Dongarra,et al. Improving the Performance of CA-GMRES on Multicores with Multiple GPUs , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.
[4] Laura Grigori,et al. Communication Avoiding ILU0 Preconditioner , 2015, SIAM J. Sci. Comput..
[5] E. Cuthill,et al. Reducing the bandwidth of sparse symmetric matrices , 1969, ACM '69.
[6] Jack Dongarra,et al. Mixed-precision orthogonalization scheme and adaptive step size for CA-GMRES on GPUs , 2014 .
[7] Anthony T. Chronopoulos,et al. On the efficient implementation of preconditioned s-step conjugate gradient methods on multiprocessors with memory hierarchy , 1989, Parallel Comput..
[8] J. Demmel,et al. Avoiding Communication in Computing Krylov Subspaces , 2007 .
[9] Sivan Toledo,et al. Quantitative performance modeling of scientific computations and creating locality in numerical algorithms , 1995 .
[10] Kesheng Wu,et al. A Block Orthogonalization Procedure with Constant Synchronization Requirements , 2000, SIAM J. Sci. Comput..
[11] N. Abdelmalek. Round off error analysis for Gram-Schmidt method and solution of linear least squares problems , 1971 .
[12] Kesheng Wu,et al. A Communication-Avoiding Thick-Restart Lanczos Method on a Distributed-Memory System , 2011, Euro-Par Workshops.
[13] Y. Saad,et al. GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems , 1986 .
[14] James Demmel,et al. Minimizing communication in sparse matrix solvers , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[15] D. Hu. A Newton Basis GMRES Implementation , 1991 .
[16] M. Naumov. Parallel Solution of Sparse Triangular Linear Systems in the Preconditioned Iterative Methods on the GPU , 2011 .
[17] C. W. Gear,et al. Implementation of preconditioned s-step Conjugate Gradient methods on a multiprocessor system with memory hierarchy , 1987 .
[18] Mark Hoemmen,et al. Communication-avoiding Krylov subspace methods , 2010 .
[19] W. Marsden. I and J , 2012 .
[20] James Demmel,et al. Communication lower bounds and optimal algorithms for numerical linear algebra , 2014, Acta Numerica.
[21] Xiao-Chuan Cai,et al. A Restricted Additive Schwarz Preconditioner for General Sparse Linear Systems , 1999, SIAM J. Sci. Comput..