Iteration-fusing conjugate gradient
暂无分享,去创建一个
[1] L. Reichel,et al. A Newton basis GMRES implementation , 1994 .
[2] Wim Vanroose,et al. Hiding global synchronization latency in the preconditioned Conjugate Gradient algorithm , 2014, Parallel Comput..
[3] Victor Eijkhout,et al. LAPACK Working Note 56: Reducing Communication Costs in the Conjugate Gradient Algorithm on Distributed Memory Multiprocessors , 1993 .
[4] Francisco J. Cazorla,et al. A Quantitative Analysis of OS Noise , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.
[5] Gérard Meurant. Multitasking the conjugate gradient method on the CRAY X-MP/48 , 1987, Parallel Comput..
[6] Wim Vanroose,et al. Hiding Global Communication Latency in the GMRES Algorithm on Massively Parallel Machines , 2013, SIAM J. Sci. Comput..
[7] Laurence T. Yang,et al. The improved BiCG method for large and sparse linear systems on parallel distributed memory architectures , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.
[8] Anthony T. Chronopoulos,et al. s-step iterative methods for symmetric linear systems , 1989 .
[9] Qiang Ye,et al. Residual Replacement Strategies for Krylov Subspace Iterative Methods for the Convergence of True Residuals , 2000, SIAM J. Sci. Comput..
[10] Kai Li,et al. The PARSEC benchmark suite: Characterization and architectural implications , 2008, 2008 International Conference on Parallel Architectures and Compilation Techniques (PACT).
[11] Richard Barrett,et al. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods , 1994, Other Titles in Applied Mathematics.
[12] Torsten Hoefler,et al. Characterizing the Influence of System Noise on Large-Scale Applications by Simulation , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.
[13] John Van Rosendale. Minimizing Inner Product Data Dependencies in Conjugate Gradient Iteration , 1983, ICPP.
[14] Ron Brightwell,et al. Characterizing application sensitivity to OS interference using kernel-level noise injection , 2008, HiPC 2008.
[15] L.T. Yang,et al. The improved BiCGStab method for large and sparse unsymmetric linear systems on parallel distributed memory architectures , 2002, Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002. Proceedings..
[16] E. F. DAzevedo,et al. Reducing communication costs in the conjugate gradient algorithm on distributed memory multiprocessors , 1992 .
[17] Eduard Ayguadé,et al. Implementing OmpSs support for regions of data in architectures with multiple address spaces , 2013, ICS '13.
[18] James Demmel,et al. A Residual Replacement Strategy for Improving the Maximum Attainable Accuracy of s-Step Krylov Subspace Methods , 2014, SIAM J. Matrix Anal. Appl..
[19] Mark Hoemmen,et al. Communication-avoiding Krylov subspace methods , 2010 .
[20] Timothy A. Davis,et al. The university of Florida sparse matrix collection , 2011, TOMS.
[21] H. V. D. Vorst,et al. Reducing the effect of global communication in GMRES( m ) and CG on parallel distributed memory computers , 1995 .
[22] Y. Saad,et al. Practical Use of Some Krylov Subspace Methods for Solving Indefinite and Nonsymmetric Linear Systems , 1984 .
[23] Emmanuel Agullo,et al. On rounding error resilience, maximal attainable accuracy and parallel performance of the pipelined Conjugate Gradients method for large-scale linear systems in PETSc , 2016, EASC.
[24] Anthony T. Chronopoulos. s-Step Iterative Methods for (Non) Symmetric (In) Definite Linear Systems , 1989, PPSC.
[25] H. V. der. Residual Replacement Strategies for Krylov Subspace Iterative Methods for the Convergence of True Residuals , 2000 .
[26] James Demmel,et al. Parallel numerical linear algebra , 1993, Acta Numerica.