Enhanced Parallel ILU(p)-based Preconditioners for Multi-core CPUs and GPUs -- The Power(q)-pattern Method
暂无分享,去创建一个
Jan-Philipp Weiss | Vincent Heuveline | Dimitar Lukarski | V. Heuveline | D. Lukarski | Jan-Philipp Weiss
[1] K. Chen,et al. Matrix preconditioning techniques and applications , 2005 .
[2] Philippe G. Ciarlet,et al. The finite element method for elliptic problems , 2002, Classics in applied mathematics.
[3] Vincent Heuveline. HiFlow3: a flexible and hardware-aware parallel finite element package , 2010, POOSC '10.
[4] L. Kolotilina,et al. Factorized Sparse Approximate Inverse Preconditionings I. Theory , 1993, SIAM J. Matrix Anal. Appl..
[5] Jan-Philipp Weiss,et al. A multi-platform linear algebra toolbox for finite element solvers on heterogeneous clusters , 2010, 2010 IEEE International Conference On Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS).
[6] Rajesh Bordawekar,et al. Optimizing Sparse Matrix-Vector Multiplication on GPUs , 2009 .
[7] D. Braess. Finite Elements: Theory, Fast Solvers, and Applications in Solid Mechanics , 1995 .
[8] Yousef Saad,et al. Iterative methods for sparse linear systems , 2003 .
[9] Mark Frederick Hoemmen,et al. An Overview of Trilinos , 2003 .
[10] O. Axelsson,et al. Finite element solution of boundary value problemes - theory and computation , 2001, Classics in applied mathematics.
[11] Richard Barrett,et al. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods , 1994, Other Titles in Applied Mathematics.
[12] V. E. Henson,et al. BoomerAMG: a parallel algebraic multigrid solver and preconditioner , 2002 .
[13] M. Benzi,et al. A comparative study of sparse approximate inverse preconditioners , 1999 .
[14] D. Chen. Analysis , Implementation , and Evaluation of Vaidya ’ s Preconditioners , 2001 .
[15] L. R. Scott,et al. The Mathematical Theory of Finite Element Methods , 1994 .
[16] Michael Garland,et al. Implementing sparse matrix-vector multiplication on throughput-oriented processors , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[17] James Demmel,et al. Applied Numerical Linear Algebra , 1997 .
[18] Jan-Philipp Weiss,et al. Scalable Multi-coloring Preconditioning for Multi-core CPUs and GPUs , 2010, Euro-Par Workshops.
[19] Dominik Göddeke,et al. Fast and accurate finite-element multigrid solvers for PDE simulations on GPU clusters , 2011 .