Paper information - Implementation of a Parallel Sparse Direct Solver on Vector Architecture


Finite element analysis of elasticity and/or fluid problems requires solving linear systems with large sparse matrices. Thanks to the development of graph partitioning software, it has become feasible to extract dense sub-matrices efficiently while minimizing fill-in during factorization. By analyzing the task dependencies of the block factorization of a dense matrix, the multiple cores of CPUs that share main memory can be used in parallel and asynchronously. The tasks on dense sub-matrices consist of BLAS level 3 kernels, which make efficient use of the arithmetic capabilities of both modern superscalar CPUs with large cache memory and modern vector CPUs, without any directives for explicit vectorization in the code. Nevertheless, a sparse part still remains in the factorization process. Although it accounts for only a small fraction of the whole process and is almost negligible on a superscalar CPU, its optimization is important on a vector architecture because its vector loops are short.
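The block factorization of a dense sub-matrix described in the abstract can be illustrated with a minimal sketch. It assumes a symmetric positive definite block and uses plain Cholesky factorization as a stand-in; this is not necessarily the factorization used by the authors. The function names `potrf`, `trsm`, and `syrk` borrow the names of the corresponding LAPACK/BLAS routines, but here they are hypothetical pure-Python helpers written only for illustration. Each call is one task: at a given block step, `trsm` depends on `potrf`, and `syrk` depends on `trsm`.

```python
import math


def potrf(a, k, e):
    """Task 1: in-place Cholesky of the diagonal block a[k:e, k:e] (lower part)."""
    for j in range(k, e):
        a[j][j] = math.sqrt(a[j][j] - sum(a[j][p] ** 2 for p in range(k, j)))
        for i in range(j + 1, e):
            s = sum(a[i][p] * a[j][p] for p in range(k, j))
            a[i][j] = (a[i][j] - s) / a[j][j]


def trsm(a, k, e, n):
    """Task 2: panel solve, a[e:n, k:e] <- a[e:n, k:e] * inv(L_kk)^T."""
    for i in range(e, n):
        for j in range(k, e):
            s = sum(a[i][p] * a[j][p] for p in range(k, j))
            a[i][j] = (a[i][j] - s) / a[j][j]


def syrk(a, k, e, n):
    """Task 3: trailing update, lower part of a[e:n, e:n] -= panel * panel^T."""
    for i in range(e, n):
        for j in range(e, i + 1):
            a[i][j] -= sum(a[i][p] * a[j][p] for p in range(k, e))


def blocked_cholesky(matrix, block=2):
    """Return the lower-triangular L with matrix = L * L^T, block by block."""
    n = len(matrix)
    a = [list(map(float, row)) for row in matrix]  # work on a copy
    for k in range(0, n, block):
        e = min(k + block, n)
        potrf(a, k, e)     # factorize the diagonal block
        trsm(a, k, e, n)   # solve for the panel below it
        syrk(a, k, e, n)   # update the remaining sub-matrix
    # Only the lower triangle was updated; zero out the rest.
    return [[a[i][j] if j <= i else 0.0 for j in range(n)] for i in range(n)]
```

In an optimized solver the panel solve and the trailing update would be split into per-block calls to level 3 BLAS, so that updates of independent blocks can run concurrently on different cores; the triple loops above only show what each task computes.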

Atsushi Suzuki | François‐Xavier Roux

[1] Charbel Farhat, et al. Implicit parallel processing in structural mechanics, 1994.

[2] O. Schenk, et al. On fast factorization pivoting methods for sparse symmetric indefinite systems, 2006.

[3] J. Mandel. Balancing domain decomposition, 1993.

[4] Patrick R. Amestoy, et al. Multifrontal parallel distributed symmetric and unsymmetric solvers, 2000.

[5] Patrick R. Amestoy, et al. Hybridizing nested dissection and halo approximate minimum degree for efficient sparse matrix ordering, 2000.

[6] Vipin Kumar, et al. A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs, 1998, SIAM J. Sci. Comput.

[7] Chandrajit L. Bajaj, et al. NURBS approximation of surface/surface intersection curves, 1994, Adv. Comput. Math.

[8] A. George. Numerical Experiments Using Dissection Methods to Solve n by n Grid Problems, 1977.

[9] A. George, et al. Algorithms for Matrix Partitioning and the Numerical Solution of Finite Element Systems, 1978.

[10] Atsushi Suzuki, et al. A dissection solver with kernel detection for symmetric finite element matrices on shared memory computers, 2014.