Parallel direct solver for solving systems of linear equations resulting from finite element method on multi-core desktops and workstations

This paper presents the PARFES-parallel sparse direct solver for solving systems of linear equations resulting from the finite element method applied to problems of structural and solid mechanics. PARFES is intended for use on shared-memory multi-core desktops, laptops and workstations. The solver's high performance relies on the use of a specific algorithm for the procedure of block matrix multiplication using AVX instructions for the vectorization of computations, and the parallelizing approach that ensures a balanced processor load. The block looking-left factorization algorithm allows for the use of disk memory if the dimensionality of problem exceeds the capacity of RAM.

[1]  Alan George,et al.  The Evolution of the Minimum Degree Ordering Algorithm , 1989, SIAM Rev..

[2]  Patrick R. Amestoy,et al.  Multifrontal parallel distributed symmetric and unsymmetric solvers , 2000 .

[3]  S.Yu. Fialko Iterative methods for solving large-scale problems of structural mechanics using multi-core computers , 2014 .

[4]  Sergiy Fialko Application of AVX (Advanced Vector Extensions) for improved performance of the PARFES - finite element Parallel Direct Solver , 2013, 2013 Federated Conference on Computer Science and Information Systems.

[5]  Sergiy Fialko,et al.  Parallel finite element solver for multi-core computers , 2012, 2012 Federated Conference on Computer Science and Information Systems (FedCSIS).

[6]  B. Parlett The Symmetric Eigenvalue Problem , 1981 .

[7]  S. Fialko,et al.  PARFES: A method for solving finite element linear equations on multi-core computers , 2010, Adv. Eng. Softw..

[8]  James Demmel,et al.  Applied Numerical Linear Algebra , 1997 .

[9]  Jean-Yves L'Excellent,et al.  On Computing Inverse Entries of a Sparse Matrix in an Out-of-Core Environment , 2012, SIAM J. Sci. Comput..

[10]  Robert Schreiber,et al.  Efficient Methods for Out-of-Core Sparse Cholesky Factorization , 1999, SIAM J. Sci. Comput..

[11]  Alan George,et al.  Computer Solution of Large Sparse Positive Definite , 1981 .

[12]  Jennifer A. Scott,et al.  New Parallel Sparse Direct Solvers for Multicore Architectures , 2013, Algorithms.

[13]  Jean-Yves L'Excellent,et al.  Some Experiments and Issues to Exploit Multicore Parallelism in a Distributed-Memory Parallel Sparse Direct Solver , 2010 .

[14]  Gene H. Golub,et al.  Matrix computations , 1983 .

[15]  Sivan Toledo,et al.  The design and implementation of a new out-of-core sparse cholesky factorization method , 2004, TOMS.

[16]  David Pardo,et al.  Simulation of marine controlled source electromagnetic measurements using a parallel fourier hp-finite element method , 2011 .

[17]  James Demmel,et al.  A Supernodal Approach to Sparse Partial Pivoting , 1999, SIAM J. Matrix Anal. Appl..

[18]  Jean-Yves L'Excellent,et al.  Improving Multifrontal Methods by Means of Block Low-Rank Representations , 2015, SIAM J. Sci. Comput..

[19]  Victor Eijkhout,et al.  Sparse direct factorizations through unassembled hyper-matrices , 2010 .

[20]  Nicholas I. M. Gould,et al.  A numerical evaluation of sparse direct solvers for the solution of large sparse symmetric linear systems of equations , 2007, TOMS.

[21]  S. Timoshenko,et al.  THEORY OF PLATES AND SHELLS , 1959 .

[22]  Alfredo Buttari,et al.  Fine-Grained Multithreading for the Multifrontal QR Factorization of Sparse Matrices , 2013, SIAM J. Sci. Comput..

[23]  Robert A. van de Geijn,et al.  Anatomy of high-performance matrix multiplication , 2008, TOMS.

[24]  Anatoly V Perelmuter,et al.  Numerical structural analysis , 2003 .

[25]  Victor Eijkhout,et al.  A Parallel Sparse Direct Solver via Hierarchical DAG Scheduling , 2014, ACM Trans. Math. Softw..

[26]  Robert A. van de Geijn,et al.  BLIS: A Framework for Rapidly Instantiating BLAS Functionality , 2015, ACM Trans. Math. Softw..

[27]  YANQING CHEN,et al.  Algorithm 8 xx : CHOLMOD , supernodal sparse Cholesky factorization and update / downdate ∗ , 2006 .

[28]  Daniel Kressner,et al.  Block variants of Hammarling's method for solving Lyapunov equations , 2008, TOMS.

[29]  Olaf Schenk,et al.  Two-level dynamic scheduling in PARDISO: Improved scalability on shared memory multiprocessing systems , 2002, Parallel Comput..

[30]  Timothy A. Davis,et al.  Dynamic Supernodes in Sparse Cholesky Update/Downdate and Triangular Solves , 2009, TOMS.