Evaluating Accuracy and Efficiency of HPC Solvers for Sparse Linear Systems with Applications to PDEs

Partial Differential Equations (PDEs) describe several problems relevant to many fields of applied sciences, and their discrete counterparts typically involve the solution of sparse linear systems. In this context, we focus on the analysis of the computational aspects related to the solution of large and sparse linear systems with HPC solvers, by considering the performances of direct and iterative solvers in terms of computational efficiency, scalability, and numerical accuracy. Our aim is to identify the main criteria to support application-domain specialists in the selection of the most suitable solvers, according to the application requirements and available resources. To this end, we discuss how the numerical solver is affected by the regular/irregular discretisation of the input domain, the discretisation of the input PDE with piecewise linear or polynomial basis functions, which generally result in a higher/lower sparsity of the coefficient matrix, and the choice of different initial conditions, which are associated with linear systems with multiple right-hand side terms. Finally, our analysis is independent of the characteristics of the underlying computational architectures, and provides a methodological approach that can be applied to different classes of PDEs or with approximation problems.

[1]  Patrick Amestoy,et al.  A Fully Asynchronous Multifrontal Solver Using Distributed Dynamic Scheduling , 2001, SIAM J. Matrix Anal. Appl..

[2]  Xiaoye S. Li,et al.  An overview of SuperLU: Algorithms, implementation, and user interface , 2003, TOMS.

[3]  Jack J. Dongarra,et al.  A scalable approach to solving dense linear algebra problems on hybrid CPU‐GPU systems , 2015, Concurr. Comput. Pract. Exp..

[4]  Patrick Amestoy,et al.  Hybrid scheduling for the parallel solution of linear systems , 2006, Parallel Comput..

[5]  Christian Becker,et al.  FEAST—realization of hardware‐oriented numerics for HPC simulations with finite elements , 2010, Concurr. Comput. Pract. Exp..

[6]  David E. Keyes,et al.  Performance optimization of Sparse Matrix‐Vector Multiplication for multi‐component PDE‐based applications using GPUs , 2016, Concurr. Comput. Pract. Exp..

[7]  Jack Dongarra,et al.  Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems , 2015, Supercomput. Front. Innov..

[8]  Pier Luca Maffettone,et al.  Computational simulations of 3D large-scale time-dependent viscoelastic flows in high performance co , 2011 .

[9]  Olaf Schenk,et al.  Solving unsymmetric sparse systems of linear equations with PARDISO , 2002, Future Gener. Comput. Syst..

[10]  Brian Vinter,et al.  Fast synchronization‐free algorithms for parallel sparse triangular solves with multiple right‐hand sides , 2017, Concurr. Comput. Pract. Exp..

[11]  Pascal Hénon,et al.  PaStiX: a high-performance parallel direct solver for sparse symmetric positive definite systems , 2002, Parallel Comput..

[12]  Maxim Naumov,et al.  Incomplete-LU and Cholesky Preconditioned Iterative Methods Using CUSPARSE and CUBLAS , 2012 .

[13]  Seid Koric,et al.  Evaluation of parallel direct sparse linear solvers in electromagnetic geophysical problems , 2016, Comput. Geosci..

[14]  Jianlin Xia,et al.  A Parallel Geometric Multifrontal Solver Using Hierarchically Semiseparable Structure , 2016, ACM Trans. Math. Softw..

[15]  A. Quarteroni Numerical Models for Differential Problems , 2009 .

[16]  Giuseppe Patanè,et al.  STAR ‐ Laplacian Spectral Kernels and Distances for Geometry Processing and Shape Analysis , 2016, Comput. Graph. Forum.

[17]  James Demmel,et al.  the Parallel Computing Landscape , 2022 .

[18]  Barry F. Smith,et al.  PETSc Users Manual , 2019 .

[19]  Yousef Saad,et al.  Iterative methods for sparse linear systems , 2003 .

[20]  Gerhard Wellein,et al.  GHOST: Building Blocks for High Performance Sparse Linear Algebra on Heterogeneous Systems , 2015, International Journal of Parallel Programming.

[21]  Lucas Mello Schnorr,et al.  Performance modeling of a geophysics application to accelerate over‐decomposition parameter tuning through simulation , 2019, Concurr. Comput. Pract. Exp..

[22]  Robert D. Falgout,et al.  The Design and Implementation of hypre, a Library of Parallel High Performance Preconditioners , 2006 .

[23]  Patrick R. Amestoy,et al.  An Approximate Minimum Degree Ordering Algorithm , 1996, SIAM J. Matrix Anal. Appl..

[24]  Mathias Jacquelin,et al.  Highly scalable distributed-memory sparse triangular solution algorithms , 2018, CSC.

[25]  Julien Langou,et al.  The Impact of Multicore on Math Software , 2006, PARA.

[26]  Tamara G. Kolda,et al.  An overview of the Trilinos project , 2005, TOMS.

[27]  Emmanuel Agullo,et al.  Multifrontal QR Factorization for Multicore Architectures over Runtime Systems , 2013, Euro-Par.

[28]  Emmanuel Agullo,et al.  Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems , 2016, ACM Trans. Math. Softw..

[29]  G. M.,et al.  Partial Differential Equations I , 2023, Applied Mathematical Sciences.

[30]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[31]  Jack J. Dongarra,et al.  Software Libraries for Linear Algebra Computations on High Performance Computers , 1995, SIAM Rev..

[32]  James Demmel,et al.  SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems , 2003, TOMS.

[33]  Eric Darve,et al.  A distributed-memory hierarchical solver for general sparse linear systems , 2017, Parallel Comput..