Parallel Solution of Sparse Linear Systems

Many simulations in science and engineering give rise to sparse linear systems of equations. The cost of the simulation is almost always dominated by the solution of these linear systems, especially for large-scale problems. The emergence of extreme-scale parallel platforms, along with the increasing number of processing cores available on a single chip, poses significant challenges for algorithm development. Machines with tens of thousands of multicore processors place tremendous constraints on the communication and memory access requirements of algorithms. Increasing the number of cores in a processing unit without a matching increase in memory bandwidth aggravates an already significant memory bottleneck. Sparse linear algebra kernels are well known for their poor processor utilization, a result of limited memory reuse, which renders data caching less effective. In view of emerging hardware trends, it is necessary to develop algorithms that strike a more meaningful balance between memory accesses, communication, and computation. Specifically, an algorithm that performs more floating point operations in exchange for fewer memory accesses and less communication is likely to yield better performance. We present two variations of DS factorization-based methods for the solution of sparse linear systems on parallel computing platforms, and we compare their performance with that of traditional LU factorization-based parallel solvers. We show that by combining iterative methods with direct solvers and using DS factorization, one can achieve better scalability and a shorter time to solution.
