The Design of I/O-Efficient Sparse Direct Solvers

We consider two problems related to I/O: First, find the minimum primary memory size required to factor a sparse, symmetric matrix when permitted to read and write the data exactly once. Second, find the minimum data traffic between core and external memory when permitted to read and write the data many times. These problems are likely to be intractable in general, but we prove upper and lower bounds on these quantities for several model problems with useful sparsity (i.e., whose computational graphs have small separators). We provide fast algorithms for computing these quantities through simulation for irregular problems. The choice of factorization algorithms (left-looking, right-looking, multifrontal), orderings (nested dissection or minimum degree), and blocking techniques (1- or 2- dimensional blocks) can change the memory size and traffic by orders of magnitude. Explicitly moving the data (files managed by the program) improves performance significantly over implicit data movement (pages managed by the operating system). Thus this work guides us in designing a software library that implements an external memory sparse solver.

[1]  John May,et al.  Parallel I/O for High Performance Computing , 2000 .

[2]  Sivan Toledo,et al.  A survey of out-of-core algorithms in numerical linear algebra , 1999, External Memory Algorithms.

[3]  Florin Dobrian,et al.  The design of sparse direct solvers using object-oriented techniques , 1999 .

[4]  Robert Schreiber,et al.  Efficient Methods for Out-of-Core Sparse Cholesky Factorization , 1999, SIAM J. Sci. Comput..

[5]  Jeffrey Scott Vitter,et al.  External memory algorithms , 1998, ESA.

[6]  Sivan Toledo Locality of Reference in LU Decomposition with Partial Pivoting , 1997, SIAM J. Matrix Anal. Appl..

[7]  Barry W. Peyton,et al.  Block Sparse Cholesky Algorithms on Advanced Uniprocessor Computers , 1991, SIAM J. Sci. Comput..

[8]  Joseph W. H. Liu The role of elimination trees in sparse factorization , 1990 .

[9]  Barry W. Peyton,et al.  Progress in Sparse Matrix Methods for Large Linear Systems On Vector Supercomputers , 1987 .

[10]  Joseph W. H. Liu An adaptive general sparse out-of-core cholesky factorization scheme , 1987 .

[11]  Joseph W. H. Liu,et al.  On the storage requirement in the out-of-core multifrontal method for sparse factorization , 1986, TOMS.

[12]  Joseph W. H. Liu,et al.  Modification of the minimum-degree algorithm by multiple elimination , 1985, TOMS.

[13]  H. T. Kung,et al.  I/O complexity: The red-blue pebble game , 1981, STOC '81.

[14]  W. H. Liu,et al.  AN APPLICATION OF GENERALIZED TREE PEBBLING TO SPARSE MATRIX FACTORIZATION , 2022 .