Efficient Parallel Computation of the Characteristic Polynomial of a Sparse, Separable Matrix

Abstract. {This paper is concerned with the problem of computing the characteristic polynomial of a matrix. In a large number of applications, the matrices are symmetric and sparse : with O(n) non-zero entries. The problem has an efficient sequential solution in this case, requiring O(n2) work by use of the sparse Lanczos method. A major remaining open question is: to find a polylog time parallel algorithm with matching work bounds. Unfortunately, the sparse Lanczos method cannot be parallelized to faster than time Ω (n) using n processors. Let M(n) be the processor bound to multiply two n \times n matrices in O(log  n) parallel time. Giesbrecht [G2] gave the best previous polylog time parallel algorithms for the characteristic polynomial of a dense matrix with O (M(n)) processors. There is no known improvement to this processor bound in the case where the matrix is sparse. Often, in addition to being symmetric and sparse, the matrix has a sparsity graph (which has edges between indices of the matrix with non-zero entries) that has small separators. This paper gives a new algorithm for computing the characteristic polynomial of a sparse symmetric matrix, assuming that the sparsity graph is s(n) -separable and has a separator of size s(n)=O(nγ) , for some γ , 0 < γ < 1 , that when deleted results in connected components of ≤α n vertices, for some 0 < α < 1 , with the same property. We derive an interesting algebraic version of Nested Dissection, which constructs a sparse factorization of the matrix A-λ In where A is the input matrix and In is the n \times n identity matrix. While Nested Dissection is commonly used to minimize the fill-in in the solution of sparse linear systems, our innovation is to use the separator structure to bound also the work for manipulation of rational functions in the recursively factored matrices. The matrix elements are assumed to be over an arbitrary field. We compute the characteristic polynomial of a sparse symmetric matrix in polylog time using P(n)(n+M(s(n)))≤ P(n)(n+ s(n) 2.376) processors, where P(n) is the processor bound to multiply two degree n polynomials in O(log  n) parallel time using a PRAM (P(n) = O(n) if the field supports an FFT of size n but is otherwise O(nlog log  n) [CK]. Our method requires only that a matrix be symmetric and non-singular (it need not be positive definite as usual for Nested Dissection techniques). For the frequently occurring case where the matrix has small separator size, our polylog parallel algorithm has work bounds competitive with the best known sequential algorithms (i.e., the Ω(n2) work of sparse Lanczos methods), for example, when the sparsity graph is a planar graph, s(n) ≤ O( \sqrt n ) , and we require polylog time with only P(n)n1.188 processors.}

[1]  Donald E. Knuth The Art of Computer Programming 2 / Seminumerical Algorithms , 1971 .

[2]  Christopher C. Paige,et al.  The computation of eigenvalues and eigenvectors of very large sparse matrices , 1971 .

[3]  Victor Y. Pan,et al.  Processor efficient parallel solution of linear systems over an abstract field , 1991, SPAA '91.

[4]  V. Pan Algebraic complexity of computing polynomial zeros , 1987 .

[5]  Axel Ruhe methods for the eigenvalue problem with large sparse matrices , 1974 .

[6]  Erich Kaltofen,et al.  On Wiedemann's Method of Solving Sparse Linear Systems , 1991, AAECC.

[7]  Edward L. Wilson,et al.  Eigensolution of Large Structural Systems with Small Bandwidth , 1973 .

[8]  Gene H. Golub,et al.  Matrix computations , 1983 .

[9]  R. Tarjan,et al.  A Separator Theorem for Planar Graphs , 1977 .

[10]  Victor Y. Pan,et al.  The Parallel Computation of Minimum Cost Paths in Graphs by Stream Contraction , 1991, Inf. Process. Lett..

[11]  V. Pan PARAMETRIZATION OF NEWTON'S ITERATION FOR COMPUTATIONS WITH STRUCTURED MATRICES AND APPLICATIONS , 1992 .

[12]  D Kozen,et al.  Fast parallel orthogonalization , 1986, SIGA.

[13]  John H. Reif,et al.  Space and time efficient implementations of parallel nested dissection , 1992, SPAA '92.

[14]  A. George Nested Dissection of a Regular Finite Element Mesh , 1973 .

[15]  Victor Y. Pan,et al.  Processor-efficient parallel solution of linear systems. II. The positive characteristic and singular cases , 1992, Proceedings., 33rd Annual Symposium on Foundations of Computer Science.

[16]  A. Householder The numerical treatment of a single nonlinear equation , 1970 .

[17]  John H. Reif,et al.  An Efficient Algorithm for the Complex Roots Problem , 1996, J. Complex..

[18]  James Renegar,et al.  On the cost of approximating all roots of a complex polynomial , 1985, Math. Program..

[19]  C. Bender,et al.  The iterative calculation of several of the lowest or highest eigenvalues and corresponding eigenvectors of very large symmetric matrices , 1973 .

[20]  Don Coppersmith,et al.  Matrix multiplication via arithmetic progressions , 1987, STOC.

[21]  Allan Borodin,et al.  The computational complexity of algebraic and numeric problems , 1975, Elsevier computer science library.

[22]  M. Sadkane A block Arnoldi-Chebyshev method for computing the leading eigenpairs of large sparse unsymmetric matrices , 1993 .

[23]  Joseph F. Traub Complexity of Sequential and Parallel Numerical Algorithms , 1973 .

[24]  J. Cuppen A divide and conquer method for the symmetric tridiagonal eigenproblem , 1980 .

[25]  Jennifer A. Scott,et al.  An Arnoldi code for computing selected eigenvalues of sparse, real, unsymmetric matrices , 1995, TOMS.

[26]  J. G. Lewis Algorithms for sparse matrix eigenvalue problems , 1977 .

[27]  Ahmed Sameh,et al.  On the intermediate eigenvalues of symmetric sparse matrices , 1973 .

[28]  Victor Y. Pan,et al.  Fast and efficient parallel solution of dense linear systems , 1989 .

[29]  J. Cullum The simultaneous computation of a few of the algebraically largest and smallest eigenvalues of a large, sparse, symmetric matrix , 1978 .

[30]  Wayne Eberly On efficient band matrix arithmetic , 1992, Proceedings., 33rd Annual Symposium on Foundations of Computer Science.

[31]  Ole H. Hald,et al.  Inverse eigenvalue problems for Jacobi matrices , 1976 .

[32]  Jeffrey D Ullma Computational Aspects of VLSI , 1984 .

[33]  J. Bunch,et al.  Rank-one modification of the symmetric eigenproblem , 1978 .

[34]  Michael Ben-Or,et al.  Simple algorithms for approximating all roots of a polynomial with real roots , 1990, J. Complex..

[35]  Victor Y. Pan,et al.  Fast and E cient Parallel Evaluation of the Zeros of a Polynomial Having Only Real Zeros , 1989 .

[36]  Prasoon Tiwari,et al.  Polynomial root-finding: analysis and computational investigation of a parallel algorithm , 1992, SPAA '92.

[37]  Douglas H. Wiedemann Solving sparse linear equations over finite fields , 1986, IEEE Trans. Inf. Theory.

[38]  Axel Ruhe Implementation aspects of band Lanczos algorithms for computation of eigenvalues of large sparse sym , 1979 .

[39]  D. Rose,et al.  Generalized nested dissection , 1977 .

[40]  Allan Borodin,et al.  Fast parallel matrix and GCD computations , 1982, 23rd Annual Symposium on Foundations of Computer Science (sfcs 1982).

[41]  Jack J. Dongarra,et al.  A fully parallel algorithm for the symmetric eigenvalue problem , 1985, PPSC.

[42]  Victor Y. Pan,et al.  Parallel complexity of tridiagonal symmetric Eigenvalue problem , 1991, SODA '91.

[43]  M. Morf,et al.  Eigenvalues of a symmetric tridiagonal matrix: A divide-and-conquer approach , 1986 .

[44]  John H. Reif,et al.  Synthesis of Parallel Algorithms , 1993 .

[45]  Victor Y. Pan,et al.  Fast and Efficient Parallel Solution of Sparse Linear Systems , 1993, SIAM J. Comput..

[46]  C. A. Neff,et al.  An O(n^1+epsilon log b) Algorithm for the Complex Roots Problem , 1994, FOCS 1994.

[47]  C. Andrew Neff,et al.  Specified precision polynomial root isolation is in NC , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[48]  Mark Giesbrecht,et al.  Nearly Optimal Algorithms for Canonical Matrix Forms , 1995, SIAM J. Comput..

[49]  Maurice Mignotte,et al.  Mathematics for computer algebra , 1991 .

[50]  Jacob T. Schwartz,et al.  Fast Probabilistic Algorithms for Verification of Polynomial Identities , 1980, J. ACM.

[51]  Victor Y. Pan,et al.  Fast and Efficient Parallel Algorithms for the Exact Inversion of Integer Matrices , 1985, FSTTCS.

[52]  J. Cullum,et al.  A block Lanczos algorithm for computing the q algebraically largest eigenvalues and a corresponding eigenspace of large, sparse, real symmetric matrices , 1974, CDC 1974.

[53]  Luca Gemignani,et al.  Iteration schemes for the divide-and-conquer eigenvalue solver , 1994 .

[54]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[55]  Iain S. Duff,et al.  On the Reduction of Sparse Matrices to Condensed Forms by Similarity Transformations , 1975 .

[56]  G. E. Collins,et al.  Real Zeros of Polynomials , 1983 .

[57]  Ephraim Feig,et al.  A fast parallel algorithm for determining all roots of a polynomial with real roots , 1986, STOC '86.

[58]  John H. Reif An O(nlog/sup 3/ n) algorithm for the real root problem , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[59]  Mark Giesbrechty,et al.  Eecient Parallel Solution of Sparse Systems of Linear Diophantine Equations Eecient Parallel Solution of Sparse Systems of Linear Diophantine Equations , 2007 .

[60]  Dario Bini,et al.  On the Complexity of Polynomial Zeros , 1992, SIAM J. Comput..

[61]  Victor Y. Pan,et al.  Optimum parallel computations with banded matrices , 1994, SODA '94.

[62]  A. H. Sherman,et al.  Applications of an Element Model for Gaussian Elimination , 1976 .

[63]  Richard R. Underwood An iterative block Lanczos method for the solution of large sparse symmetric eigenproblems , 1975 .

[64]  V. Pan Sequential and parallel complexity of approximate evaluation of polynomial zeros , 1987 .