A Two-pronged Progress in Structured Dense Matrix Vector Multiplication

Matrix-vector multiplication is one of the most fundamental computing primitives. Given a matrix A ∈ F N × N and a vector b ∈ F N , it is known that in the worst case Θ(N 2) operations over F are needed to compute Ab. Many types of structured matrices do admit faster multiplication. However, even given a matrix A that is known to have this property, it is hard in general to recover a representation of A exposing the actual fast multiplication algorithm. Additionally, it is not known in general whether the inverses of such structured matrices can be computed or multiplied quickly. A broad question is thus to identify classes of structured dense matrices that can be represented with O(N) parameters, and for which matrix-vector multiplication (and ideally other operations such as solvers) can be performed in a sub-quadratic number of operations. One such class of structured matrices that admit near-linear matrix-vector multiplication are the orthogonal polynomial transforms whose rows correspond to a family of orthogonal polynomials. Other well known classes include the Toeplitz, Hankel, Vandermonde, Cauchy matrices and their extensions (e.g. confluent Cauchy-like matrices) that are all special cases of a low displacementrank property. In this paper, we make progress on two fronts: Our work unifies, generalizes, and simplifies existing state-of-the-art results in structured matrix-vector multiplication. Finally, we show how applications in areas such as multipoint evaluations of multivariate polynomials can be reduced to problems involving low recurrence width matrices.

[1]  Erich Kaltofen,et al.  On Wiedemann's Method of Solving Sparse Linear Systems , 1991, AAECC.

[2]  Swastik Kopparty,et al.  List-Decoding Multiplicity Codes , 2012, Theory Comput..

[3]  Michael Clausen,et al.  Algebraic complexity theory , 1997, Grundlehren der mathematischen Wissenschaften.

[4]  Amin Shokrollahi,et al.  A displacement approach to efficient decoding of algebraic-geometric codes , 1999, STOC '99.

[5]  Valeria Simoncini,et al.  Computational Methods for Linear Matrix Equations , 2016, SIAM Rev..

[6]  V. Pan,et al.  Structured matrices and newton's iteration: unified approach , 2000 .

[7]  R. Vandebril,et al.  Matrix Computations and Semiseparable Matrices: Linear Systems , 2010 .

[8]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[9]  Georg Heinig,et al.  Algebraic Methods for Toeplitz-like Matrices and Operators , 1984 .

[10]  Dennis M. Healy,et al.  Fast Discrete Polynomial Transforms with Applications to Data Analysis for Distance Transitive Graphs , 1997, SIAM J. Comput..

[11]  Clément Pernet Computing with Quasiseparable Matrices , 2016, ISSAC.

[12]  Raf Vandebril,et al.  A note on the representation and definition of semiseparable matrices , 2005, Numer. Linear Algebra Appl..

[13]  Éric Schost,et al.  Fast Conversion Algorithms for Orthogonal Polynomials , 2008, ArXiv.

[14]  L. Ljung,et al.  New inversion formulas for matrices classified in terms of their distance from Toeplitz matrices , 1979 .

[15]  Victor Y. Pan,et al.  Nearly optimal computations with structured matrices , 2014, Theor. Comput. Sci..

[16]  Yanzhi Wang,et al.  Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank , 2017, ICML.

[17]  Or Meir,et al.  High-rate locally-correctable and locally-testable codes with sub-polynomial query complexity , 2016, STOC.

[18]  Vadim Olshevsky,et al.  Fast algorithms for structured matrices : theory and applications : AMS-IMS-SIAM Joint Summer Research Conference on Fast Algorithms in Mathematics, Computer Science and Engineering, August 5-9, 2001, Mount Holyoke College, South Hadley, Massachusetts , 2003 .

[19]  V. Pan,et al.  Polynomial and Matrix Computations , 1994, Progress in Theoretical Computer Science.

[20]  P. Lancaster,et al.  Factored forms for solutions of AX − XB = C and X − AXB = C in companion matrices , 1984 .

[21]  Joan Boyar,et al.  Logic Minimization Techniques with Applications to Cryptology , 2013, Journal of Cryptology.

[22]  Christopher Umans,et al.  Fast Polynomial Factorization and Modular Composition , 2011, SIAM J. Comput..

[23]  V. Pan Structured Matrices and Polynomials: Unified Superfast Algorithms , 2001 .

[24]  Éric Schost,et al.  Algorithms for Structured Linear Systems Solving and Their Implementation , 2017, ISSAC.

[25]  T. Chihara,et al.  An Introduction to Orthogonal Polynomials , 1979 .

[26]  Tara N. Sainath,et al.  Structured Transforms for Small-Footprint Deep Learning , 2015, NIPS.

[27]  M. Morf,et al.  Displacement ranks of matrices and linear equations , 1979 .

[28]  J. L. Bordewijk Inter-reciprocity applied to electrical networks , 1957 .

[29]  Ce Zhang,et al.  Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features , 2016, Nature Communications.

[30]  Israel Gohberg,et al.  Separable Type Representations of Matrices and Fast Algorithms , 2013 .

[31]  J. Bordewijk Inter-reciprocity applied to electrical networks , 1957 .

[32]  Clément Pernet,et al.  Time and space efficient generators for quasiseparable matrices , 2017, J. Symb. Comput..

[33]  Martin Morf,et al.  Doubling algorithms for Toeplitz and related equations , 1980, ICASSP.

[34]  G. Golub,et al.  A bibliography on semiseparable matrices* , 2005 .

[35]  Marc Van Barel,et al.  A Givens-Weight Representation for Rank Structured Matrices , 2007, SIAM J. Matrix Anal. Appl..

[36]  Gabriele Steidl,et al.  Fast algorithms for discrete polynomial transforms , 1998, Math. Comput..

[37]  Chee-Keng Yap,et al.  Fundamental problems of algorithmic algebra , 1999 .

[38]  Harold S. Stone,et al.  A Parallel Algorithm for the Efficient Solution of a General Class of Recurrence Equations , 1973, IEEE Transactions on Computers.

[39]  Misha Denil,et al.  ACDC: A Structured Efficient Linear Layer , 2015, ICLR.

[40]  Claude-Pierre Jeannerod,et al.  On Matrices With Displacement Structure: Generalized Operators and Faster Algorithms , 2017, SIAM J. Matrix Anal. Appl..

[41]  Amin Shokrollahi,et al.  Matrix-vector product for confluent Cauchy-like matrices with application to confluent rational interpolation , 2000, STOC '00.

[42]  I. Gohberg,et al.  Complexity of multiplication with vectors for structured matrices , 1994 .

[43]  V. Pan On computations with dense structured matrices , 1990 .

[44]  Shubhangi Saraf,et al.  High-rate codes with sublinear-time decoding , 2014, Electron. Colloquium Comput. Complex..

[45]  Ali H. Sayed,et al.  Displacement Structure: Theory and Applications , 1995, SIAM Rev..

[46]  Erich Kaltofen Asymptotically fast solution of Toeplitz-like singular linear systems , 1994, ISSAC '94.

[47]  Israel Gohberg,et al.  Computations with quasiseparable polynomials and matrices , 2008, Theor. Comput. Sci..

[48]  Martin Ziegler,et al.  Fast Multipoint Evaluation of Bivariate Polynomials , 2004, ESA.

[49]  Victor Y. Pan,et al.  A unified superfast algorithm for boundary rational tangential interpolation problems and for inversion and factorization of dense structured matrices , 1998, Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280).

[50]  I. Gohberg,et al.  On a new class of structured matrices , 1999 .

[51]  B. Anderson,et al.  Asymptotically fast solution of toeplitz and related systems of linear equations , 1980 .

[52]  Beatrice Meini,et al.  Solving certain matrix equations by means of Toeplitz computations: algorithms and applications , 2001 .