Reduced-Rank Regression with Operator Norm Error

A common data analysis task is the reduced-rank regression problem: $$\min_{\textrm{rank-}k \ X} \|AX-B\|,$$ where $A \in \mathbb{R}^{n \times c}$ and $B \in \mathbb{R}^{n \times d}$ are given large matrices and $\|\cdot\|$ is some norm. Here the unknown matrix $X \in \mathbb{R}^{c \times d}$ is constrained to have rank $k$, since this constraint substantially reduces the number of parameters in the solution when $c$ and $d$ are large. For Frobenius norm error, this problem has a standard closed-form solution, as well as a fast algorithm for finding a $(1+\varepsilon)$-approximate solution. However, for the important case of operator norm error, no closed-form solution is known, and the fastest known algorithms take singular value decomposition time. We give the first randomized algorithms for this problem running in time $$(\text{nnz}(A) + \text{nnz}(B) + c^2) \cdot k/\varepsilon^{1.5} + (n+d)k^2/\varepsilon + c^{\omega},$$ up to polylogarithmic factors in the condition numbers, the matrix dimensions, and $1/\varepsilon$. Here $\text{nnz}(M)$ denotes the number of non-zero entries of a matrix $M$, and $\omega$ is the exponent of matrix multiplication. Since both (1) spectral low-rank approximation ($A = B$) and (2) linear system solving ($c = n$ and $d = 1$) are special cases, our running time cannot be improved by more than a $1/\varepsilon$ factor (up to polylogarithmic factors) without a major breakthrough in linear algebra. Interestingly, known techniques for low-rank approximation, such as alternating minimization and sketch-and-solve, provably fail for this problem. Instead, our algorithm uses an existential characterization of a solution, together with Krylov methods, low-degree polynomial approximation, and sketching-based preconditioning.
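
The Frobenius-norm closed form referred to above is the classical reduced-rank regression solution $X^* = A^+\,[A A^+ B]_k$, where $A^+$ is the Moore-Penrose pseudoinverse and $[M]_k$ denotes the best rank-$k$ approximation of $M$ (truncated SVD). The following is a minimal NumPy sketch of that formula; the function name is ours, and the dense-SVD route shown here is purely illustrative, not the fast $(1+\varepsilon)$-approximation algorithm mentioned in the abstract.

```python
import numpy as np

def frobenius_rank_k_regression(A, B, k):
    """Closed-form solution of min_{rank-k X} ||A X - B||_F.

    Classical result: X* = A^+ [P_A B]_k, where P_A = A A^+ projects
    onto the column space of A and [M]_k is the best rank-k
    approximation of M (truncated SVD).
    """
    A_pinv = np.linalg.pinv(A)            # A^+, shape (c, n)
    PB = A @ (A_pinv @ B)                 # P_A B: projection of B onto col(A)
    # Best rank-k approximation of P_A B via truncated SVD.
    U, s, Vt = np.linalg.svd(PB, full_matrices=False)
    PB_k = (U[:, :k] * s[:k]) @ Vt[:k, :]
    return A_pinv @ PB_k                  # X* has rank at most k

# Example usage on random data.
rng = np.random.default_rng(0)
n, c, d, k = 100, 20, 30, 5
A, B = rng.standard_normal((n, c)), rng.standard_normal((n, d))
X = frobenius_rank_k_regression(A, B, k)
assert np.linalg.matrix_rank(X) <= k
```

Note that the decomposition $\|AX - B\|_F^2 = \|AX - P_A B\|_F^2 + \|(I - P_A)B\|_F^2$ is what makes this two-step recipe (project, then truncate) optimal for the Frobenius norm; no analogous closed form is known for the operator norm.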
