Paper Information - Iterative Refinement for $\ell_p$-norm Regression

We give improved algorithms for the $\ell_{p}$-regression problem, $\min_{x} \|x\|_{p}$ such that $A x=b,$ for all $p \in (1,2) \cup (2,\infty).$ Our algorithms obtain a high-accuracy solution in $\tilde{O}_{p}(m^{\frac{|p-2|}{2p + |p-2|}}) \le \tilde{O}_{p}(m^{\frac{1}{3}})$ iterations, where each iteration requires solving an $m \times m$ linear system, $m$ being the dimension of the ambient space. By maintaining an approximate inverse of the linear systems that we solve in each iteration, we give algorithms for solving $\ell_{p}$-regression to $1 / \text{poly}(n)$ accuracy that run in time $\tilde{O}_p(m^{\max\{\omega, 7/3\}}),$ where $\omega$ is the matrix multiplication constant. For the current best value of $\omega > 2.37$, we have $\max\{\omega, 7/3\} = \omega$, so we can solve $\ell_{p}$-regression as fast as $\ell_{2}$-regression, for all constant $p$ bounded away from $1.$ Our algorithms can be combined with fast graph Laplacian linear equation solvers to give minimum $\ell_{p}$-norm flow / voltage solutions to $1 / \text{poly}(n)$ accuracy on an undirected graph with $m$ edges in $\tilde{O}_{p}(m^{1 + \frac{|p-2|}{2p + |p-2|}}) \le \tilde{O}_{p}(m^{\frac{4}{3}})$ time. For sparse graphs and for matrices with similar dimensions, our iteration counts and running times improve on the $p$-norm regression algorithm of [Bubeck-Cohen-Lee-Li STOC'18] and on general-purpose convex optimization algorithms. At the core of our algorithms is an iterative refinement scheme for $\ell_{p}$-norms, using the smoothed $\ell_{p}$-norms introduced in the work of Bubeck et al. Given an initial solution, we construct a problem that seeks to minimize a quadratically-smoothed $\ell_{p}$-norm over a subspace, such that a crude solution to this problem allows us to improve the initial solution by a constant factor, leading to algorithms with fast convergence.
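To make the exponents in the abstract concrete, the following minimal sketch evaluates $\frac{|p-2|}{2p + |p-2|}$ for a few values of $p$ and checks that it stays below $\frac{1}{3}$. It only does arithmetic on the stated bounds and is not part of the paper's algorithm; the function names `iteration_exponent` and `flow_time_exponent` are hypothetical and chosen here for illustration.

```python
from fractions import Fraction


def iteration_exponent(p):
    """Exponent |p-2| / (2p + |p-2|) from the stated iteration bound.

    Hypothetical helper: it evaluates the formula only and hides the
    p-dependent constants and log factors inside the O-tilde notation.
    """
    if p <= 1:
        raise ValueError("the bound is stated for p in (1, 2) or (2, inf)")
    gap = abs(p - 2)
    return gap / (2 * p + gap)


def flow_time_exponent(p):
    """Exponent 1 + |p-2| / (2p + |p-2|) from the stated graph running time."""
    return 1 + iteration_exponent(p)


# Exact rational arithmetic avoids rounding when comparing against 1/3.
for p in (Fraction(3, 2), Fraction(3), Fraction(4), Fraction(100)):
    e = iteration_exponent(p)
    assert e < Fraction(1, 3)
    print(f"p = {p}: iterations ~ m^({e}), flow time ~ m^({flow_time_exponent(p)})")
```

The exponent is below $\frac{1}{3}$ for every $p > 1$: for $p > 2$ it equals $\frac{p-2}{3p-2}$, and $3(p-2) < 3p-2$; for $1 < p < 2$ it equals $\frac{2-p}{p+2}$, and $3(2-p) < p+2$ exactly when $p > 1$. It approaches $\frac{1}{3}$ as $p \to \infty$ or $p \to 1$.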

[1] Yurii Nesterov,et al. Interior-point polynomial algorithms in convex programming , 1994, SIAM Studies in Applied Mathematics.

[2] Jonah Sherman,et al. Generalized Preconditioning and Undirected Minimum-Cost Flow , 2017, SODA.

[3] Zeyuan Allen Zhu,et al. A simple, combinatorial algorithm for solving SDD systems in nearly-linear time , 2013, STOC '13.

[4] Aleksander Madry,et al. Navigating Central Path with Electrical Flows: From Flows to Matchings, and Back , 2013, 2013 IEEE 54th Annual Symposium on Foundations of Computer Science.

[5] Yin Tat Lee,et al. Solving linear programs in the current matrix multiplication time , 2018, STOC.

[6] Gary L. Miller,et al. A Nearly-m log n Time Solver for SDD Linear Systems , 2011, 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science.

[7] Yin Tat Lee,et al. An Almost-Linear-Time Algorithm for Approximate Max Flow in Undirected Graphs, and its Multicommodity Generalizations , 2013, SODA.

[8] O. Axelsson. Iterative solution methods , 1995.

[9] Aleksander Madry,et al. Matrix Scaling and Balancing via Box Constrained Newton's Method and Interior Point Methods , 2017, 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS).

[10] Carl Tim Kelley,et al. Iterative methods for optimization , 1999, Frontiers in applied mathematics.

[11] Yin Tat Lee,et al. An homotopy method for lp regression provably beyond self-concordance and in input-sparsity time , 2018, STOC.

[12] Richard Peng,et al. An efficient parallel solver for SDD linear systems , 2013, STOC.

[13] Shang-Hua Teng,et al. The Laplacian Paradigm: Emerging Algorithms for Massive Graphs , 2010, TAMC.

[14] Andreas M. Tillmann. Equivalence of Linear Programming and Basis Pursuit , 2015.

[15] François Le Gall,et al. Powers of tensors and fast matrix multiplication , 2014, ISSAC.

[16] Chintan Shah,et al. Computing Cut-Based Hierarchical Decompositions in Almost Linear Time , 2014, SODA.

[17] Richard Peng,et al. On Fully Dynamic Graph Sparsifiers , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[18] Virginia Vassilevska Williams,et al. Multiplying matrices faster than Coppersmith-Winograd , 2012, STOC '12.

[19] Richard Peng,et al. Approximate Undirected Maximum Flows in O(mpolylog(n)) Time , 2014, SODA.

[20] Pravin M. Vaidya,et al. Speeding-up linear programming using fast matrix multiplication , 1989, 30th Annual Symposium on Foundations of Computer Science.

[21] Richard Peng,et al. Sparsified Cholesky and multigrid solvers for connection Laplacians , 2015, STOC.

[22] Gary L. Miller,et al. Approaching optimality for solving SDD linear systems , 2011.

[23] Piotr Sankowski,et al. Negative-Weight Shortest Paths and Unit Capacity Minimum Cost Flow in $\tilde{O}(m^{10/7} \log W)$ Time (Extended Abstract) , 2016, SODA.

[24] D. Keyes,et al. Jacobian-free Newton-Krylov methods: a survey of approaches and applications , 2004.

[25] Satish Rao,et al. A new approach to computing maximum flows using electrical flows , 2013, STOC '13.

[26] Aleksander Madry,et al. Computing Maximum Flow with Augmenting Electrical Flows , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[27] Anup Rao,et al. Fast, Provable Algorithms for Isotonic Regression in all L_p-norms , 2015, NIPS.

[28] Avi Wigderson,et al. Much Faster Algorithms for Matrix Scaling , 2017, 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS).

[29] Andreas M. Tillmann. Computational aspects of compressed sensing , 2014.

[30] Brian Bullins,et al. Fast minimization of structured convex quartics , 2018, 1812.10349.

[31] Shang-Hua Teng,et al. Electrical flows, Laplacian systems, and faster approximation of maximum flow in undirected graphs , 2010, STOC '11.

[32] Boaz Patt-Shamir,et al. Near-Optimal Distributed Maximum Flow: Extended Abstract , 2015, PODC.

[33] Sanjeev Arora,et al. The Multiplicative Weights Update Method: a Meta-Algorithm and Applications , 2012, Theory Comput..

[34] Richard Peng,et al. Lp Row Sampling by Lewis Weights , 2015, STOC.

[35] Jonah Sherman,et al. Nearly Maximum Flows in Nearly Linear Time , 2013, 2013 IEEE 54th Annual Symposium on Foundations of Computer Science.

[36] Gary L. Miller,et al. Runtime guarantees for regression problems , 2011, ITCS '13.

[37] Yin Tat Lee,et al. A Faster Cutting Plane Method and its Implications for Combinatorial and Convex Optimization , 2015, 2015 IEEE 56th Annual Symposium on Foundations of Computer Science.

[38] Sushant Sachdeva,et al. Approximate Gaussian Elimination for Laplacians - Fast, Sparse, and Simple , 2016, 2016 IEEE 57th Annual Symposium on Foundations of Computer Science (FOCS).

[39] Shang-Hua Teng,et al. Nearly-Linear Time Algorithms for Preconditioning and Solving Symmetric, Diagonally Dominant Linear Systems , 2006, SIAM J. Matrix Anal. Appl..

[40] Gary L. Miller,et al. Faster approximate multicommodity flow using quadratically coupled flows , 2012, STOC '12.

[41] Daniel A. Spielman,et al. Faster approximate lossy generalized flow via interior point algorithms , 2008, STOC.

[42] Yin Tat Lee,et al. Efficient Inverse Maintenance and Faster Algorithms for Linear Programming , 2015, 2015 IEEE 56th Annual Symposium on Foundations of Computer Science.

[43] Daniel A. Spielman,et al. Algorithms for Lipschitz Learning on Graphs , 2015, COLT.