On the Optimality of the Backward Greedy Algorithm for the Subset Selection Problem

The following linear inverse problem is considered: Given a full column rank m × n data matrix A, and a length m observation vector b, find the best least-squares solution to A x = b with at most r < n nonzero components. The backward greedy algorithm computes a sparse solution to A x = b by removing greedily columns from A until r columns are left. A simple implementation based on a QR downdating scheme using Givens rotations is described. The backward greedy algorithm is shown to be optimal for the subset selection problem in the sense that it selects the "correct" subset of columns from A if the perturbation of the data vector b is small enough. The results generalize to any other norm of the residual.

[1]  Balas K. Natarajan,et al.  Sparse Approximate Solutions to Linear Systems , 1995, SIAM J. Comput..

[2]  A. Atkinson Subset Selection in Regression , 1992 .

[3]  Å. Björck,et al.  Accurate Downdating of Least Squares Solutions , 1994, SIAM J. Matrix Anal. Appl..

[4]  Alan H. Feiveson Finding the Best Regression Subset by Reduction in Nonfull-Rank Cases , 1994 .

[5]  S. J. Reeves An efficient implementation of the backward greedy algorithm for sparse signal reconstruction , 1999, IEEE Signal Processing Letters.

[6]  Ahmed H. Tewfik,et al.  Optimal subset selection for adaptive signal representation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7]  George M. Furnival,et al.  Regressions by leaps and bounds , 2000 .

[8]  Yoram Bresler,et al.  Dictionary-based decomposition of linear mixtures of Gaussian processes , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  Yoram Bresler,et al.  A new algorithm for computing sparse solutions to linear inverse problems , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.