Revisiting model selection and recovery of sparse signals using one-step thresholding

This paper studies non-asymptotic model selection and recovery of sparse signals in high-dimensional, linear inference problems. In contrast to the existing literature, the focus here is on the general case of arbitrary design matrices and arbitrary nonzero entries of the signal. In this regard, it utilizes two easily computable measures of coherence—termed as the worst-case coherence and the average coherence—among the columns of a design matrix to analyze a simple, model-order agnostic one-step thresholding (OST) algorithm. In particular, the paper establishes that if the design matrix has reasonably small worst-case and average coherence then OST performs near-optimal model selection when either (i) the energy of any nonzero entry of the signal is close to the average signal energy per nonzero entry or (ii) the signal-to-noise ratio (SNR) in the measurement system is not too high. Further, the paper shows that if the design matrix in addition has sufficiently small spectral norm then OST also exactly recovers most sparse signals whose nonzero entries have approximately the same magnitude even if the number of nonzero entries scales almost linearly with the number of rows of the design matrix. Finally, the paper also presents various classes of random and deterministic design matrices that can be used together with OST to successfully carry out near-optimal model selection and recovery of sparse signals under certain SNR regimes or for certain classes of signals.

[1]  C. L. Mallows Some comments on C_p , 1973 .

[2]  Lloyd R. Welch,et al.  Lower bounds on the maximum cross correlation of signals (Corresp.) , 1974, IEEE Trans. Inf. Theory.

[3]  H. Akaike A new look at the statistical model identification , 1974 .

[4]  Jean-Marie Goethals,et al.  Alternating Bilinear Forms over GF(q) , 1975, J. Comb. Theory, Ser. A.

[5]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[6]  William O. Alltop,et al.  Complex sequences with low periodic correlations , 1980 .

[7]  William O. Alltop,et al.  Complex sequences with low periodic correlations (Corresp.) , 1980, IEEE Trans. Inf. Theory.

[8]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[9]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .

[10]  Dean P. Foster,et al.  The risk inflation criterion for multiple regression , 1994 .

[11]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[12]  S. Mallat,et al.  Adaptive greedy approximations , 1997 .

[13]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[14]  C. H. Oh,et al.  Some comments on , 1998 .

[15]  Colin L. Mallows,et al.  Some Comments on Cp , 2000, Technometrics.

[16]  Sudipto Guha,et al.  Near-optimal sparse fourier representations via sampling , 2002, STOC '02.

[17]  Thomas Strohmer,et al.  GRASSMANNIAN FRAMES WITH APPLICATIONS TO CODING AND COMMUNICATION , 2003, math/0301135.

[18]  J. Lawrence,et al.  Linear Independence of Gabor Systems in Finite Dimensional Vector Spaces , 2005 .

[19]  Peter G. Casazza,et al.  Fourier Transforms of Finite Chirps , 2006, EURASIP J. Adv. Signal Process..

[20]  Peng Zhao,et al.  On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..

[21]  R. Vershynin,et al.  One sketch for all: fast algorithms for compressed sensing , 2007, STOC '07.

[22]  Pierre Vandergheynst,et al.  Average Performance Analysis for Thresholding , 2007, IEEE Signal Processing Letters.

[23]  J. Tropp Norms of Random Submatrices and Sparse Approximation , 2008 .

[24]  Mike E. Davies,et al.  Iterative Hard Thresholding for Compressed Sensing , 2008, ArXiv.

[25]  E. Candès The restricted isometry property and its implications for compressed sensing , 2008 .

[26]  J. Tropp On the conditioning of random subdictionaries , 2008 .

[27]  Mark A. Iwen,et al.  A deterministic sub-linear time sparse fourier algorithm via non-adaptive compressed sensing methods , 2007, SODA '08.

[28]  Sundeep Rangan,et al.  Necessary and Sufficient Conditions for Sparsity Pattern Recovery , 2008, IEEE Transactions on Information Theory.

[29]  Martin J. Wainwright,et al.  Information-theoretic limits on sparsity recovery in the high-dimensional and noisy setting , 2009, IEEE Trans. Inf. Theory.

[30]  R. Calderbank,et al.  Chirp sensing codes: Deterministic compressed sensing measurements for fast recovery , 2009 .

[31]  Martin J. Wainwright,et al.  Information-Theoretic Limits on Sparsity Recovery in the High-Dimensional and Noisy Setting , 2007, IEEE Transactions on Information Theory.

[32]  E. Candès,et al.  Near-ideal model selection by ℓ1 minimization , 2008, 0801.0345.

[33]  L. Wasserman,et al.  Revisiting Marginal Regression , 2009, 0911.4080.

[34]  G. Reeves,et al.  A note on optimal support recovery in compressed sensing , 2009, 2009 Conference Record of the Forty-Third Asilomar Conference on Signals, Systems and Computers.

[35]  Martin J. Wainwright,et al.  Sharp Thresholds for High-Dimensional and Noisy Sparsity Recovery Using $\ell _{1}$ -Constrained Quadratic Programming (Lasso) , 2009, IEEE Transactions on Information Theory.

[36]  Olgica Milenkovic,et al.  Subspace Pursuit for Compressive Sensing Signal Reconstruction , 2008, IEEE Transactions on Information Theory.

[37]  M. Rudelson,et al.  Non-asymptotic theory of random matrices: extreme singular values , 2010, 1003.2990.

[38]  A. Robert Calderbank,et al.  Construction of a Large Class of Deterministic Sensing Matrices That Satisfy a Statistical Isometry Property , 2009, IEEE Journal of Selected Topics in Signal Processing.

[39]  A. Robert Calderbank,et al.  Why Gabor frames? Two fundamental measures of coherence and their role in model selection , 2010, Journal of Communications and Networks.

[40]  A. Robert Calderbank,et al.  Reed Muller Sensing Matrices and the LASSO , 2010, ArXiv.

[41]  A. Robert Calderbank,et al.  Model selection: Two fundamental measures of coherence and their algorithmic significance , 2009, 2010 IEEE International Symposium on Information Theory.

[42]  Deanna Needell,et al.  CoSaMP: Iterative signal recovery from incomplete and inaccurate samples , 2008, ArXiv.

[43]  O. Antoine,et al.  Theory of Error-correcting Codes , 2022 .