Structured sparsity: theorems, algorithms and applications

Today, sparsity techniques have been widely used to address practical problems in the fields of medical imaging, machine learning, computer vision, data mining, compressive sensing, image processing, video analysis and multimedia. We will briefly introduce the related sparsity techniques and their successful applications on compressive sensing, sparse learning, computer vision and medical imaging. Then, we propose a new concept called strong group sparsity to develop a theory for group Lasso, which shows that group Lasso is superior to standard Lasso for strongly group-sparse data. It provides a convincing theoretical justification for using group sparsity regularization when the underlying group structure is consistent with the data. Moreover, the theory also predicts the limitations of the group Lasso formulations. To address those limitations, we further build a new framework called structured sparsity, which is a natural extension of the standard sparsity concept in statistical learning and compressive sensing. By allowing arbitrary structures on the feature set or data, this concept generalizes the group sparsity idea. A general theory (Group-RIP) is developed for learning with structured sparsity, based on the notion of coding complexity associated with the structure, which guarantees better performance with more structure information other than pure sparsity. The new sparsity techniques under this framework have been successfully applied to different applications, such as compressive sensing MR imaging, video background subtraction, object tracking in visual surveillance, dynamic scene registration, automatic image annotation, medical image analysis, fast tag separation in cardiac tMRIs, cervigram image segmentation and so on. The improved experimental results in these applications further validate the nice theoretical guarantees of our new theorems and demonstrate the effectiveness of our new framework in the practical applications involved large scale data.

[1]  Stochastic Rigidity: Image Registration for Nowhere-Static Scenes , 2001, ICCV.

[2]  Junzhou Huang,et al.  Learning with dynamic group sparsity , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  Shmuel Peleg,et al.  Online Registration of Dynamic Scenes using Video Extrapolation , 2005 .

[4]  Nikos Paragios,et al.  Motion-based background subtraction using adaptive kernel density estimation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[5]  Antonio Criminisi,et al.  TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.

[6]  Antonio Criminisi,et al.  Single-Histogram Class Models for Image Segmentation , 2006, ICVGIP.

[7]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[8]  P. Tseng Applications of splitting algorithm to decomposition in convex programming and variational inequalities , 1991 .

[9]  Lawrence Carin,et al.  Bayesian Compressive Sensing , 2008, IEEE Transactions on Signal Processing.

[10]  Bhaskar D. Rao,et al.  An Empirical Bayesian Strategy for Solving the Simultaneous Sparse Approximation Problem , 2007, IEEE Transactions on Signal Processing.

[11]  R. Glowinski,et al.  Augmented Lagrangian and Operator-Splitting Methods in Nonlinear Mechanics , 1987 .

[12]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[13]  Kjersti Engan,et al.  Method of optimal directions for frame design , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[14]  Junzhou Huang,et al.  The Benefit of Group Sparsity , 2009 .

[15]  Aggelos K. Katsaggelos,et al.  Fast bayesian compressive sensing using Laplace priors , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[16]  Philip Schniter,et al.  Fast Bayesian Matching Pursuit: Model Uncertainty and Parameter Estimation for Sparse Linear Models , 2009 .

[17]  Kim-Chuan Toh,et al.  A coordinate gradient descent method for ℓ1-regularized convex minimization , 2011, Comput. Optim. Appl..

[18]  S. Mallat A wavelet tour of signal processing , 1998 .

[19]  Joel A. Tropp,et al.  Greed is good: algorithmic results for sparse approximation , 2004, IEEE Transactions on Information Theory.

[20]  Rick Chartrand,et al.  Exact Reconstruction of Sparse Signals via Nonconvex Minimization , 2007, IEEE Signal Processing Letters.

[21]  Mike E. Davies,et al.  Iterative Hard Thresholding for Compressed Sensing , 2008, ArXiv.

[22]  Matthias W. Seeger,et al.  Compressed sensing and Bayesian experimental design , 2008, ICML '08.

[23]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[24]  John Wright,et al.  Computation and Relaxation of Conditions for Equivalence between ` 1 and ` 0 Minimization ∗ , 2007 .

[25]  Tong Zhang Some sharp performance bounds for least squares regression with L1 regularization , 2009, 0908.2869.

[26]  Junzhou Huang,et al.  Fast Optimization for Mixture Prior Models , 2010, ECCV.

[27]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[28]  Francis R. Bach,et al.  Consistency of the group Lasso and multiple kernel learning , 2007, J. Mach. Learn. Res..

[29]  Jose Jeronimo,et al.  A probabilistic approach to segmentation and classification of neoplasia in uterine cervix images using color and geometric features , 2005, SPIE Medical Imaging.

[30]  Petia Radeva,et al.  Tag surface reconstruction and tracking of myocardial beads from SPAMM-MRI with parametric B-spline surfaces , 2001, IEEE Transactions on Medical Imaging.

[31]  Junzhou Huang,et al.  Tag Separation in Cardiac Tagged MRI , 2008, MICCAI.

[32]  A. Montillo,et al.  Use of Bandpass Gabor Filters for Enhancing Blood-Myocardium Contrast and Filling-in tags in tagged MR Images , 2004 .

[33]  Horst Bischof,et al.  Semi-supervised On-Line Boosting for Robust Tracking , 2008, ECCV.

[34]  Horst Bischof,et al.  On-line Boosting and Vision , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[35]  Gang Hua,et al.  Context-Aware Visual Tracking , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Michael I. Jordan,et al.  Union support recovery in high-dimensional multivariate regression , 2008, 2008 46th Annual Allerton Conference on Communication, Control, and Computing.

[37]  Pierre Morizet-Mahoudeaux,et al.  Hierarchical Penalization , 2007, NIPS.

[38]  Thong T. Do,et al.  Sparsity adaptive matching pursuit algorithm for practical compressed sensing , 2008, 2008 42nd Asilomar Conference on Signals, Systems and Computers.

[39]  Guillermo Sapiro,et al.  Discriminative learned dictionaries for local image analysis , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Ming-Hsuan Yang,et al.  Visual tracking with online Multiple Instance Learning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  S. Yun,et al.  An accelerated proximal gradient algorithm for nuclear norm regularized linear least squares problems , 2009 .

[42]  Mark W. Schmidt,et al.  GROUP SPARSITY VIA LINEAR-TIME PROJECTION , 2008 .

[43]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[44]  Jerry L. Prince,et al.  Tag and contour detection in tagged MR images of the left ventricle , 1994, IEEE Trans. Medical Imaging.

[45]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[46]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[47]  G. Pisier The volume of convex bodies and Banach space geometry , 1989 .

[48]  Mário A. T. Figueiredo,et al.  Gradient Projection for Sparse Reconstruction: Application to Compressed Sensing and Other Inverse Problems , 2007, IEEE Journal of Selected Topics in Signal Processing.

[49]  P. L. Combettes,et al.  A proximal decomposition method for solving convex variational inverse problems , 2008, 0807.2617.

[50]  Xiaolei Huang,et al.  Combining multiple 2ν-SVM classifiers for tissue segmentation , 2008, 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[51]  Volkan Cevher,et al.  Model-Based Compressive Sensing , 2008, IEEE Transactions on Information Theory.

[52]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[53]  Xiaoming Yuan,et al.  Sparse and low-rank matrix decomposition via alternating direction method , 2013 .

[54]  Shiri Gordon,et al.  Content analysis of uterine cervix images: initial steps toward content based indexing and retrieval of cervigrams , 2006, SPIE Medical Imaging.

[55]  Ferdinand van der Heijden,et al.  Efficient adaptive density estimation per image pixel for the task of background subtraction , 2006, Pattern Recognit. Lett..

[56]  Minh N. Do,et al.  A Theory for Sampling Signals from a Union of Subspaces , 2022 .

[57]  David J. Kriegman,et al.  From Few to Many: Illumination Cone Models for Face Recognition under Variable Lighting and Pose , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  William M. Wells,et al.  Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation , 2004, IEEE Transactions on Medical Imaging.

[59]  Mike E. Davies,et al.  Sampling Theorems for Signals From the Union of Finite-Dimensional Linear Subspaces , 2009, IEEE Transactions on Information Theory.

[60]  Richard G. Baraniuk,et al.  Bayesian Compressive Sensing Via Belief Propagation , 2008, IEEE Transactions on Signal Processing.

[61]  David B. Dunson,et al.  Multi-Task Compressive Sensing , 2007 .

[62]  Ming Yuan,et al.  Sparse Recovery in Large Ensembles of Kernel Machines On-Line Learning and Bandits , 2008, COLT.

[63]  Nikos Paragios,et al.  Background modeling and subtraction of dynamic scenes , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[64]  Benar Fux Svaiter,et al.  General Projective Splitting Methods for Sums of Maximal Monotone Operators , 2009, SIAM J. Control. Optim..

[65]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[66]  Wotao Yin,et al.  Alternating direction augmented Lagrangian methods for semidefinite programming , 2010, Math. Program. Comput..

[67]  Xiaolei Huang,et al.  Distance guided selection of the best base classifier in an ensemble with application to cervigram image segmentation , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[68]  J. Spingarn Partial inverse of a monotone operator , 1983 .

[69]  Stephen J. Wright,et al.  Sparse Reconstruction by Separable Approximation , 2008, IEEE Transactions on Signal Processing.

[70]  P. Zhao,et al.  Grouped and Hierarchical Model Selection through Composite Absolute Penalties , 2007 .

[71]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[72]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[73]  Deanna Needell,et al.  CoSaMP: Iterative signal recovery from incomplete and inaccurate samples , 2008, ArXiv.

[74]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[75]  Emmanuel J. Candès,et al.  Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[76]  Xiaoming Yuan,et al.  Alternating Direction Methods for Sparse Covariance Selection * , 2009 .

[77]  Franz Rendl,et al.  Regularization Methods for Semidefinite Programming , 2009, SIAM J. Optim..

[78]  Dimitris N. Metaxas,et al.  MetaMorphs: Deformable shape and texture models , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[79]  E.J. Candes Compressive Sampling , 2022 .

[80]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[81]  Olgica Milenkovic,et al.  Subspace Pursuit for Compressive Sensing: Closing the Gap Between Performance and Complexity , 2008, ArXiv.

[82]  Babak Hassibi,et al.  On the Reconstruction of Block-Sparse Signals With an Optimal Number of Measurements , 2008, IEEE Transactions on Signal Processing.

[83]  Bruno Torrésani,et al.  Random Models for Sparse Signals Expansion on Unions of Bases With Application to Audio Signals , 2008, IEEE Transactions on Signal Processing.

[84]  Amnon Shashua,et al.  Manifold pursuit: a new approach to appearance based recognition , 2002, Object recognition supported by user interaction for service robots.

[85]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[86]  Marco F. Duarte,et al.  Compressive sensing recovery of spike trains using a structured sparsity model , 2009 .

[87]  Joel A. Tropp,et al.  ALGORITHMS FOR SIMULTANEOUS SPARSE APPROXIMATION , 2006 .

[88]  A. Rinaldo,et al.  On the asymptotic properties of the group lasso estimator for linear models , 2008 .

[89]  R. DeVore,et al.  A Simple Proof of the Restricted Isometry Property for Random Matrices , 2008 .

[90]  Junzhou Huang,et al.  Efficient MR image reconstruction for compressed MR imaging , 2011, Medical Image Anal..

[91]  B. Mercier,et al.  A dual algorithm for the solution of nonlinear variational problems via finite element approximation , 1976 .

[92]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[93]  Junzhou Huang,et al.  Optimization and Learning for Registration of Moving Dynamic Textures , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[94]  Junfeng Yang,et al.  A Fast Alternating Direction Method for TVL1-L2 Signal Reconstruction From Partial Fourier Data , 2010, IEEE Journal of Selected Topics in Signal Processing.

[95]  Yaakov Tsaig,et al.  Fast Solution of $\ell _{1}$ -Norm Minimization Problems When the Solution May Be Sparse , 2008, IEEE Transactions on Information Theory.

[96]  Michael P. Friedlander,et al.  Probing the Pareto Frontier for Basis Pursuit Solutions , 2008, SIAM J. Sci. Comput..

[97]  Richard G. Baraniuk,et al.  Random Projections of Smooth Manifolds , 2009, Found. Comput. Math..

[98]  D. Donoho,et al.  Sparse MRI: The application of compressed sensing for rapid MR imaging , 2007, Magnetic resonance in medicine.

[99]  Junzhou Huang,et al.  Learning with structured sparsity , 2009, ICML '09.

[100]  P. L. Combettes Iterative construction of the resolvent of a sum of maximal monotone operators , 2009 .

[101]  Rick Chartrand,et al.  Fast algorithms for nonconvex compressive sensing: MRI reconstruction from very few data , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[102]  Bruno Torrésani,et al.  Sparsity and persistence: mixed norms provide simple signal models with dependent coefficients , 2009, Signal Image Video Process..

[103]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[104]  Y. Nesterov A method for solving the convex programming problem with convergence rate O(1/k^2) , 1983 .

[105]  Armando Manduca,et al.  Highly Undersampled Magnetic Resonance Image Reconstruction via Homotopic $\ell_{0}$ -Minimization , 2009, IEEE Transactions on Medical Imaging.

[106]  Y. Nesterov Gradient methods for minimizing composite objective function , 2007 .

[107]  Tong Zhang,et al.  Adaptive Forward-Backward Greedy Algorithm for Learning Sparse Representations , 2011, IEEE Transactions on Information Theory.

[108]  René Vidal,et al.  Optical flow estimation & segmentation of multiple moving dynamic textures , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[109]  Joel A. Tropp,et al.  Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[110]  Jose Jeronimo,et al.  Tissue classification using cluster features for lesion detection in digital cervigrams , 2008, SPIE Medical Imaging.

[111]  Emmanuel J. Candès,et al.  Near-Optimal Signal Recovery From Random Projections: Universal Encoding Strategies? , 2004, IEEE Transactions on Information Theory.

[112]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[113]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[114]  David Grimm,et al.  A note on the representation of positive polynomials with structured sparsity , 2006, math/0611498.

[115]  Stefano Soatto,et al.  Dynamic Textures , 2003, International Journal of Computer Vision.

[116]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevance Vector Machine , 2001 .

[117]  Larry Wasserman,et al.  Structured sparsity , 2012 .

[118]  Tong Zhang,et al.  Multi-stage Convex Relaxation for Learning with Sparse Regularization , 2008, NIPS.

[119]  P. Bickel,et al.  SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR , 2008, 0801.1095.

[120]  Jean-Philippe Vert,et al.  Group lasso with overlap and graph lasso , 2009, ICML '09.

[121]  Paul Tseng,et al.  A Modified Forward-backward Splitting Method for Maximal Monotone Mappings 1 , 1998 .

[122]  B. Torrésani,et al.  Structured Sparsity: from Mixed Norms to Structured Shrinkage , 2009 .

[123]  Michael Elad,et al.  Submitted to Ieee Transactions on Image Processing Image Decomposition via the Combination of Sparse Representations and a Variational Approach , 2022 .

[124]  J. C. Ye,et al.  Projection reconstruction MR imaging using FOCUSS , 2007, Magnetic resonance in medicine.

[125]  D. Gabay Applications of the method of multipliers to variational inequalities , 1983 .

[126]  Yaser Sheikh,et al.  Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[127]  M. Kowalski Sparse regression using mixed norms , 2009 .

[128]  Bingsheng He,et al.  A new inexact alternating directions method for monotone variational inequalities , 2002, Math. Program..

[129]  Junfeng Yang,et al.  A New Alternating Minimization Algorithm for Total Variation Image Reconstruction , 2008, SIAM J. Imaging Sci..

[130]  Shelly Lotenberg,et al.  Shape Priors for Segmentation of the Cervix Region Within Uterine Cervix Images , 2008, SPIE Medical Imaging.

[131]  Dimitris N. Metaxas,et al.  Automated Segmentation of the Left and Right Ventricles in 4D Cardiac SPAMM Images , 2002, MICCAI.

[132]  L. Carin,et al.  Exploiting Structure in Compressive Sensing with a JPEG Basis , 2009 .

[133]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[134]  Sameer Antani,et al.  Digital Tools for Collecting Data from Cervigrams for Research and Training in Colposcopy , 2006, Journal of lower genital tract disease.

[135]  Gérard G. Medioni,et al.  Online Tracking and Reacquisition Using Co-trained Generative and Discriminative Trackers , 2008, ECCV.

[136]  David L. Donoho,et al.  Sparse Solution Of Underdetermined Linear Equations By Stagewise Orthogonal Matching Pursuit , 2006 .

[137]  Lawrence Carin,et al.  Exploiting Structure in Wavelet-Based Bayesian Compressive Sensing , 2009, IEEE Transactions on Signal Processing.

[138]  Marc Teboulle,et al.  Fast Gradient-Based Algorithms for Constrained Total Variation Image Denoising and Deblurring Problems , 2009, IEEE Transactions on Image Processing.

[139]  Paul Tseng,et al.  Block coordinate relaxation methods for nonparamatric signal denoising , 1998, Defense, Security, and Sensing.

[140]  Shiqian Ma,et al.  Fast Multiple-Splitting Algorithms for Convex Optimization , 2009, SIAM J. Optim..

[141]  Patrick L. Combettes,et al.  Signal Recovery by Proximal Forward-Backward Splitting , 2005, Multiscale Model. Simul..

[142]  Jieping Ye,et al.  An accelerated gradient method for trace norm minimization , 2009, ICML '09.

[143]  Shiqian Ma,et al.  An efficient algorithm for compressed MR imaging using total variation and wavelets , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[144]  Stan Sclaroff,et al.  Segmenting foreground objects from a dynamic textured background via a robust Kalman filter , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[145]  Jerry L Prince,et al.  Cardiac motion tracking using CINE harmonic phase (HARP) magnetic resonance imaging , 1999, Magnetic resonance in medicine.