Globally Variance-Constrained Sparse Representation and Its Application in Image Set Coding

Sparse representation approximately recovers a signal as a linear combination of a few bases from a learnt dictionary, and many successful applications have been built on it. In data compression, however, its efficiency and popularity are hindered, because encoding sparsely distributed coefficients may consume many bits to represent the indices of the nonzero coefficients. Introducing an accurate rate constraint into sparse coding and dictionary learning is therefore meaningful, yet it has not been fully exploited in the context of sparse representation. According to the Shannon entropy inequality, the variance of Gaussian distributed data bounds its entropy, which indicates that the actual bitrate can be well estimated from the variance. Hence, this paper proposes a globally variance-constrained sparse representation (GVCSR) model, in which a variance-constrained rate term is introduced into the optimization process. Specifically, we employ the alternating direction method of multipliers (ADMM) to solve the non-convex optimization problems of sparse coding and dictionary learning, both of which show state-of-the-art rate-distortion performance for image representation. Furthermore, we investigate the potential of applying the GVCSR algorithm to practical image set compression, where the optimized dictionary is trained to efficiently represent images captured in similar scenarios by implicitly exploiting inter-image correlations. Experimental results demonstrate superior rate-distortion performance compared with state-of-the-art methods.
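The claim that variance bounds entropy can be illustrated with a small numerical sketch. For a fixed variance, the Gaussian has the largest differential entropy, 0.5 * log2(2 * pi * e * variance) bits, so the variance alone gives an upper estimate of the rate. The code below is not the GVCSR algorithm; it only compares that variance-based estimate with the empirical entropy of uniformly quantized samples. The function names, the quantization step, and the high-resolution approximation (discrete entropy roughly equal to differential entropy minus log2 of the step) are assumptions made for illustration.

```python
import math
import random
from collections import Counter
from statistics import pvariance


def gaussian_entropy_bound(variance):
    """Largest differential entropy (bits) of any real variable with this variance."""
    return 0.5 * math.log2(2.0 * math.pi * math.e * variance)


def variance_rate_estimate(samples, step):
    """Bits per sample predicted from the variance alone.

    Assumes uniform quantization with the given step and the
    high-resolution approximation H ~ h - log2(step).
    """
    var = pvariance(samples)
    if var <= 0.0:
        return 0.0  # constant data carries no information
    return max(0.0, gaussian_entropy_bound(var) - math.log2(step))


def empirical_rate(samples, step):
    """First-order empirical entropy (bits per sample) of the quantized samples."""
    symbols = [round(x / step) for x in samples]
    n = len(symbols)
    return -sum((c / n) * math.log2(c / n) for c in Counter(symbols).values())


def draw(kind, scale, n, seed=0):
    """Draw n zero-mean samples; `scale` is sigma (Gaussian) or b (Laplacian)."""
    rng = random.Random(seed)
    if kind == "gaussian":
        return [rng.gauss(0.0, scale) for _ in range(n)]
    # Laplacian: exponential magnitude with a random sign.
    return [rng.choice((-1.0, 1.0)) * rng.expovariate(1.0 / scale) for _ in range(n)]


if __name__ == "__main__":
    for kind, scale in (("gaussian", 4.0), ("laplacian", 2.0)):
        x = draw(kind, scale, 20000)
        print(kind,
              "empirical:", round(empirical_rate(x, 1.0), 3),
              "variance-based estimate:", round(variance_rate_estimate(x, 1.0), 3))
```

For Gaussian samples the two numbers should nearly coincide, while for Laplacian samples (a common model for sparse coefficients) the variance-based value should sit slightly above the empirical entropy, which is what makes it usable as a conservative rate term.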
