Approximation and Compression With Sparse Orthonormal Transforms

We propose a new transform design method that targets the generation of compression-optimized transforms for next-generation multimedia applications. The fundamental idea behind transform compression is to exploit regularity within signals such that redundancy is minimized subject to a fidelity cost. Multimedia signals, in particular images and video, are well known to contain a diverse set of localized structures, leading to many different types of regularity and to nonstationary signal statistics. The proposed method designs sparse orthonormal transforms (SOTs) that automatically exploit regularity over different signal structures and provides an adaptation method that determines the best representation over localized regions. Unlike earlier work that is motivated by linear approximation constructs and model-based designs that are limited to specific types of signal regularity, our work uses general nonlinear approximation ideas and a data-driven setup to significantly broaden its reach. We show that our SOT designs provide a safe and principled extension of the Karhunen-Loeve transform (KLT) by reducing to the KLT on Gaussian processes and by automatically exploiting non-Gaussian statistics to significantly improve over the KLT on more general processes. We provide an algebraic optimization framework that generates optimized designs for any desired transform structure (multiresolution, block, lapped, and so on) with significantly better n-term approximation performance. For each structure, we propose a new prototype codec and test over a database of images. Simulation results show consistent increase in compression and approximation performance compared with conventional methods.

[1]  Baltasar Beferull-Lozano,et al.  Directionlets: anisotropic multidirectional representation with separable filtering , 2006, IEEE Transactions on Image Processing.

[2]  Onur G. Guleryuz,et al.  Sparse orthonormal transforms for image compression , 2008, 2008 15th IEEE International Conference on Image Processing.

[3]  S. Mallat A wavelet tour of signal processing , 1998 .

[4]  Michael T. Orchard,et al.  Space-frequency quantization for wavelet image coding , 1997, IEEE Trans. Image Process..

[5]  Mohamed-Jalal Fadili,et al.  Curvelets and Ridgelets , 2009, Encyclopedia of Complexity and Systems Science.

[6]  Vivek K. Goyal,et al.  Theoretical foundations of transform coding , 2001, IEEE Signal Process. Mag..

[7]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[8]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[9]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[10]  Rémi Gribonval,et al.  Learning unions of orthonormal bases with thresholded singular value decomposition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[11]  D. Lorenz,et al.  Convergence rates and source conditions for Tikhonov regularization with sparsity constraints , 2008, 0801.1774.

[12]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[13]  Michael W. Marcellin,et al.  JPEG2000 - image compression fundamentals, standards and practice , 2002, The Kluwer International Series in Engineering and Computer Science.

[14]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[15]  Martin Vetterli,et al.  Rate Distortion Behavior of Sparse Sources , 2012, IEEE Transactions on Information Theory.

[16]  Martin Vetterli,et al.  Wavelets, approximation, and compression , 2001, IEEE Signal Process. Mag..

[17]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[18]  Michael W. Marcellin,et al.  JPEG2000 - image compression fundamentals, standards and practice , 2013, The Kluwer international series in engineering and computer science.

[19]  Stéphane Mallat,et al.  Sparse geometric image representations with bandelets , 2005, IEEE Transactions on Image Processing.

[20]  Emmanuel J. Candès,et al.  Modern statistical estimation via oracle inequalities , 2006, Acta Numerica.

[21]  Michael T. Orchard,et al.  On the importance of combining wavelet-based nonlinear approximation with coding strategies , 2002, IEEE Trans. Inf. Theory.

[22]  Mário A. T. Figueiredo,et al.  Class-adapted image compression using independent component analysis , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[23]  Bernd Girod,et al.  Direction-adaptive partitioned block transform for image coding , 2008, 2008 15th IEEE International Conference on Image Processing.

[24]  Yonina C. Eldar,et al.  Collaborative hierarchical sparse modeling , 2010, 2010 44th Annual Conference on Information Sciences and Systems (CISS).

[25]  Henrique S. Malvar Biorthogonal and nonuniform lapped transforms for transform coding with reduced blocking and ringing artifacts , 1998, IEEE Trans. Signal Process..

[26]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[27]  Avideh Zakhor,et al.  Corrections to "matching pursuit video coding-part I: dictionary approximation" , 2002, IEEE Trans. Circuits Syst. Video Technol..

[28]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[29]  Didier Sornette,et al.  Encyclopedia of Complexity and Systems Science , 2009 .

[30]  E. Candès,et al.  Curvelets: A Surprisingly Effective Nonadaptive Representation for Objects with Edges , 2000 .

[31]  Martin Vetterli,et al.  Data Compression and Harmonic Analysis , 1998, IEEE Trans. Inf. Theory.

[32]  David L. Donoho,et al.  Ridge Functions and Orthonormal Ridgelets , 2001, J. Approx. Theory.

[33]  Marion Kee,et al.  Analysis , 2004, Machine Translation.

[34]  Michael T. Orchard,et al.  Optimized nonorthogonal transforms for image compression , 1997, IEEE Trans. Image Process..

[35]  Avideh Zakhor,et al.  Matching pursuit video coding .I. Dictionary approximation , 2002, IEEE Trans. Circuits Syst. Video Technol..

[36]  Ivan W. Selesnick,et al.  A diagonally-oriented DCT-like 2D block transform , 2011, Optical Engineering + Applications.

[37]  Justin K. Romberg,et al.  Wavelet-domain approximation and compression of piecewise smooth images , 2006, IEEE Transactions on Image Processing.

[38]  Stéphane Mallat,et al.  A Wavelet Tour of Signal Processing - The Sparse Way, 3rd Edition , 2008 .

[39]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[40]  Jianqin Zhou,et al.  On discrete cosine transform , 2011, ArXiv.

[41]  Erwin Lutwak,et al.  Information-theoretic inequalities for contoured probability distributions , 2002, IEEE Trans. Inf. Theory.

[42]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[43]  Yücel Altunbasak,et al.  A sparsity-distortion-optimized multiscale representation of geometry , 2010, 2010 IEEE International Conference on Image Processing.

[44]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[45]  Christine Guillemot,et al.  Sparse optimization with directional DCT bases for image compression , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[46]  Albert Cohen,et al.  Nonlinear Approximation of Random Functions , 1997, SIAM J. Appl. Math..

[47]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[48]  Pierre Moulin,et al.  A multiscale relaxation algorithm for SNR maximization in nonorthogonal subband coding , 1995, IEEE Trans. Image Process..

[49]  Michael Elad,et al.  K-SVD and its non-negative variant for dictionary design , 2005, SPIE Optics + Photonics.

[50]  Stéphane Mallat,et al.  Discrete bandelets with geometric orthogonal filters , 2005, IEEE International Conference on Image Processing 2005.

[51]  Bing Zeng,et al.  Directional Discrete Cosine Transforms—A New Framework for Image Coding , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[52]  Z. Xiong,et al.  A DCT-based embedded image coder , 1996, IEEE Signal Processing Letters.

[53]  I. Daubechies,et al.  Biorthogonal bases of compactly supported wavelets , 1992 .

[54]  Anthony Vetro,et al.  Robust Learning of 2-D Separable Transforms for Next-Generation Video Coding , 2011, 2011 Data Compression Conference.

[55]  Michael Elad,et al.  Stable recovery of sparse overcomplete representations in the presence of noise , 2006, IEEE Transactions on Information Theory.

[56]  Christine Guillemot,et al.  Image compression using the Iteration-Tuned and Aligned Dictionary , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[57]  R. Rockafellar Convex Analysis: (pms-28) , 1970 .

[58]  Minh N. Do,et al.  Ieee Transactions on Image Processing the Contourlet Transform: an Efficient Directional Multiresolution Image Representation , 2022 .

[59]  J. Woods,et al.  Probability and Random Processes with Applications to Signal Processing , 2001 .