Anisotropic multiscale sparse learned bases for image compression

This paper proposes a new compression algorithm based on multi-scale learned bases. We first explain the construction of a set of image bases using a bintree segmentation and the optimization procedure used to select the image basis from this set. We then present the sparse orthonormal transforms introduced by Sezer et al.1 and propose some extensions tending to improve the convergence of the learning algorithm on the one hand and to adapt the transforms to the coding scheme used on the other hand. Comparisons in terms of rate-distortion performance are finally made with the current compression standards JPEG and JPEG2000.

[1]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[2]  Stéphane Mallat,et al.  Bandelet Image Approximation and Compression , 2005, Multiscale Model. Simul..

[3]  K Ramchandran,et al.  Best wavelet packet bases in a rate-distortion sense , 1993, IEEE Trans. Image Process..

[4]  Minh N. Do,et al.  Contourlets: a directional multiresolution image representation , 2002, Proceedings. International Conference on Image Processing.

[5]  Onur G. Guleryuz,et al.  Sparse orthonormal transforms for image compression , 2008, 2008 15th IEEE International Conference on Image Processing.

[6]  Rémi Gribonval,et al.  Learning unions of orthonormal bases with thresholded singular value decomposition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[7]  Jaakko Astola,et al.  Image compression using multiple transforms , 2000, Signal Process. Image Commun..

[8]  Stéphane Mallat,et al.  Sparse geometric image representations with bandelets , 2005, IEEE Transactions on Image Processing.

[9]  E. Candès,et al.  Curvelets: A Surprisingly Effective Nonadaptive Representation for Objects with Edges , 2000 .

[10]  François G. Meyer Image compression with adaptive local cosines: a comparative study , 2002, IEEE Trans. Image Process..

[11]  C.-T. Chen,et al.  Adaptive transform coding via quadtree-based variable blocksize DCT , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[12]  Stéphane Mallat,et al.  Analysis of low bit rate image transform coding , 1998, IEEE Trans. Signal Process..

[13]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[14]  Bing Zeng,et al.  Directional Discrete Cosine Transforms—A New Framework for Image Coding , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..