论文信息 - Adaptive RD Optimal Sparse Coding With Quantization for Image Compression

Adaptive RD Optimal Sparse Coding With Quantization for Image Compression

In image and video compression for many multimedia applications, an image/frame is divided into component blocks or patches and is then encoded using some type of transform. Traditional transforms use a complete dictionary of basis functions. A recent technique of growing interest is signal approximation using a linear combination of basis functions from an overcomplete dictionary. The result is a sparse set of coefficients that can represent the original signal and is called sparse coding. This is an NP-hard problem. Orthogonal matching pursuit is a greedy algorithm that is effectively used to address this problem. Keeping in mind the iterative nature of this algorithm, in a recent conference publication, we proposed a rate distortion optimization (RDO) method to select the best sparse representation among iterations up to a target sparsity level. In this paper, we expand the work and consider an adaptive coding scheme that takes advantage of both discrete cosine transform (DCT) and sparse coding. This scheme shows a better performance over plain DCT or sparse coding schemes. We further propose a scheme to increase the coding efficiency of sparse coding by quantizing the sparse coefficients. We investigate an RDO method to select the value of the quantization parameter from a range, balancing distortion, and bit rate. Based on experimental results, we provide a comparison between conventional DCT-based coding, sparse coding scheme, our mixed coding scheme, and the proposed method that includes quantization of the sparse coefficients.

Jianhua Zheng | Madhusudan Kalluri | Minqiang Jiang | Nam Ling | Philipp Zhang

[1] Weisi Lin,et al. Image Sharpness Assessment by Sparse Representation , 2016, IEEE Transactions on Multimedia.

[2] Stéphane Mallat,et al. Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[3] Gregory K. Wallace,et al. The JPEG still picture compression standard , 1992 .

[4] Christine Guillemot,et al. A complementary matching pursuit algorithm for sparse approximation , 2008, 2008 16th European Signal Processing Conference.

[5] M. Elad,et al. $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[6] Jianhua Zheng,et al. An approach to image compression using R-D optimal OMP selection , 2016, 2016 IEEE International Symposium on Circuits and Systems (ISCAS).

[7] Avideh Zakhor,et al. Very low bit-rate video coding based on matching pursuits , 1997, IEEE Trans. Circuits Syst. Video Technol..

[8] C.-C. Jay Kuo,et al. Efficient dictionary based video coding with reduced side information , 2011, 2011 IEEE International Symposium of Circuits and Systems (ISCAS).

[9] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[10] I. Horev,et al. Adaptive image compression using sparse dictionaries , 2012, 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP).

[11] Tanaya Guha,et al. Image Similarity Using Sparse Representation and Compression Distance , 2012, IEEE Transactions on Multimedia.

[12] Pascal Frossard,et al. Low-rate and flexible image coding with redundant representations , 2006, IEEE Transactions on Image Processing.

[13] Y. C. Pati,et al. Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[14] Avideh Zakhor,et al. Modulus quantization for matching-pursuit video coding , 2000, IEEE Trans. Circuits Syst. Video Technol..

[15] S. Mallat,et al. Adaptive greedy approximations , 1997 .

[16] Michael Elad,et al. E-cient Implementation of the K-SVD Algorithm and the Batch-OMP Method , 2008 .

[17] Pascal Frossard,et al. A posteriori quantization of progressive matching pursuit streams , 2004, IEEE Transactions on Signal Processing.

[18] Moncef Gabbouj,et al. Sparse/DCT (S/DCT) Two-Layered Representation of Prediction Residuals for Video Coding , 2013, IEEE Transactions on Image Processing.

[19] Zhiliang Zhu,et al. Fast Single Image Super-Resolution via Self-Example Learning and Sparse Representation , 2014, IEEE Transactions on Multimedia.