Video Coding Using Spatially Varying Transform

In this paper, a novel algorithm called spatially varying transform (SVT) is proposed to improve the coding efficiency of video coders. SVT enables video coders to vary the position of the transform block, unlike state-of-the-art video codecs, where the position of the transform block is fixed. In addition to the position, the size of the transform can also be varied within the SVT framework to better localize the prediction error, so that the underlying correlations are better exploited. It is shown in this paper that varying the position and size of the transform block better localizes the characteristics of the prediction error and thus improves coding efficiency. The proposed algorithm is implemented and studied in the H.264/AVC framework. We show that the proposed algorithm achieves an average bitrate reduction of 5.85% compared to H.264/AVC over a broad test set. Gains become more significant at medium to high bitrates for most tested sequences, and the bitrate reduction may reach 13.50%, which makes the proposed algorithm well suited to future video coding solutions focusing on high-fidelity video applications. The gain in coding efficiency is achieved with similar decoding complexity, which makes the proposed algorithm easy to incorporate into video codecs. However, the encoding complexity of SVT can be relatively high, because a number of rate-distortion optimization (RDO) steps are needed to select the best location parameter (LP), which indicates the position of the transform. In this paper, a novel low-complexity algorithm, operating at the macroblock and block levels, is also proposed to reduce the encoding complexity of SVT. Experimental results show that the proposed low-complexity algorithm can reduce the number of LPs to be tested in RDO by about 80% with only a marginal penalty in coding efficiency.
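
The LP selection described above can be illustrated with a toy exhaustive search. This is only a sketch under stated assumptions, not the algorithm evaluated in the paper: the function names (`block_energy`, `select_location`), the cost model (distortion taken as the residual energy left outside the transform block, rate taken as the count of nonzero samples inside it plus a fixed `lp_bits` overhead), and the default values of `lam` and `lp_bits` are all hypothetical. A real encoder would transform, quantize, and entropy-code each candidate to obtain its rate and distortion.

```python
def block_energy(residual, x0, y0, size):
    """Sum of squared residual samples inside a size x size block at (x0, y0)."""
    return sum(residual[y][x] ** 2
               for y in range(y0, y0 + size)
               for x in range(x0, x0 + size))


def select_location(residual, size=8, lam=1.0, lp_bits=6):
    """Exhaustively test every block position and return (cost, x0, y0).

    Toy cost J = D + lam * R (both terms are assumptions of this sketch):
      D = residual energy left outside the transform block,
      R = nonzero samples inside the block + lp_bits for signalling the LP.
    """
    n = len(residual)  # residual is assumed to be an n x n list of lists
    total = block_energy(residual, 0, 0, n)
    best = None
    for y0 in range(n - size + 1):
        for x0 in range(n - size + 1):
            distortion = total - block_energy(residual, x0, y0, size)
            nonzero = sum(1
                          for y in range(y0, y0 + size)
                          for x in range(x0, x0 + size)
                          if residual[y][x] != 0)
            cost = distortion + lam * (nonzero + lp_bits)
            if best is None or cost < best[0]:
                best = (cost, x0, y0)
    return best
```

With a 16x16 residual and an 8x8 block, this loop evaluates 81 candidate positions per macroblock, which gives a sense of why reducing the number of LPs tested in RDO matters for encoder complexity.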
