Adaptive transform skipping for improved coding of motion compensated residuals

New generations of video compression algorithms, such as those in the High Efficiency Video Coding (HEVC) standard currently under development, provide substantially higher compression than their predecessors. The gain is achieved by improved prediction of pixels, both within a frame and between frames. The novel coding tools that contribute to this gain produce highly uncorrelated prediction residuals, for which classical frequency decomposition methods such as the discrete cosine transform may not supply a compact representation with few significant coefficients. To further increase the compression gains, this paper proposes transform skip modes that allow one or both of the 1-D constituent transforms (i.e., vertical and horizontal) to be skipped, which suits sparse residuals better than always applying the full 2-D transform. The proposed transform skip modes are tested in the HEVC codec and provide bitrate reductions of up to 10% at the same objective quality when compared with the application of 2-D block transforms only. Moreover, the proposed modes outperform the full transform skip currently under investigation for possible adoption in the HEVC standard.
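The idea of skipping one or both 1-D transforms can be illustrated with a small sketch. It is not the paper's encoder: the mode names, the floating-point orthonormal DCT-II (HEVC uses integer approximations), the significance threshold, and the selection rule (fewest significant coefficients rather than a rate-distortion cost) are all assumptions chosen to keep the example short.

```python
import math

# Hypothetical mode names; the paper does not prescribe these identifiers.
MODES = ("2d", "skip_horizontal", "skip_vertical", "skip_both")

def dct_matrix(n):
    """Orthonormal DCT-II basis, one basis vector per row."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def transpose(m):
    return [list(r) for r in zip(*m)]

def apply_mode(block, mode):
    """Apply the 1-D transforms that the given mode does not skip."""
    d = dct_matrix(len(block))
    out = block
    if mode in ("2d", "skip_horizontal"):   # vertical transform: D * X
        out = matmul(d, out)
    if mode in ("2d", "skip_vertical"):     # horizontal transform: X * D^T
        out = matmul(out, transpose(d))
    return out

def count_significant(coeffs, threshold=0.5):
    """Number of coefficients whose magnitude exceeds the assumed threshold."""
    return sum(1 for row in coeffs for c in row if abs(c) > threshold)

def best_mode(block):
    """Pick the mode with the fewest significant coefficients.

    Assumption: a real encoder would compare rate-distortion costs instead.
    Ties go to the mode listed first in MODES.
    """
    return min(MODES, key=lambda m: count_significant(apply_mode(block, m)))
```

With this selection rule, a 4x4 residual holding a single nonzero sample is represented most compactly when both transforms are skipped, a residual consisting of one constant row favors the horizontal transform alone, and a constant block favors the full 2-D transform.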
