Towards Perceptually Guided Rate-Distortion Optimization For Hevc

This paper proposes a novel approach for perceptually guiding the rate-distortion optimization (RDO) process within the High Efficiency Video Coding (HEVC) standard. The reference codec does not consider effectively the perceptual characteristics of the input video and further, the particular perceptual sensitivity of each coding tree unit (CTU) inside a frame. The corresponding frame-level Lagrangian multiplier depends only on the quantization parameter. Inspired by the mechanisms of the human visual system, the proposed solution is a CTU-Ievel adjustment of the standard Lagrangian value based on a set of complementary measured features. These measures rely on the spatial and temporal analysis of the current CTU in the frequency domain. Based on perceptual quality indices and Bjontegaard delta measurements, over several resolutions of tested video sequences, the proposed method demonstrates a promising coding performance according to the rate-distortion compromise.

[1]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  King Ngi Ngan,et al.  Perceptual sensitivity-based rate control method for high efficiency video coding , 2015, Multimedia Tools and Applications.

[4]  King Ngi Ngan,et al.  Perceptual adaptive Lagrangian multiplier for high efficiency video coding , 2013, 2013 Picture Coding Symposium (PCS).

[5]  Jing Chen,et al.  Perceptual feature guided rate distortion optimization for high efficiency video coding , 2017, Multidimens. Syst. Signal Process..

[6]  Manoranjan Paul,et al.  Fast Coding Strategy for HEVC by Motion Features and Saliency Applied on Difference Between Successive Image Blocks , 2015, PSIVT.

[7]  Fan Zhang,et al.  A Parametric Framework for Video Compression Using Region-Based Texture Models , 2011, IEEE Journal of Selected Topics in Signal Processing.

[8]  C.-C. Jay Kuo,et al.  Compressed image quality metric based on perceptually weighted distortion , 2015, IEEE Transactions on Image Processing.

[9]  C. D. Kuglin,et al.  The phase correlation image alignment method , 1975 .

[10]  Hervé Chauris,et al.  Uniform Discrete Curvelet Transform , 2010, IEEE Transactions on Signal Processing.

[11]  Gary J. Sullivan,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[13]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[14]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.