Rate-Distortion Optimization Using Adaptive Lagrange Multipliers

In current standardized hybrid video encoders, the Lagrange multiplier determination model is a key component in rate-distortion optimization. This originated some 20 years ago based on an entropy-constrained high-rate approximation and experimental results obtained using an H.263 reference encoder on limited test material. In this paper, we present a comprehensive analysis of the results of a Lagrange multiplier selection experiment conducted on various video content using H.264/AVC and HEVC reference encoders. These results show that the original Lagrange multiplier selection methods, employed in both video encoders, are able to achieve optimum rate-distortion performance for I and P frames, but fail to perform well for B frames. The relationship is identified between the optimum Lagrange multipliers for B frames and distortion information obtained from the experimental results, leading to a novel Lagrange multiplier determination approach. The proposed method adaptively predicts the optimum Lagrange multiplier for B frames based on the distortion statistics of recent reconstructed frames. After integration into both H.264/AVC and HEVC reference encoders, this approach was evaluated on 36 test sequences with various resolutions and differing content types. The results show consistent bitrate savings for various hierarchical B frame configurations with minimal additional complexity. BD savings average approximately 3% when constant quantization parameter (QP) values are used for all frames, and 0.5% when non-zero QP offset values are employed for different B frame hierarchical levels.

[1]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[2]  Mathias Wien,et al.  High Efficiency Video Coding: Coding Tools and Specification , 2014 .

[3]  Sanjit K. Mitra,et al.  Rate-distortion optimized mode selection for very low bit rate video coding and the emerging H.263 standard , 1996, IEEE Trans. Circuits Syst. Video Technol..

[4]  André Kaup,et al.  Laplace Distribution Based Lagrangian Rate Distortion Optimization for Hybrid Video Coding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Jinho Choi,et al.  A stable feedback control of the buffer state using the controlled Lagrange multiplier method , 1994, IEEE Trans. Image Process..

[6]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[7]  Yücel Altunbasak,et al.  An analysis of the DCT coefficient distribution with the H.264 video coder , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Wen Gao,et al.  SSIM-Motivated Rate-Distortion Optimization for Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Lulin Chen,et al.  Adaptive λ estimation in Lagrangian rate-distortion optimization for video coding , 2006, Electronic Imaging.

[11]  Herbert Gish,et al.  Asymptotically efficient quantizing , 1968, IEEE Trans. Inf. Theory.

[12]  Cisco Visual Networking Index: Forecast and Methodology 2016-2021.(2017) http://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual- networking-index-vni/complete-white-paper-c11-481360.html. High Efficiency Video Coding (HEVC) Algorithms and Architectures https://jvet.hhi.fraunhofer. , 2017 .

[13]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[14]  Thomas R. Fischer,et al.  Comparison of generalized Gaussian and Laplacian modeling in DCT image coding , 1995, IEEE Signal Processing Letters.

[15]  Ll. G. Chambers,et al.  Calculus: Concepts and Contexts, Single Variable , 1997 .

[16]  Chuohao Yeo,et al.  On Rate Distortion Optimization Using SSIM , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Fan Zhang,et al.  An adaptive QP offset determination method for HEVC , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[18]  Ke Wang,et al.  On the Optimal Presentation Duration for Subjective Video Quality Assessment , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Fan Zhang,et al.  A Perception-Based Hybrid Model for Video Quality Assessment , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  K. R. Rao,et al.  High efficiency video coding , 2016, 2016 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA).

[21]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[22]  Qingming Huang,et al.  Visual perception based Lagrangian rate distortion optimization for video coding , 2011, 2011 18th IEEE International Conference on Image Processing.

[23]  Wen Gao,et al.  Novel Statistical Modeling, Analysis and Implementation of Rate-Distortion Estimation for H.264/AVC Coders , 2010, IEEE Trans. Circuits Syst. Video Technol..

[24]  K. Rijkse,et al.  H.263: video coding for low-bit-rate communication , 1996, IEEE Commun. Mag..

[25]  Mark J. Huiskes,et al.  DynTex: A comprehensive database of dynamic textures , 2010, Pattern Recognit. Lett..

[26]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[27]  Jun Zhang,et al.  Context Adaptive Lagrange Multiplier (CALM) for Rate-Distortion Optimal Motion Estimation in Video Coding , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Stefan Winkler,et al.  Analysis of Public Image and Video Databases for Quality Assessment , 2012, IEEE Journal of Selected Topics in Signal Processing.

[29]  Edward J. Delp,et al.  Segmentation-Based Video Compression Using Texture and Motion Models , 2011, IEEE Journal of Selected Topics in Signal Processing.

[30]  OhmJens-Rainer,et al.  Comparison of the Coding Efficiency of Video Coding Standards—Including High Efficiency Video Coding (HEVC) , 2012 .

[31]  Thomas Wiegand,et al.  Generic and Robust Video Coding with Texture Analysis and Synthesis , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[32]  Alan C. Bovik,et al.  Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[33]  N. Nikolaidis,et al.  Video shot detection and condensed representation. a review , 2006, IEEE Signal Processing Magazine.

[34]  Thomas Wiegand,et al.  Perception-oriented video coding based on image analysis and completion: A review , 2011, Signal Process. Image Commun..

[35]  K. H. Barratt Digital Coding of Waveforms , 1985 .

[36]  Fan Zhang,et al.  A video texture database for perceptual compression and quality assessment , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[37]  S. Liu,et al.  Statistical analysis of the DCT coefficients and their quantization error , 1996, Conference Record of The Thirtieth Asilomar Conference on Signals, Systems and Computers.

[38]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[39]  Fan Zhang,et al.  A Parametric Framework for Video Compression Using Region-Based Texture Models , 2011, IEEE Journal of Selected Topics in Signal Processing.

[40]  Fan Zhang,et al.  An adaptive Lagrange multiplier determination method for rate-distortion optimisation in hybrid video codecs , 2015, 2015 IEEE International Conference on Image Processing (ICIP).