SSIM-Motivated Rate-Distortion Optimization for Video Coding

We propose a rate-distortion optimization (RDO) scheme based on the structural similarity (SSIM) index, which was found to be a better indicator of perceived image quality than mean-squared error, but has not been fully exploited in the context of image and video coding. At the frame level, an adaptive Lagrange multiplier selection method is proposed based on a novel reduced-reference statistical SSIM estimation algorithm and a rate model that combines the side information with the entropy of the transformed residuals. At the macroblock level, the Lagrange multiplier is further adjusted based on an information theoretical approach that takes into account both the motion information content and perceptual uncertainty of visual speed perception. Finally, the mode for H.264/AVC coding is selected by the SSIM index and the adjusted Lagrange multiplier. Extensive experiments show that the proposed scheme can achieve significantly better rate-SSIM performance and provide better visual quality than conventional RDO coding schemes.

[1]  Lai-Man Po,et al.  A Novel Motion Estimation Method Based on Structural Similarity for H.264 Inter Prediction , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2]  Lulin Chen,et al.  Adaptive λ estimation in Lagrangian rate-distortion optimization for video coding , 2006, Electronic Imaging.

[3]  Susu Yao,et al.  Just noticeable distortion model and its applications in video coding , 2005, Signal Process. Image Commun..

[4]  Bo Yan,et al.  Lagrangian Multiplier Based Joint Three-Layer Rate Control for H.264/AVC , 2009, IEEE Signal Processing Letters.

[5]  Lai-Man Po,et al.  An SSIM-optimal H.264/AVC inter frame encoder , 2009, 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[6]  Wen Gao,et al.  Rate-distortion optimized transform for intra-frame coding , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  En-Hui Yang,et al.  Rate Distortion Optimization for H.264 Interframe Coding: A General Framework and Algorithms , 2007, IEEE Transactions on Image Processing.

[8]  Zhou Wang,et al.  Video quality assessment based on structural distortion measurement , 2004, Signal Process. Image Commun..

[9]  T. Vlachos Simple method for estimation of global motion parameters using sparse translational motion vector fields , 1998 .

[10]  André Kaup,et al.  Laplace Distribution Based Lagrangian Rate Distortion Optimization for Hybrid Video Coding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  J. Bennett,et al.  Advanced video coding , 2003 .

[12]  Zhou Wang,et al.  Video quality assessment using a statistical model of human visual speed perception. , 2007, Journal of the Optical Society of America. A, Optics, image science, and vision.

[13]  Homer H. Chen,et al.  Perceptual-based coding mode decision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[14]  Homer H. Chen,et al.  A perceptual-based approach to bit allocation for H.264 encoder , 2010, Visual Communications and Image Processing.

[15]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[16]  Yücel Altunbasak,et al.  An analysis of the DCT coefficient distribution with the H.264 video coder , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Sanjit K. Mitra,et al.  . Optimum bit allocation and accurate rate control for video coding via ρ-domain source modeling , 2002, IEEE Trans. Circuits Syst. Video Technol..

[18]  Alan C. Bovik,et al.  Image information and visual quality , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[19]  Wen Gao,et al.  SSIM based perceptual distortion rate optimization coding , 2010, Visual Communications and Image Processing.

[20]  Bernd Girod,et al.  What's wrong with mean-squared error? , 1993 .

[21]  Abdul Rehman,et al.  Reduced-reference SSIM estimation , 2010, 2010 IEEE International Conference on Image Processing.

[22]  Homer H. Chen,et al.  Improving video coding quality by perceptual rate-distortion optimization , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[23]  Robert W. Heath,et al.  Rate Bounds on SSIM Index of Quantized Images , 2008, IEEE Transactions on Image Processing.

[24]  Minqiang Jiang,et al.  On Lagrange multiplier and quantizer adjustment for H.264 frame-layer video rate control , 2006, IEEE Trans. Circuits Syst. Video Technol..

[25]  Thrasyvoulos N. Pappas,et al.  Structural Similarity Quality Metrics in a Coding Context: Exploring the Space of Realistic Distortions , 2008, IEEE Transactions on Image Processing.

[26]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[27]  Hua Li,et al.  Macroblock-Level Rate-Distortion Optimization with Perceptual Adjustment for Video Coding , 2008, Data Compression Conference (dcc 2008).

[28]  Babu Hemanth Kumar Aswathappa,et al.  Rate-distortion optimization using structural information in H.264 strictly Intra-frame encoder , 2010, 2010 42nd Southeastern Symposium on System Theory (SSST).

[29]  Jun Zhang,et al.  Context Adaptive Lagrange Multiplier (CALM) for Rate-Distortion Optimal Motion Estimation in Video Coding , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Christine Guillemot,et al.  Perceptually-Friendly H.264/AVC Video Coding Based on Foveated Just-Noticeable-Distortion Model , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  Weisi Lin,et al.  Motion-compensated residue preprocessing in video coding based on just-noticeable-distortion profile , 2005, IEEE Trans. Circuits Syst. Video Technol..

[32]  Homer H. Chen,et al.  Perceptual Rate-Distortion Optimization Using Structural Similarity Index as Quality Metric , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[34]  Jun Sun,et al.  Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos , 2006, IEEE Trans. Circuits Syst. Video Technol..

[35]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[36]  Lai-Man Po,et al.  A New Rate-Distortion Optimization Using Structural Information in H.264 I-Frame Encoder , 2005, ACIVS.

[37]  Alan C. Bovik,et al.  Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[38]  Lai-Man Po,et al.  Improved Inter Prediction based on Structural Similarity in H.264 , 2007, 2007 IEEE International Conference on Signal Processing and Communications.

[39]  Ashraf A. Kassim,et al.  Complexity-based rate distortion optimization with perceptual tuning for scalable video coding , 2005, IEEE International Conference on Image Processing 2005.

[40]  En-Hui Yang,et al.  Soft Decision Quantization for H.264 With Main Profile Compatibility , 2009, IEEE Trans. Circuits Syst. Video Technol..

[41]  Yi-Hsin Huang,et al.  Predictive Lagrange Multiplier Selection for Perceptual Rate-Distortion Optimization , 2009 .

[42]  Thomas Wiegand,et al.  Lagrange multiplier selection in hybrid video coder control , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[43]  Sheila S. Hemami,et al.  VSNR: A Wavelet-Based Visual Signal-to-Noise Ratio for Natural Images , 2007, IEEE Transactions on Image Processing.

[44]  Eero P. Simoncelli,et al.  Noise characteristics and prior expectations in human visual speed perception , 2006, Nature Neuroscience.

[45]  Do-Kyoung Kwon,et al.  Rate Control for H.264 Video With Enhanced Rate and Distortion Models , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[46]  Chun-Ling Yang,et al.  Improved best prediction mode(s) selection methods based on structural similarity in H.264 I-frame encoder , 2005, 2005 IEEE International Conference on Systems, Man and Cybernetics.

[47]  Herbert Gish,et al.  Asymptotically efficient quantizing , 1968, IEEE Trans. Inf. Theory.