SSIM based perceptual distortion rate optimization coding

Current rate-distortion optimization (RDO) coding schemes usually use the Sum of Absolute Differences (SAD) or the Sum of Squared Differences (SSD) as the distortion metric. However, neither SAD nor SSD correlates well with quality as perceived by the human visual system (HVS). To develop a video encoder based on perceptual distortion, we employ the Structural Similarity (SSIM) index as the distortion metric and propose an SSIM-based Lagrangian perceptual distortion rate optimization (PDRO) method in this paper. Furthermore, to adapt dynamically to different input sequences, we present an adaptive Lagrange multiplier selection scheme based on the properties of the input sequence. By modeling the transformed residuals with a Laplace distribution, we derive statistical SSIM and rate models from which the adaptive Lagrange multiplier is obtained. Extensive experiments show that the proposed scheme achieves better perceptual distortion-rate performance and provides better visual quality than SAD/SSD-based RDO coding schemes.
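
To make the idea of an SSIM-based Lagrangian cost concrete, the following is a minimal sketch, not the encoder described above. It assumes the distortion is taken as 1 - SSIM, computes SSIM over a single unweighted window with the usual 8-bit constants, and treats the Lagrange multiplier `lam` as a given input rather than deriving it from the Laplace-based models. The names `ssim`, `pdro_cost`, and `best_mode` are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: mode decision with an SSIM-based Lagrangian cost.
# Assumptions (not from the source): distortion = 1 - SSIM, a single
# unweighted SSIM window per block, and a fixed multiplier `lam`.

# Common SSIM stabilizing constants for 8-bit samples.
C1 = (0.01 * 255) ** 2
C2 = (0.03 * 255) ** 2


def _mean(values):
    values = list(values)
    return sum(values) / len(values)


def ssim(x, y):
    """Single-window SSIM between two equally sized flat lists of samples."""
    if len(x) != len(y) or not x:
        raise ValueError("blocks must be non-empty and of equal size")
    mx, my = _mean(x), _mean(y)
    vx = _mean((a - mx) ** 2 for a in x)
    vy = _mean((b - my) ** 2 for b in y)
    cov = _mean((a - mx) * (b - my) for a, b in zip(x, y))
    num = (2 * mx * my + C1) * (2 * cov + C2)
    den = (mx * mx + my * my + C1) * (vx + vy + C2)
    return num / den


def pdro_cost(orig, recon, bits, lam):
    """Lagrangian cost J = D + lam * R with D assumed to be 1 - SSIM."""
    return (1.0 - ssim(orig, recon)) + lam * bits


def best_mode(orig, candidates, lam):
    """Pick the candidate (name, recon, bits) with the lowest cost."""
    return min(candidates, key=lambda c: pdro_cost(orig, c[1], c[2], lam))[0]


if __name__ == "__main__":
    block = [10, 20, 30, 40]
    modes = [
        ("fine", [10, 20, 30, 40], 100),   # exact reconstruction, many bits
        ("coarse", [25, 25, 25, 25], 10),  # flat reconstruction, few bits
    ]
    for lam in (0.0, 1.0):
        print(lam, best_mode(block, modes, lam))
```

With `lam = 0` the cost reduces to the distortion alone, so the exact reconstruction wins; a larger `lam` shifts the decision toward the cheaper candidate. An adaptive scheme would replace the fixed `lam` with a value computed per sequence.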
