Perceptual Rate-Distortion Optimization Using Structural Similarity Index as Quality Metric

The rate-distortion optimization (RDO) framework for video coding trades off bit-rate against quality. However, the objective distortion metrics traditionally used in this framework, such as mean squared error, correlate poorly with perceptual quality. We address this issue by incorporating the structural similarity (SSIM) index into the framework as the quality metric. In particular, we develop a predictive Lagrange multiplier estimation method to resolve the chicken-and-egg dilemma of perceptual-based RDO and apply it to H.264 intra and inter mode decision. At a given perceptual quality level, the resulting video encoder achieves on average a 9% bit-rate reduction for intra-frame coding and an 11% reduction for inter-frame coding relative to the JM reference software. Subjective tests further confirm that, at the same bit-rate, the proposed perceptual RDO preserves image details and prevents blocking artifacts better than traditional RDO.
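As a rough illustration of the idea, the sketch below scores each candidate coding mode with a Lagrangian cost J = (1 - SSIM) + λ·R and picks the cheapest one. It is a minimal sketch under stated assumptions, not the encoder described here: SSIM is computed over a single window covering the whole block using the standard constants from the SSIM definition, the distortion term 1 - SSIM is an assumed form, and the names `ssim_block`, `rd_cost`, and `choose_mode` are hypothetical. The multiplier `lam` is simply passed in; the predictive estimation of λ is not reproduced.

```python
def ssim_block(x, y, dynamic_range=255.0):
    """Single-window SSIM between two equally sized blocks given as flat lists."""
    n = len(x)
    c1 = (0.01 * dynamic_range) ** 2  # stabilizer for the luminance term
    c2 = (0.03 * dynamic_range) ** 2  # stabilizer for the contrast/structure term
    mx = sum(x) / n
    my = sum(y) / n
    vx = sum((a - mx) ** 2 for a in x) / n
    vy = sum((b - my) ** 2 for b in y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    return ((2 * mx * my + c1) * (2 * cov + c2)) / (
        (mx * mx + my * my + c1) * (vx + vy + c2)
    )


def rd_cost(orig, recon, bits, lam):
    """Assumed perceptual RD cost: distortion (1 - SSIM) plus lam times the rate."""
    return (1.0 - ssim_block(orig, recon)) + lam * bits


def choose_mode(orig, candidates, lam):
    """Return the mode with the lowest cost.

    candidates maps a mode name to a (reconstructed_block, bits) pair.
    """
    return min(
        candidates,
        key=lambda m: rd_cost(orig, candidates[m][0], candidates[m][1], lam),
    )


if __name__ == "__main__":
    block = [10, 20, 30, 40]
    modes = {
        "detailed": ([10, 20, 30, 40], 100),  # perfect reconstruction, costly
        "flat": ([25, 25, 25, 25], 10),       # structure lost, cheap
    }
    for lam in (0.0, 1.0):
        print(lam, choose_mode(block, modes, lam))
```

With λ = 0 only distortion matters, so the mode that keeps the block's structure wins; as λ grows, the cheaper mode takes over. This dependence is why the choice of λ matters so much in perceptual RDO.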
