Improved best prediction mode(s) selection methods based on structural similarity in H.264 I-frame encoder

In H.264 I-frame encoder, the best infra prediction modes are chosen by utilizing the rate-distortion (R-D) optimization whose distortion is the sum of the squared differences (SSD, means the same as MSE) between the reconstructed and the original blocks. Recently a new image measurement called structural similarity (SSIM) based on the degradation of structural information was brought forward. It is proved that the SSIM can provide a better approximation to the perceived image distortion than the currently used PSNR (or MSE). In this paper, we propose two improved prediction modes selection methods based on SSIM for H.264 I-frame encoder. The first one is the SSIM-based R-D optimization (SBRDO) method, the other is the fast mode selection method based on SSIM (FMSBS). Experiments show that both the proposed method can improve the coding efficiency while maintaining the same perceptual reconstructed image quality.

[1]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[2]  Peng Gao,et al.  Rate control for advance video coding (AVC) standard , 2003, Proceedings of the 2003 International Symposium on Circuits and Systems, 2003. ISCAS '03..

[3]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[4]  Zhou Wang,et al.  Why is image quality assessment so difficult? , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[6]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[7]  Thomas Wiegand,et al.  Lagrange multiplier selection in hybrid video coder control , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).