Rate Distortion Optimization for H.264 Interframe Coding: A General Framework and Algorithms

Rate distortion (RD) optimization for H.264 interframe coding with complete baseline decoding compatibility is investigated on a frame basis. Using soft decision quantization (SDQ) rather than the standard hard decision quantization, we first establish a general framework in which motion estimation, quantization, and entropy coding (in H.264) for the current frame can be jointly designed to minimize a true RD cost given previously coded reference frames. We then propose three RD optimization algorithms-a graph-based algorithm for near optimal SDQ in H.264 baseline encoding given motion estimation and quantization step sizes, an algorithm for near optimal residual coding in H.264 baseline encoding given motion estimation, and an iterative overall algorithm to optimize H.264 baseline encoding for each individual frame given previously coded reference frames-with them embedded in the indicated order. The graph-based algorithm for near optimal SDQ is the core; given motion estimation and quantization step sizes, it is guaranteed to perform optimal SDQ if the weak adjacent block dependency utilized in the context adaptive variable length coding of H.264 is ignored for optimization. The proposed algorithms have been implemented based on the reference encoder JM82 of H.264 with complete compatibility to the baseline profile. Experiments show that for a set of typical video testing sequences, the graph-based algorithm for near optimal SDQ, the algorithm for near optimal residual coding, and the overall algorithm achieve on average, 6%, 8%, and 12%, respectively, rate reduction at the same PSNR (ranging from 30 to 38 dB) when compared with the RD optimization method implemented in the H.264 reference software.

[1]  Harvey J. Everett Generalized Lagrange Multiplier Method for Solving Problems of Optimum Allocation of Resources , 1963 .

[2]  Toby Berger,et al.  Rate distortion theory : a mathematical basis for data compression , 1971 .

[3]  Abraham Lempel,et al.  A universal algorithm for sequential data compression , 1977, IEEE Trans. Inf. Theory.

[4]  Abraham Lempel,et al.  Compression of individual sequences via variable-rate coding , 1978, IEEE Trans. Inf. Theory.

[5]  En-Hui Yang,et al.  Distortion program-size complexity with respect to a fidelity criterion and rate-distortion function , 1993, IEEE Trans. Inf. Theory.

[6]  Antonio Ortega,et al.  Bit allocation for dependent quantization with applications to multiresolution and MPEG video coders , 1994, IEEE Trans. Image Process..

[7]  Kannan Ramchandran,et al.  Rate-distortion optimal fast thresholding with complete JPEG/MPEG decoder compatibility , 1994, IEEE Trans. Image Process..

[8]  Wei Ding,et al.  Rate control of MPEG video coding and recording by rate-quantization modeling , 1996, IEEE Trans. Circuits Syst. Video Technol..

[9]  Sanjit K. Mitra,et al.  Rate-distortion optimized mode selection for very low bit rate video coding and the emerging H.263 standard , 1996, IEEE Trans. Circuits Syst. Video Technol..

[10]  Toby Berger,et al.  Fixed-slope universal lossy data compression , 1997, IEEE Trans. Inf. Theory.

[11]  Hsueh-Ming Hang,et al.  Source model for transform video coder and its application. I. Fundamental theory , 1997, IEEE Trans. Circuits Syst. Video Technol..

[12]  Kannan Ramchandran,et al.  Joint thresholding and quantizer selection for transform image coding: entropy-constrained analysis and applications to baseline JPEG , 1997, IEEE Trans. Image Process..

[13]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[14]  Zhen Zhang,et al.  Variable rate trellis source encoding , 1998, Proceedings. 1998 IEEE International Symposium on Information Theory (Cat. No.98CH36252).

[15]  Antonio Ortega,et al.  Rate-distortion methods for image and video compression , 1998, IEEE Signal Process. Mag..

[16]  R. Ladner Entropy-constrained Vector Quantization , 2000 .

[17]  Bernd Girod,et al.  Efficiency analysis of multihypothesis motion-compensated prediction for video coding , 2000, IEEE Trans. Image Process..

[18]  John D. Villasenor,et al.  Trellis-based R-D optimal quantization in H.263+ , 2000, IEEE Trans. Image Process..

[19]  John D. Villasenor,et al.  Trellis-based R-D optimal quantization in H.263+ , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[20]  Thomas Wiegand,et al.  Lagrange multiplier selection in hybrid video coder control , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[21]  Iain E. G. Richardson,et al.  H.264 and MPEG-4 Video Compression: Video Coding for Next-Generation Multimedia , 2003 .

[22]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[23]  젱 지안펑 Method, system and software product for color image encoding , 2005 .

[24]  Anastasis A. Sofokleous,et al.  Review: H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia , 2005, Comput. J..

[25]  Yücel Altunbasak,et al.  Frame bit allocation for H.264 using Cauchy-distribution based source modelling , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[26]  En-Hui Yang,et al.  On joint optimization of motion compensation, quantization and baseline entropy coding in H.264 with complete decoder compatibility , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[27]  En-Hui Yang,et al.  Rate Distortion Optimization of H.264 with Main Profile Compatibility , 2006, 2006 IEEE International Symposium on Information Theory.

[28]  En-Hui Yang,et al.  Soft Decision Quantization for H.264 With Main Profile Compatibility , 2009, IEEE Trans. Circuits Syst. Video Technol..

[29]  En-Hui Yang,et al.  Joint Optimization of Run-Length Coding, Huffman Coding, and Quantization Table With Complete Baseline JPEG Decoder Compatibility , 2009, IEEE Transactions on Image Processing.