In this paper, we study the best rate distortion performance that an H.264 encoder can possibly achieve. Using soft decision quantization rather than the standard hard decision quantization, we first establish a general framework for jointly designing motion compensation, quantization, and entropy coding in the hybrid coding structure of H.264 to minimize a true rate distortion cost. We then propose three rate distortion optimization algorithms—a graph-based algorithm for optimal soft decision quantization in H.264 baseline profile encoding given motion compensation and quantization step sizes, an iterative algorithm for optimal residual coding in H.264 baseline profile encoding given motion compensation, and an iterative overall algorithm for optimal H.264 baseline profile encoding—with them embedded in the indicated order. The graph-based algorithm for optimal soft decision quantization is the core; given motion compensation and quantization step sizes, it is guaranteed to perform optimal soft decision quantization to certain degree. The proposed iterative overall algorithm has been implemented based on the reference encoder JM82 of H.264. Comparative studies show that it achieves a significant performance gain, which can be as high as 25% rate reduction at the same PSNR when compared to the reference encoder.
[1]
Toby Berger,et al.
Fixed-slope universal lossy data compression
,
1997,
IEEE Trans. Inf. Theory.
[2]
Antonio Ortega,et al.
Bit allocation for dependent quantization with applications to multiresolution and MPEG video coders
,
1994,
IEEE Trans. Image Process..
[3]
Kannan Ramchandran,et al.
Joint thresholding and quantizer selection for transform image coding: entropy-constrained analysis and applications to baseline JPEG
,
1997,
IEEE Trans. Image Process..
[4]
En-Hui Yang,et al.
Distortion program-size complexity with respect to a fidelity criterion and rate-distortion function
,
1993,
IEEE Trans. Inf. Theory.
[5]
Gary J. Sullivan,et al.
Rate-constrained coder control and comparison of video coding standards
,
2003,
IEEE Trans. Circuits Syst. Video Technol..
[6]
John D. Villasenor,et al.
Trellis-based R-D optimal quantization in H.263+
,
2000,
Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).
[7]
Toby Berger,et al.
Rate distortion theory : a mathematical basis for data compression
,
1971
.