Optimum bit allocation and accurate rate control for video coding via ρ-domain source modeling

We present a new framework for rate-distortion (R-D) analysis in which the coding rate R and the distortion D are treated as functions of ρ, the percentage of zeros among the quantized transform coefficients. Previously (see He, Z. et al., Int. Conf. Acoustics, Speech and Sig. Proc., 2001), we observed that, in transform coding of images and videos, the rate function R(ρ) is approximately linear. Based on this linear rate model, a simple and unified rate control algorithm was proposed for all standard video coding systems, such as MPEG-2, H.263, and MPEG-4. Here we further develop a distortion model and an optimum bit allocation scheme in the ρ domain. This bit allocation scheme is applied to MPEG-4 video coding to allocate the available bits among different video objects. The bit target of each object is then met by our ρ-domain rate control algorithm. When coupled with a macroblock classification scheme, the same bit allocation and rate control scheme can also be applied at the macroblock level to other video coding systems, such as H.263. Our extensive experimental results show that the proposed algorithm controls the encoder bit rate very accurately and improves the video quality significantly (by up to 1.5 dB).
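As a rough illustration of how a linear rate model in ρ can drive rate control, the sketch below measures ρ for a given quantization step, fits the slope of the line from one observation, and then searches for the step that meets a bit target. It is not the authors' implementation. The abstract states only that R(ρ) is approximately linear; the specific form R = θ(1 − ρ), the plain dead-zone test |c| < q_step, and all function names are assumptions made for this example. ρ is expressed as a fraction in [0, 1] rather than a percentage.

```python
from typing import Iterable, Sequence


def zero_fraction(coeffs: Sequence[float], q_step: float) -> float:
    """Fraction (0..1) of coefficients that quantize to zero.

    Assumption: a uniform quantizer that truncates toward zero, so a
    coefficient c becomes zero exactly when |c| < q_step.
    """
    if not coeffs:
        raise ValueError("coeffs must not be empty")
    zeros = sum(1 for c in coeffs if abs(c) < q_step)
    return zeros / len(coeffs)


def estimate_theta(bits_used: float, rho: float) -> float:
    """Slope theta of the assumed model R = theta * (1 - rho),
    estimated from a single (bits, rho) observation."""
    if rho >= 1.0:
        raise ValueError("rho must be below 1 to estimate theta")
    return bits_used / (1.0 - rho)


def target_rho(target_bits: float, theta: float) -> float:
    """Invert the assumed linear model to get the rho that should
    produce target_bits, clamped to the valid range [0, 1]."""
    rho = 1.0 - target_bits / theta
    return min(max(rho, 0.0), 1.0)


def pick_q_step(coeffs: Sequence[float], rho_goal: float,
                candidates: Iterable[float]) -> float:
    """Smallest candidate step whose zero fraction reaches rho_goal.

    Falls back to the largest candidate if none reaches the goal.
    """
    steps = sorted(candidates)
    for q in steps:
        if zero_fraction(coeffs, q) >= rho_goal:
            return q
    return steps[-1]


if __name__ == "__main__":
    # Hypothetical transform coefficients for one coding unit.
    coeffs = [0.5, 1.5, 2.5, 3.5, 10.0, -0.2, -4.0, 7.0]
    rho_now = zero_fraction(coeffs, 3)        # measured at the current step
    theta = estimate_theta(400.0, rho_now)    # 400 bits were spent at that step
    goal = target_rho(200.0, theta)           # want to spend 200 bits next
    print(pick_q_step(coeffs, goal, range(1, 32)))
```

The search in `pick_q_step` relies on the zero fraction being non-decreasing in the quantization step, which holds for the dead-zone test assumed here; a real encoder would build this mapping from the coefficient histogram rather than rescanning the coefficients for each candidate.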

[1]  Herbert Gish,et al.  Asymptotically efficient quantizing , 1968, IEEE Trans. Inf. Theory.

[2]  Hsueh-Ming Hang,et al.  Source model for transform video coder and its application. I. Fundamental theory , 1997, IEEE Trans. Circuits Syst. Video Technol..

[3]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[4]  Roberto H. Bamberger,et al.  Optimum classification in subband coding of images , 1994, Proceedings of 1st International Conference on Image Processing.

[5]  Wei Ding,et al.  Rate control of MPEG video coding and recording by rate-quantization modeling , 1996, IEEE Trans. Circuits Syst. Video Technol..

[6]  Joseph W. Goodman,et al.  A mathematical analysis of the DCT coefficient distributions for images , 2000, IEEE Trans. Image Process..

[7]  Tihao Chiang,et al.  Scalable rate control for MPEG-4 video , 2000, IEEE Trans. Circuits Syst. Video Technol..

[8]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[9]  Ya-Qin Zhang,et al.  Scalable rate control for MPEG-4 video , 2000 .

[10]  P. Schultheiss,et al.  Block Quantization of Correlated Gaussian Random Variables , 1963 .

[11]  K. Rijkse,et al.  H.263: video coding for low-bit-rate communication , 1996, IEEE Commun. Mag..

[12]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[13]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[14]  Kannan Ramchandran,et al.  Rate-distortion optimal fast thresholding with complete JPEG/MPEG decoder compatibility , 1994, IEEE Trans. Image Process..

[15]  Sanjit K. Mitra,et al.  Blockwise zero mapping image coding , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[16]  Antonio Ortega,et al.  Bit-rate control using piecewise approximated rate-distortion characteristics , 1998, IEEE Trans. Circuits Syst. Video Technol..

[17]  Bennett Fox,et al.  Discrete Optimization Via Marginal Analysis , 1966 .

[18]  Michael G. Strintzis,et al.  Video coding for wireless varying bit-rate communications based on area of interest and region representation , 1997, Proceedings of International Conference on Image Processing.

[19]  T. Cover,et al.  Rate Distortion Theory , 2001 .

[20]  Sanjit K. Mitra,et al.  A novel linear source model and a unified rate control algorithm for H.263/MPEG-2/MPEG-4 , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[21]  K. Yang,et al.  A normalized rate-distortion model for H.263-compatible codecs and its application to quantizer selection , 1997, Proceedings of International Conference on Image Processing.

[22]  Dimitris N. Metaxas,et al.  Optical Flow Constraints on Deformable Models with Applications to Face Tracking , 2000, International Journal of Computer Vision.

[23]  Thomas Sikora,et al.  The MPEG-4 video standard verification model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[24]  ITU-T.  Video coding for low bitrate communication , 1996 .

[25]  Adrian Segall.  Bit allocation and encoding for vector sources , 1976, IEEE Trans. Inf. Theory.

[26]  Jordi Ribas-Corbera,et al.  Rate control in DCT video coding for low-delay communications , 1999, IEEE Trans. Circuits Syst. Video Technol..

[27]  D. Le Gall,et al.  MPEG: A video compression standard for multimedia applications , 1991 .

[28]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[29]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[30]  Bo Tao,et al.  A rate-quantization model for MPEG encoders , 1997, Proceedings of International Conference on Image Processing.