Region-of-interest based rate control for low-bit-rate video conferencing

We present a region-of-interest (ROI) based rate control scheme for H.263-compatible video conferencing. A very low complexity face detection and tracking scheme is proposed for segmentation. By analyzing quadratic rate models at the frame layer, the video object plane (VOP) layer, and the macroblock (MB) layer, as extracted from the test data, we propose an MB-layer quadratic rate model with a modified physical meaning that improves model accuracy. The basic idea is to update the model parameters using a group of uncoded MBs in the current VOP instead of individual MBs. A joint VOP-layer and MB-layer rate control algorithm is proposed. The VOP-layer rate control assigns a target bit rate to each VOP based on its coding complexity and visual importance, and determines an average quantization parameter (QP) for each VOP. The MB-layer rate control adds new features that use the average statistics of a VOP together with the individual statistics of each MB. Compared with conventional TMN8 and object-based VM8 rate control, the proposed scheme achieves better peak SNR (PSNR) in the ROI and more accurate rate control for various video sequences. The proposed rate control algorithm can be extended to H.264 ROI-based scalable video coding.
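As a rough illustration of the two ideas summarized above, the sketch below splits a frame's bit budget across VOPs and then solves a quadratic rate model for a QP. It is not the paper's algorithm: it assumes the commonly used form R = x1*MAD/Q + x2*MAD/Q^2, and the function names, the weight-times-complexity allocation rule, and all parameter values are hypothetical. Only the QP range 1 to 31 comes from H.263 itself.

```python
import math

def allocate_bits(frame_bits, vops):
    """Split frame_bits across VOPs in proportion to weight * complexity.

    vops is a list of (visual_weight, complexity) pairs; both quantities
    and the proportional rule are illustrative assumptions.
    """
    scores = [weight * complexity for weight, complexity in vops]
    total = sum(scores)
    if total <= 0:
        # No usable statistics: fall back to an even split.
        return [frame_bits / len(vops)] * len(vops)
    return [frame_bits * s / total for s in scores]

def solve_qp(target_bits, mad, x1, x2, qp_min=1, qp_max=31):
    """Return the QP at which the assumed quadratic model predicts target_bits.

    Solves target*Q^2 - x1*mad*Q - x2*mad = 0 for the positive root and
    clamps the result to the H.263 QP range [1, 31].
    """
    if target_bits <= 0 or mad <= 0:
        return qp_max
    if x2 == 0:
        q = x1 * mad / target_bits  # model degenerates to first order
    else:
        disc = (x1 * mad) ** 2 + 4.0 * target_bits * x2 * mad
        if disc < 0:
            return qp_max
        q = (x1 * mad + math.sqrt(disc)) / (2.0 * target_bits)
    return max(qp_min, min(qp_max, round(q)))

# Example: a face VOP weighted 3x relative to the background VOP.
roi_bits, bg_bits = allocate_bits(1000, [(3.0, 10.0), (1.0, 10.0)])
roi_qp = solve_qp(roi_bits, mad=10.0, x1=100.0, x2=0.0)
bg_qp = solve_qp(bg_bits, mad=10.0, x1=100.0, x2=0.0)
```

With equal complexity, the higher visual weight gives the ROI more bits and therefore a lower QP than the background, which is the qualitative behavior an ROI-based scheme aims for.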
