Region of interest based H.263 compatible codec and its rate control for low bit rate video conferencing

This paper presents a region-of-interest (ROI) based, H.263-compatible video codec that brings the idea of object-based coding from MPEG-4 Visual into the traditional block-based H.263 codec. A very low-complexity face detection and tracking scheme is proposed to segment the human face from video conferencing sequences in real time. Using this segmentation information, the ROI-based codec and its associated rate control schemes are designed. For VBR video, the proposed rate control is a joint frame-layer and macroblock-layer scheme. For CBR video, a macroblock-layer rate control is proposed. TMN8 is adopted as the platform, and the modified quantization mode in Annex T of H.263 is used to gain flexibility in assigning quantization parameters to different macroblocks.
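As a rough illustration of what assigning quantization parameters per macroblock can look like, the sketch below gives ROI (face) macroblocks a finer quantizer than background macroblocks. It is not the rate control proposed in the paper: the function name `assign_macroblock_qp`, the fixed offsets `roi_offset` and `bg_offset`, and the list-of-lists mask layout are all assumptions made for this example. Only the clamping range of 1 to 31 reflects the H.263 quantizer range.

```python
from typing import List

QP_MIN, QP_MAX = 1, 31  # valid H.263 quantizer range


def assign_macroblock_qp(
    roi_mask: List[List[int]],
    base_qp: int,
    roi_offset: int = -4,  # hypothetical: finer quantization inside the ROI
    bg_offset: int = 4,    # hypothetical: coarser quantization in the background
) -> List[List[int]]:
    """Return one QP per macroblock from a binary ROI mask (1 = face region)."""

    def clamp(qp: int) -> int:
        # Keep every QP inside the range the bitstream can signal.
        return max(QP_MIN, min(QP_MAX, qp))

    return [
        [clamp(base_qp + (roi_offset if in_roi else bg_offset)) for in_roi in row]
        for row in roi_mask
    ]


if __name__ == "__main__":
    # 2x3 macroblock grid with the face covering the middle column.
    mask = [[0, 1, 0],
            [0, 1, 0]]
    for row in assign_macroblock_qp(mask, base_qp=12):
        print(row)
```

In a real encoder the offsets would not be fixed constants; a rate controller would adapt them to the bit budget, which is the role of the frame-layer and macroblock-layer schemes described in the abstract.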
