Rate Control and Error Resilience for Object-Based Video Coding

The MPEG-4 audiovisual coding standard introduced the object-based video data representation model where video data is no longer seen as a sequence of frames or fields, but consists of independent (semantically) relevant video objects that together build the video scene. This representation approach allows new and improved functionalities, but it has also created new relevant problems in terms of typical non-normative parts of the standard, such as rate control and error resilience, which need to be solved in order to successfully transmit object-based video with an acceptable quality over networks that have critical bandwidth and channel error characteristics, such as mobile networks and the Internet. To deal with the specific problems of object-based video coding, rate control demands two levels of action: 1) the scene-level, which is responsible for dynamically allocating the available resources between the various objects in the scene (i.e., between the different encoding time instants and the different video objects to encode in each time instant), aiming at minimizing quality variations along time and between the various objects in the scene; and 2) the object-level, which is responsible for allocating the resources attributed to each object among the various types of data to code (for that object), notably texture and shape, and for computing the best encoding parameters to achieve the target bit allocations while maintaining smooth quality fluctuations. In terms of error resilience techniques, the object-based coding approach means that shape and composition information also have to be taken into account for error resilience purposes, in addition to motion and texture data. To do this, at the encoder side, the coding of video objects is typically supervised by a resilience configuration module, which is responsible for choosing the most adequate coding parameters in terms of resilience for each video object. This is important because the decoding performance will much depend on the protective actions the encoder has taken. At the decoder side, defensive actions have to be taken. This includes error detection and error localization for each decoded video object, followed by independent object-level error concealment. Afterwards, advanced scene-level error concealment is also performed, which has access to all the video objects in the scene and is used immediately before the final concealed video scene is presented to the user. In this chapter, the most recent basics, advances and trends in terms of rate control and error resilience for object-based video coding will be described.

[1]  Iain E. Garden Richardson,et al.  Adaptive algorithms for variable-complexity video coding , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[2]  Fernando Pereira,et al.  Temporal shape error concealment by global motion compensation with local refinement , 2006, IEEE Transactions on Image Processing.

[3]  Jordi Ribas-Corbera,et al.  Rate control in DCT video coding for low-delay communications , 1999, IEEE Trans. Circuits Syst. Video Technol..

[4]  Yo-Sung Ho,et al.  Bit allocation for MPEG-4 video coding with spatio-temporal tradeoffs , 2003, IEEE Trans. Circuits Syst. Video Technol..

[5]  Fernando Pereira,et al.  Refreshment need metrics for improved shape and texture object-based resilient video coding , 2003, IEEE Trans. Image Process..

[6]  Paulo Nunes,et al.  Rate and distortion models for MPEG-4 video encoding , 2004, SPIE Optics + Photonics.

[7]  Fernando Pereira,et al.  An alternative complexity model for the MPEG-4 video verifier mechanism , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[8]  Sethuraman Panchanathan,et al.  Error resiliency schemes in H.264/AVC standard , 2006, J. Vis. Commun. Image Represent..

[9]  Sanjit K. Mitra,et al.  A linear source model and a unified rate control algorithm for DCT video coding , 2002, IEEE Trans. Circuits Syst. Video Technol..

[10]  Yao Wang,et al.  Maximally smooth image recovery in transform coding , 1993, IEEE Trans. Commun..

[11]  Aggelos K. Katsaggelos,et al.  Shape error concealment using Hermite splines , 2004, IEEE Transactions on Image Processing.

[12]  Aggelos K. Katsaggelos,et al.  Error resilient video coding techniques , 2000, IEEE Signal Process. Mag..

[13]  Narciso García,et al.  Rate control and bit allocation for MPEG-4 , 1999, IEEE Trans. Circuits Syst. Video Technol..

[14]  Rene Klein Gunnewiek,et al.  Bit-rate control for MPEG encoders , 1995, Signal Process. Image Commun..

[15]  F. Pereira,et al.  MPEG-4 compliant video encoding: analysis and rate control strategies , 2000, Conference Record of the Thirty-Fourth Asilomar Conference on Signals, Systems and Computers (Cat. No.00CH37154).

[16]  Fernando Pereira,et al.  Joint Rate Control Algorithm for Low-Delay MPEG-4 Object-Based Video Encoding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Amy R. Reibman,et al.  Constraints on variable bit-rate video for ATM networks , 1992, IEEE Trans. Circuits Syst. Video Technol..

[18]  Jungwoo Lee,et al.  Hierarchical video indexing and retrieval for subband-coded video , 2000, IEEE Trans. Circuits Syst. Video Technol..

[19]  Mark Davis,et al.  The Unicode Standard, Version 3.0 , 2000 .

[20]  Yao Wang,et al.  Error control and concealment for video communication: a review , 1998, Proc. IEEE.

[21]  Sanjit K. Mitra,et al.  A unified rate-distortion analysis framework for transform coding , 2001, IEEE Trans. Circuits Syst. Video Technol..

[22]  Fernando Pereira,et al.  Adaptive shape and texture intra refreshment schemes for improved error resilience in object-based video coding , 2004, IEEE Transactions on Image Processing.

[23]  Paulo Nunes,et al.  Rate Control for Scenes with Multiple Arbitrarily Shaped Video Objects , 1997 .

[24]  Anthony Vetro,et al.  MPEG-4 rate control for multiple video objects , 1999, IEEE Trans. Circuits Syst. Video Technol..

[25]  Bernd Girod,et al.  Distributed Video Coding , 2005, Proceedings of the IEEE.

[26]  Paulo Nunes,et al.  Rate and Distortion Modeling Analysis for MPEG-4 Video Intra Coding , 2004 .

[27]  Touradj Ebrahimi,et al.  The MPEG-4 Book , 2002 .

[28]  S. Aign,et al.  Temporal and spatial error concealment techniques for hierarchical MPEG-2 video codec , 1995, Proceedings IEEE International Conference on Communications ICC '95.

[29]  Yu Sun,et al.  A robust and adaptive rate control algorithm for object-based video coding , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Faouzi Kossentini,et al.  A Concealment Method for Shape Information in MPEG-4 Coded Video Sequences , 2000, IEEE Trans. Multim..

[31]  Tihao Chiang,et al.  A new rate control scheme using quadratic rate distortion model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[32]  Fernando Pereira,et al.  Spatial shape error concealment for object-based image and video coding , 2004, IEEE Transactions on Image Processing.

[33]  Michael J. Gormish,et al.  Computation-rate-distortion in transform coders for image compression , 1993, Electronic Imaging.

[34]  Yu Sun,et al.  Asynchronous rate control for multi-object videos , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Sanjit K. Mitra,et al.  /spl rho/-domain source modeling and rate control for video coding and transmission , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[36]  Karl Johan Åström,et al.  Adaptive Control , 1989, Embedded Digital Control with Microcontrollers.

[37]  Hung-Ju Lee,et al.  Scalable rate control for very low bit rate (VLBR) video , 1997, Proceedings of International Conference on Image Processing.

[38]  Tihao Chiang,et al.  Scalable rate control for MPEG-4 video , 2000, IEEE Trans. Circuits Syst. Video Technol..

[39]  King Ngi Ngan,et al.  Dynamic Bit Allocation for Multiple Video Object Coding , 2006, IEEE Transactions on Multimedia.

[40]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[41]  Yafan Zhao Complexity management for video encoders , 2002, MULTIMEDIA '02.

[42]  Fernando Pereira,et al.  Spatio-Temporal Scene Level Error Concealment for Shape and Texture Data in Segmented Video Content , 2006, 2006 International Conference on Image Processing.

[43]  A. Vetro,et al.  Joint shape and texture rate control for MPEG-4 encoders , 1998, ISCAS '98. Proceedings of the 1998 IEEE International Symposium on Circuits and Systems (Cat. No.98CH36187).

[44]  Sanjit K. Mitra,et al.  . Optimum bit allocation and accurate rate control for video coding via ρ-domain source modeling , 2002, IEEE Trans. Circuits Syst. Video Technol..

[45]  Mark R. Pickering,et al.  Error concealment for arbitrarily shaped video objects , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[46]  Fernando Pereira,et al.  Error resilience and concealment performance for MPEG-4 frame-based video coding , 1999, Signal Process. Image Commun..

[47]  Fernando Pereira,et al.  Evaluating MPEG-4 video decoding complexity for an alternative video complexity verifier model , 2002, IEEE Trans. Circuits Syst. Video Technol..

[48]  Teresa H. Y. Meng,et al.  Transform coded image reconstruction exploiting interblock correlation , 1995, IEEE Trans. Image Process..

[49]  Fernando Pereira,et al.  Improved Feedback Compensation Mechanisms for Multiple Video Object Encoding Rate Control , 2007, 2007 IEEE International Conference on Image Processing.

[50]  Cheng Huang,et al.  Error concealment for shape coding , 2002, Proceedings. International Conference on Image Processing.

[51]  Sanjit K. Mitra,et al.  Object-level bit allocation and scalable rate control for MPEG-4 video coding , 2001, Proceedings of Workshop and Exhibition on MPEG-4 (Cat. No.01EX511).

[52]  Fernando Pereira,et al.  Scene level rate control algorithm for MPEG-4 video coding , 2000, IS&T/SPIE Electronic Imaging.

[53]  Aggelos K. Katsaggelos,et al.  Motion compensated shape error concealment , 2006, IEEE Transactions on Image Processing.