Adaptive motion-compensated video coding scheme towards content-based bit rate allocation

An adaptive motion-compensated video coding scheme, that is based on structural video component segmentation and cod- ing complexity analysis, is proposed in this paper. The bits are allo- cated more efficiently among different frame types and variant video components. A novel scene cut detection algorithm is proposed for partitioning the input video sequences into a set of shots and each shot may be encoded as one or multiple GOPs according to its length. Moreover, the positions of the reference frames (I and P frames) in a video shot are adapted to improve the temporal predict- ability among frames and provide high coding efficiency, thus high picture quality with the same bit rate. More bits are allocated for these reference frames for providing high quality of the recon- structed pictures. The residue frames in a video shot are encoded as the bidirectional interpolation frames (B frames) and can be also quantized more coarsely because they have high temporal predict- ability and are not used as references. The bits, that have been allocated for the three different frame types (I, P, B frames), can be further distributed more efficiently among variant video components to avoid the coding artifacts. Experimental results show that this proposed adaptive video coding scheme is more efficient than the traditional fixed GOP coding algorithms and may be an efficient de- velopment of the present adaptive coding techniques. © 2000 SPIE and IS&T. (S1017-9909(00)00504-3)

[1]  Dietmar Hepper,et al.  Efficiency analysis and application of uncovered background prediction in a low bit rate image coder , 1990, IEEE Trans. Commun..

[2]  A. Murat Tekalp,et al.  Temporal video segmentation using unsupervised clustering and semantic object tracking , 1998, J. Electronic Imaging.

[3]  Shih-Fu Chang,et al.  Scene change detection in an MPEG-compressed video sequence , 1995, Electronic Imaging.

[4]  Rong Wang,et al.  Image sequence segmentation based on 2D temporal entropic thresholding , 1996, Pattern Recognit. Lett..

[5]  Til Aach,et al.  Bayesian algorithms for adaptive change detection in image sequences using Markov random fields , 1995, Signal Process. Image Commun..

[6]  Levent Onural,et al.  Image sequence analysis for emerging interactive multimedia services-the European COST 211 framework , 1998, IEEE Trans. Circuits Syst. Video Technol..

[7]  Fuxi Gan,et al.  Spatiotemporal segmentation based on two-dimensional spatiotemporal entropic thresholding , 1997 .

[8]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[9]  A. Murat Tekalp,et al.  Content-based access to video objects: Temporal Segmentation, visual summarization, and feature extraction , 1998, Signal Process..

[10]  Atul Puri,et al.  Motion-compensated video coding with adaptive perceptual quantization , 1991, IEEE Trans. Circuits Syst. Video Technol..

[11]  A. D. Brink Thresholding of digital images using two-dimensional entropies , 1992, Pattern Recognit..

[12]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.

[13]  Hiroshi Watanabe,et al.  Two-stage motion compensation using adaptive global MC and local affine MC , 1997, IEEE Trans. Circuits Syst. Video Technol..

[14]  Cesar A. Gonzales,et al.  Motion video adaptive quantization in the transform domain , 1991, IEEE Trans. Circuits Syst. Video Technol..

[15]  Fuxi Gan,et al.  Motion estimation based on global and local uncompensability analysis , 1997, IEEE Trans. Image Process..

[16]  Jianping Fan,et al.  Hierarchical object-oriented image and video segmentation algorithm based on 2D entropic thresholding , 1998, Other Conferences.

[17]  Jungwoo Lee,et al.  Temporally adaptive motion interpolation exploiting temporal masking in visual perception , 1994, IEEE Trans. Image Process..

[18]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, J. Electronic Imaging.

[19]  Nariman Farvardin,et al.  A perceptually motivated three-component image model-Part I: description of the model , 1995, IEEE Trans. Image Process..

[20]  King Ngi Ngan,et al.  Reduction of blocking artifacts in image and video coding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[21]  Wei Ding,et al.  Rate control of MPEG video coding and recording by rate-quantization modeling , 1996, IEEE Trans. Circuits Syst. Video Technol..

[22]  Bernd Girod,et al.  The Information Theoretical Significance of Spatial and Temporal Masking in Video Signals , 1989, Photonics West - Lasers and Applications in Science and Engineering.

[23]  Thomas D. C. Little,et al.  A Survey of Technologies for Parsing and Indexing Digital Video1 , 1996, J. Vis. Commun. Image Represent..

[24]  Yushan Tan,et al.  Automated three-dimensional surface profilometry using dual-frequency optic fiber phase-shifting method , 1997 .

[25]  Nilesh V. Patel,et al.  Video shot detection and characterization for video databases , 1997, Pattern Recognit..

[26]  Fuxi Gan,et al.  Adaptive motion-compensated interpolation based on spatiotemporal segmentation , 1998, Signal Process. Image Commun..

[27]  Antonio Ortega,et al.  Bit allocation for dependent quantization with applications to multiresolution and MPEG video coders , 1994, IEEE Trans. Image Process..