Efficient background video coding with static sprite generation and arbitrary-shape spatial prediction techniques

An efficient static sprite-generation and compression scheme is proposed for background video coding. First, considering both short- and long-term motion influences, a hybrid technique is proposed in the sprite generation to search the background motion relative to the generated sprite image. In order to prevent the sprite image from being blurred due to the inaccurate segmentation, a reliability-based blending technique is developed. Both the background video at every frame and the sprite image are categorized into reliable, unreliable, and undefined regions. Each region uses a unique strategy in blending. This makes the proposed technique work well even in the case of coarse segmentation. Second, an arbitrary-shape spatial prediction technique is proposed to efficiently compress the sprite image. Due to the property of the background video object, the sprite image often contains many flat and texture-correlation regions. Therefore, utilizing directional spatial prediction can significantly improve the efficiency of sprite coding. In general, the generated sprite is not a rectangular image. Since traditional spatial-prediction methods have difficulty in handling those contour blocks, a padding technique derived from rate-distortion optimization is proposed. Experimental results show that the proposed scheme can generate the background sprite image with better visual quality. Compared with the MPEG-4 sprite-coding scheme, it can also significantly improve the coding efficiency up to 3.0 dB at low bit rates.

[1]  King Ngi Ngan,et al.  Video segmentation for content-based coding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[2]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[3]  Nikolaos Grammalidis,et al.  Sprite generation and coding of multiview image sequences , 1999, Proceedings 10th International Conference on Image Analysis and Processing.

[4]  Frédéric Dufaux,et al.  Efficient, robust, and fast global motion estimation for video coding , 2000, IEEE Trans. Image Process..

[5]  N. Grammalidis,et al.  Multiview sprite generation and coding , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[6]  King Ngi Ngan,et al.  Automatic segmentation of moving objects for video object plane generation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[7]  Richard Szeliski,et al.  Creating full view panoramic image mosaics and environment maps , 1997, SIGGRAPH.

[8]  Michal Irani,et al.  Video indexing based on mosaic representations , 1998, Proc. IEEE.

[9]  Aljoscha Smolic,et al.  Long-term global motion estimation and its application for sprite coding, content description, and segmentation , 1999, IEEE Trans. Circuits Syst. Video Technol..

[10]  Wen Gao,et al.  Sprite generation for frame-based video coding , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[11]  Nikolaos Grammalidis,et al.  Sprite generation and coding in multiview image sequences , 2000, IEEE Trans. Circuits Syst. Video Technol..

[12]  Frédéric Dufaux,et al.  Background mosaicking for low bit rate video coding , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[13]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[14]  M. Meribout Video Segmentation for Content-based Coding , 2004 .

[15]  Wen Gao,et al.  Fast and Robust Sprite Generation for MPEG-4 Video Coding , 2001, IEEE Pacific Rim Conference on Multimedia.

[16]  K.N. Ngan,et al.  Foreground/background video coding scheme , 1997, Proceedings of 1997 IEEE International Symposium on Circuits and Systems. Circuits and Systems in the Information Age ISCAS '97.

[17]  Richard Szeliski,et al.  Video mosaics for virtual environments , 1996, IEEE Computer Graphics and Applications.

[18]  Thomas Sikora,et al.  The MPEG-4 video standard verification model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[19]  P. Anandan,et al.  Efficient representations of video sequences and their applications , 1996, Signal Process. Image Commun..

[20]  Richard Szeliski,et al.  A layered video object coding system using sprite and affine motion model , 1997, IEEE Trans. Circuits Syst. Video Technol..