Modeling Motion for Spatial Scalability

The dramatic proliferation of visual displays, from cell phones, through video iPods, PDAs, and notebooks, to high-quality HDTV screens, has raised the demand for a video compression scheme capable of decoding a "once-encoded" video at a range of supported video resolutions and with high quality. A promising solution to this problem has been recently proposed in the form of wavelet video coding based on motion-compensated temporal filtering (MCTF); scalability is naturally supported while efficiency is comparable to state-of-the-art hybrid coders. However, although rate (quality) and temporal scalability are natural in mainstream "t+2D" wavelet video coders, spatial scalability suffers from drift problems. In the light of the recently proposed "2D+t+2D" modification, which targets spatial scalability performance, we present a framework for the modeling of spatially-scalable motion that is well matched to this new structure. We propose a motion estimation scheme in which motion fields at different spatial scales are jointly estimated and coded. In addition, at lower spatial resolutions, we extend the block-wise constant motion model to a higher-order model based on cubic splines, effectively creating a "mixture motion model" that combines different models at different supported spatial scales. This advanced spatial modeling of motion significantly improves the coding efficiency of motion at low resolutions and leads to an excellent over-all compression performance; spatial scalability performance of the proposed scheme approaches that of a non-scalable coder

[1]  Alberto Signoroni,et al.  A fully scalable video coder with inter-scale wavelet prediction and morphological coding , 2005, Visual Communications and Image Processing.

[2]  Feng Wu,et al.  Spatial Scalability in 3D Wavelet Coding with Spatial Domain MCTF Encoder , 2004 .

[3]  Jens-Rainer Ohm,et al.  Three-dimensional subband coding with motion compensation , 1994, IEEE Trans. Image Process..

[4]  David S. Taubman,et al.  Motion-compensated highly scalable video compression using an adaptive 3D wavelet transform based on lifting , 2001, ICIP.

[5]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[6]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[7]  Avideh Zakhor,et al.  Multirate 3-D subband coding of video , 1994, IEEE Trans. Image Process..

[8]  Feng Wu,et al.  Optimal subband rate allocation for spatial scalability in 3D wavelet video coding with motion aligned temporal filtering , 2005, Visual Communications and Image Processing.

[9]  Nariman Farvardin,et al.  Three-dimensional subband coding of video , 1995, IEEE Trans. Image Process..

[10]  Shipeng Li,et al.  Motion compensated lifting wavelet and its application in video coding , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[11]  Béatrice Pesquet-Popescu,et al.  Three-dimensional lifting schemes for motion compensated video compression , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[12]  Mihaela van der Schaar,et al.  Complete-to-overcomplete discrete wavelet transforms for scalable video coding with MCTF , 2003, Visual Communications and Image Processing.

[13]  E. Dubois,et al.  The sampling and reconstruction of time-varying imagery with application in video systems , 1985, Proceedings of the IEEE.

[14]  Gunnar Karlsson,et al.  Three dimensional sub-band coding of video , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[15]  Feng Wu,et al.  Studies on spatial scalable frameworks for motion aligned 3D wavelet video coding , 2005, Visual Communications and Image Processing.

[16]  David S. Taubman,et al.  A flexible structure for fully scalable motion-compensated 3-D DWT with emphasis on the impact of spatial scalability , 2006, IEEE Transactions on Image Processing.

[17]  Zixiang Xiong,et al.  Three-dimensional embedded subband coding with optimized truncation , 2001 .

[18]  Harry Shum,et al.  Motion estimation with quadtree splines , 1995, Proceedings of IEEE International Conference on Computer Vision.