Quad-Tree Motion Modeling With Leaf Merging

In this paper, we are concerned with the modeling of motion between frames of a video sequence. Typically, it is not possible to represent the motion between frames by a single model and therefore a quad-tree structure is often employed where smaller, variable size regions or blocks are allowed to take on separate motion models. Previous work into quad-tree representations has demonstrated the sub-optimal performance of quad-trees where the dependency between neighboring leaf nodes with different parents is not exploited. Leaf merging has been proposed to rectify this performance loss as it allows joint coding and optimization of related nodes. In this paper, we describe how the merging step can be incorporated into quad-tree motion representations for a range of motion modeling contexts. In particular, we study the impact of rate-distortion optimized merging for two motion coding schemes, these being spatially predictive coding, as used in H.264, and hierarchical coding. We present experimental results which demonstrate that node merging can provide significant gains for both the hierarchical and spatial prediction schemes. Interestingly, experimental results also show that in the presence of merging, the rate-distortion performance of hierarchical coding is comparable to that of spatial prediction. We pursue the case of hierarchical coding further in this paper, introducing polynomial motion models to the quad-tree representation and exploring resolution scalability of the merged quad-tree structure. We also present a theoretical study of the impact of leaf merging in modeling motion, identifying the inherent advantages of merging which give rise to a more efficient description of frame motion.

[1]  Marta Mrak,et al.  Evaluation of Techniques for Modeling of Layered Motion Structure , 2006, 2006 International Conference on Image Processing.

[2]  Reji K. Mathew Quad-tree motion models for scalable video coding applications , 2009 .

[3]  Michael G. Strintzis,et al.  Optimization of Quadfree Segmentation and Hybrid Two-Dimensional and Three-Dimensional Motion Estimation in a Rate-Distortion Framework , 1997, IEEE J. Sel. Areas Commun..

[4]  Thomas Wiegand,et al.  Multiple-reference-picture video coding using polynomial motion models , 1998, Electronic Imaging.

[5]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[6]  Augusto Sarti,et al.  Scalable coding of variable size blocks motion vectors , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[7]  Hsueh-Ming Hang,et al.  Motion information scalability for MC-EZBC , 2004, Signal Process. Image Commun..

[8]  Feng Wu,et al.  Layered motion estimation and coding for fully scalable 3D wavelet video coding , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[9]  Frank Müller,et al.  Hierarchical coding of motion vector fields , 1995, Proceedings., International Conference on Image Processing.

[10]  Reji Mathew,et al.  Fully scalable video compression with sample-adaptive lifting and overlapped block motion , 2005, IS&T/SPIE Electronic Imaging.

[11]  Stefano Tubaro,et al.  Motion Estimation by Quadtree Pruning and Merging , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[12]  Reji Mathew,et al.  Hierarchical and Polynomial Motion Modeling with Quad-Tree Leaf Merging , 2006, 2006 International Conference on Image Processing.

[13]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[14]  Mark R. Pickering,et al.  Extended motion compensation using larger blocks and an elastic motion model , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[15]  David S. Taubman,et al.  On the benefits of leaf merging in quad-tree motion models , 2005, IEEE International Conference on Image Processing 2005.

[16]  Wai-kuen Cham,et al.  Fast Motion Estimation for H.264/AVC in Walsh–Hadamard Domain , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Minh N. Do,et al.  Rate-distortion optimized tree-structured compression algorithms for piecewise polynomial images , 2005, IEEE Transactions on Image Processing.

[18]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[19]  David S. Taubman,et al.  Highly scalable video compression with scalable motion coding , 2003, ICIP.

[20]  M. Vetterli,et al.  Approximation and compression of piecewise smooth functions , 1999, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[21]  Philip W. Kuchel,et al.  Average Lengths of Chords in a Square , 1981 .

[22]  David S. Taubman,et al.  Highly scalable video compression with scalable motion coding , 2003, IEEE Transactions on Image Processing.

[23]  S. Chae,et al.  Hierarchical motion estimation in Hadamard transform domain , 1999 .

[24]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[25]  David S. Taubman,et al.  Lifting-based invertible motion adaptive transform (LIMAT) framework for highly scalable video compression , 2003, IEEE Trans. Image Process..