An optimal quadtree-based motion estimation and motion-compensated interpolation scheme for video compression

We propose an optimal quadtree (QT)-based motion estimator for video compression. It is optimal in the sense that for a given bit budget for encoding the displacement vector field (DVF) and the QT segmentation, the scheme finds a DVF and a QT segmentation which minimizes the energy of the resulting displaced frame difference (DFD). We find the optimal QT decomposition and the optimal DVF jointly using the Lagrangian multiplier method and a multilevel dynamic program. We introduce a new, very fast convex search for the optimal Lagrangian multiplier lambda(*), which results in a very fast convergence of the Lagrangian multiplier method. The resulting DVF is spatially inhomogeneous, since large blocks are used in areas with simple motion and small blocks in areas with complex motion. We also propose a novel motion-compensated interpolation scheme which uses the same mathematical tools developed for the QT-based motion estimator. One of the advantages of this scheme is the globally optimal control of the tradeoff between the interpolation error energy and the DVF smoothness. Another advantage is that no interpolation of the DVF is required since we directly estimate the DVF and the QT-segmentation for the frame which needs to be interpolated. We present results with the proposed QT-based motion estimator which show that for the same DFD energy the proposed estimator uses about 25% fewer bits than the commonly used block matching algorithm. We also experimentally compare the interpolated frames using the proposed motion compensated interpolation scheme with the reconstructed original frames.

[1]  Aggelos K. Katsaggelos,et al.  A Theory for the Optimal Bit Allocation Between Displacement Vector Field and Displaced Frame Difference , 1997, IEEE J. Sel. Areas Commun..

[2]  Baback Moghaddam,et al.  Space-filling curves for image compression , 1991, Defense, Security, and Sensing.

[3]  Gary J. Sullivan,et al.  Efficient quadtree coding of images and video , 1994, IEEE Trans. Image Process..

[4]  Jarkko Kari,et al.  Video compression by mean-corrected motion compensation of partial quadtrees , 1997, IEEE Trans. Circuits Syst. Video Technol..

[5]  Philip A. Chou,et al.  Optimal pruning with applications to tree-structured source coding and modeling , 1989, IEEE Trans. Inf. Theory.

[6]  Bernd Girod,et al.  Rate-constrained motion estimation , 1994, Other Conferences.

[7]  Aggelos K. Katsaggelos,et al.  Low bit rate video coding using robust motion vector regeneration in the decoder , 1994, IEEE Trans. Image Process..

[8]  Jungwoo Lee,et al.  Optimal quadtree for variable block size motion estimation , 1995, Proceedings., International Conference on Image Processing.

[9]  Henri Nicolas,et al.  Region-based motion estimation using deterministic relaxation schemes for image sequence coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Aggelos K. Katsaggelos,et al.  Rate-Distortion Based Video Compression: Optimal Video Frame Compression and Object Boundary Encoding , 1996 .

[11]  Antonio Ortega,et al.  Bit allocation for dependent quantization with applications to multiresolution and MPEG video coders , 1994, IEEE Trans. Image Process..

[12]  Rangaraj M. Rangayyan,et al.  Lossless compression of Peanoscanned images , 1994, J. Electronic Imaging.

[13]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[14]  Frederic Dufaux,et al.  Digital video codec for medium bitrate transmission , 1991, Other Conferences.

[15]  K Ramchandran,et al.  Best wavelet packet bases in a rate-distortion sense , 1993, IEEE Trans. Image Process..

[16]  Hanan Samet,et al.  The Quadtree and Related Hierarchical Data Structures , 1984, CSUR.

[17]  M. Bierling,et al.  Displacement Estimation By Hierarchical Blockmatching , 1988, Other Conferences.

[18]  D. A. Bell,et al.  Information Theory and Reliable Communication , 1969 .

[19]  Jörn Ostermann,et al.  Object-oriented analysis-synthesis coding of moving images , 1989, Signal Process. Image Commun..

[20]  Aggelos K. Katsaggelos,et al.  Fast and efficient mode and quantizer selection in the rate distortion sense for H.263 , 1996, Other Conferences.

[21]  Harvey J. Everett Generalized Lagrange Multiplier Method for Solving Problems of Optimum Allocation of Resources , 1963 .

[22]  Aggelos K. Katsaggelos,et al.  Rate-Distortion Based Video Compression , 1997, Springer US.

[23]  SametHanan The Quadtree and Related Hierarchical Data Structures , 1984 .

[24]  Benoit B. Mandelbrot,et al.  Fractal Geometry of Nature , 1984 .