Temporal image sequence prediction using motion field interpolation

Abstract A new method for motion-compensated temporal prediction of image sequences is proposed. Motion vector fields in natural scenes should possess two basic properties. First, the field should be smoothly varying within moving objects to compensate for nonrigid or rotational motion, and scaling of objects. Second, the field should be discontinuous along the boundaries of the objects. In the proposed method the motion vector field is modelled using finite element methods and interpolated using adaptive interpolators to satisfy the above-stated requirements. This is particularly important when only very sparse estimates of motion vector fields are available in the decoder due to bit-rate constraints limiting the amount of overhead information that can be transmitted. The proposed prediction method can be applied for low-bit-rate video coding in conventional codecs based on motion-compensated prediction and transform coding, as well as in model-based codecs. The performance of the proposed method is compared with standard motion-compensated prediction based on block matching. It is shown that for simple video telephony scenes a reduction of more than 30% in the energy of the prediction error can be achieved with an unchanged number of transmitted motion vectors and with only a modest increase in computational complexity. When implemented in an H.261 codec the new prediction method can improve the peak SNR 1–2 dB producing a significant visual improvement.

[1]  Michael T. Orchard Predictive motion-field segmentation for image sequence coding , 1993, IEEE Trans. Circuits Syst. Video Technol..

[2]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[3]  Gary J. Sullivan,et al.  Motion compensation for video compression using control grid interpolation , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4]  N. Diehl,et al.  Model-Based Image Sequence Coding , 1993 .

[5]  Michael Hötter,et al.  Object-oriented analysis-synthesis coding based on moving two-dimensional objects , 1990, Signal Process. Image Commun..

[6]  M. Sezan,et al.  Motion analysis and image sequence processing , 1993 .

[7]  George Wolberg,et al.  Digital image warping , 1990 .

[8]  J. Nieweglowski,et al.  A novel video coding scheme based on temporal prediction using digital image warping , 1993, IEEE 1993 International Conference on Consumer Electronics Digest of Technical Papers.

[9]  Pertti Roivainen,et al.  3-D Motion Estimation in Model-Based Facial Image Coding , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Gary J. Sullivan,et al.  Multi-hypothesis motion compensation for low bit-rate video coding , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  K. Huebner The finite element method for engineers , 1975 .

[12]  Allen M. Waxman,et al.  The Analytic Structure of Image Flows: Deformation and Segmentation , 1990, Comput. Vis. Graph. Image Process..

[13]  Jake K. Aggarwal,et al.  On the computation of motion from sequences of images-A review , 1988, Proc. IEEE.

[14]  Berthold K. P. Horn Robot vision , 1986, MIT electrical engineering and computer science series.

[15]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[16]  P. Pirsch,et al.  Advances in picture coding , 1985, Proceedings of the IEEE.

[17]  Norbert Diehl,et al.  Object-oriented motion estimation and segmentation in image sequences , 1991, Signal Process. Image Commun..