Motion-based object segmentation and estimation using the MDL principle

There are increasing demands for ultralow bit-rate video image transmission. We present a new formulation of moving object segmentation and motion estimation to quantify the potential gain in using object-oriented motion compensation in image sequence coding. Motivated by real-valued parameter estimation required in object-oriented motion-compensated video source coding, a framework motivated by Rissanen's (1983) minimum description length (MDL) principle is proposed to more tightly couple motion estimation and object segmentation algorithms to the overall objective of minimizing source bit rate. A new objective function is constructed, and a suboptimal procedure to segment and estimate moving objects in a scene is proposed. Each object is represented by chain-coded block boundaries, affine motion parameters, and motion-compensated prediction error. A number of experimental comparisons between block- and object-oriented coding schemes suggests a significant potential coding gain using object-oriented motion-compensated coding.

[1]  J. Rissanen Stochastic complexity and the mdl principle , 1987 .

[2]  C. S. Wallace,et al.  An Information Measure for Classification , 1968, Comput. J..

[3]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[4]  Michael Werman,et al.  Variations on regularization , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[5]  Ping Wah Wong,et al.  Chain codes and their linear reconstruction filters , 1992, IEEE Trans. Inf. Theory.

[6]  Minoru Asada,et al.  The optimal partition of moving edge segments , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Norbert Diehl,et al.  Object-oriented motion estimation and segmentation in image sequences , 1991, Signal Process. Image Commun..

[8]  A. Pentland,et al.  Robust estimation of a multi-layered motion representation , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[9]  Rama Chellappa,et al.  Segmentation and 2-D motion estimation of noisy image sequences , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[10]  M. Eden,et al.  On the performance of a contour coding algorithm in the context of image coding part I: Contour segment coding , 1985 .

[11]  J. Rissanen A UNIVERSAL PRIOR FOR INTEGERS AND ESTIMATION BY MINIMUM DESCRIPTION LENGTH , 1983 .

[12]  Michal Irani,et al.  Image sequence enhancement using multiple motions analysis , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  P. Pirsch,et al.  Advances in picture coding , 1985, Proceedings of the IEEE.

[14]  F. Rocca,et al.  Interframe Redundancy Reduction of Video Signals Generated by Translating Objects , 1977, IEEE Trans. Commun..

[15]  Anil K. Jain,et al.  Displacement Measurement and Its Application in Interframe Image Coding , 1981, IEEE Trans. Commun..

[16]  Hiroshi Harashima,et al.  Model-based/waveform hybrid coding for videotelephone images , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[17]  Eric Dubois,et al.  Bayesian Estimation of Motion Vector Fields , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Michael Hötter,et al.  Object-oriented analysis-synthesis coding based on moving two-dimensional objects , 1990, Signal Process. Image Commun..

[19]  Demetri Terzopoulos,et al.  Analysis and Synthesis of Facial Image Sequences Using Physical and Anatomical Models , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Yianni Attikiouzel,et al.  Model-based region growing segmentation of textured images , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[21]  Thomas S. Huang,et al.  Image sequence analysis , 1981 .

[22]  Magdy A. Bayoumi,et al.  Image segmentation on a 2D array by a directed split and merge procedure , 1992, IEEE Trans. Signal Process..

[23]  Ming Lei Liou,et al.  Overview of the p×64 kbit/s video coding standard , 1991, CACM.

[24]  Michael Werman,et al.  Segmentation by minimum length encoding , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[25]  Michael T. Orchard Predictive motion-field segmentation for image sequence coding , 1993, IEEE Trans. Circuits Syst. Video Technol..

[26]  A. Yuille,et al.  A common theoretical framework for visual motion's spatial and temporal coherence , 1989, [1989] Proceedings. Workshop on Visual Motion.

[27]  Joachim Dengler,et al.  Estimation of discontinuous displacement vector fields with the minimum description length criterion , 1990, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Jake K. Aggarwal,et al.  On the computation of motion from sequences of images-A review , 1988, Proc. IEEE.

[29]  Edward H. Adelson,et al.  Layered representation for image sequence coding , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[30]  Alex Pentland,et al.  Segmentation by minimal description , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[31]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[32]  Henri Nicolas,et al.  Global motion identification for image sequence analysis and coding , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.