The recovery of a near optimal layer representation for an entire image sequence

Wang and Adelson see (IEEE Trans. Image Proc., vol.3, no.5, p.625-38, 1994) proposed the layer representation as a convenient representation for video coding, format conversion and special effects. We present a technique to extract the near optimal layer representation of an image sequence using the EM algorithm. We incorporate forward/backward motion estimation to obtain a sharp segmentation in the presence of occlusion and uncovering. We demonstrate that the algorithm is capable of tracking layers within a sequence, and detecting when layers enter or leave the sequence.

[1]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[2]  Harpreet S. Sawhney,et al.  Layered representation of motion video using robust maximum-likelihood estimation of mixture models and MDL encoding , 1995, Proceedings of IEEE International Conference on Computer Vision.

[3]  Steven D. Blostein,et al.  Motion-based object segmentation and estimation using the MDL principle , 1995, IEEE Trans. Image Process..

[4]  Noel E. O'Connor,et al.  Object detection and tracking using an EM-based motion estimation and segmentation framework , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[5]  Michael J. Black,et al.  Mixture models for optical flow computation , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .