Probabilistic Motion Segmentation of Videos for Temporal Super Resolution

A novel scheme is proposed for achieving motion segmentation in low-frame rate videos, with application to temporal super resolution. Probabilistic generative models are commonly used to perform unsupervised motion segmentation in videos. While they provide a general and elegant framework, they are hampered by severe local minima problems and often converge to inaccurate solutions, when there are more than one foreground object in videos. This paper proposes a scheme, where discriminative global constraints are enforced in combination with generative learning, to overcome the local minima problems. We demonstrate the effectiveness of the proposed scheme by learning the appearances and motions of multiple objects from a low frame rate video with a small number of frames.

[1]  Andrew Blake,et al.  Generative Affine Localisation and Tracking , 2004, NIPS.

[2]  Brendan J. Frey,et al.  Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Roberto Cipolla,et al.  Hole Filling Through Photomontage , 2005, BMVC.

[4]  Charles M. Bishop,et al.  Variational Message Passing , 2005, J. Mach. Learn. Res..

[5]  Eli Shechtman,et al.  Space-time video completion , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[6]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[7]  Brendan J. Frey,et al.  Video Epitomes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Andrew Zisserman,et al.  Learning Layered Motion Segmentations of Video , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.