论文信息 - Object Segmentation by Long Term Analysis of Point Trajectories

Object Segmentation by Long Term Analysis of Point Trajectories

Unsupervised learning requires a grouping step that defines which data belong together. A natural way of grouping in images is the segmentation of objects or parts of objects. While pure bottom-up segmentation from static cues is well known to be ambiguous at the object level, the story changes as soon as objects move. In this paper, we present a method that uses long term point trajectories based on dense optical flow. Defining pair-wise distances between these trajectories allows to cluster them, which results in temporally consistent segmentations of moving objects in a video shot. In contrast to multi-body factorization, points and even whole objects may appear or disappear during the shot. We provide a benchmark dataset and an evaluation method for this so far uncovered setting.

Jitendra Malik | Thomas Brox | T. Brox | Jitendra Malik

[1] O. Reiser,et al. Principles Of Gestalt Psychology , 1936 .

[2] Elizabeth S. Spelke,et al. Principles of Object Perception , 1990, Cogn. Sci..

[3] Edward H. Adelson,et al. Representing moving images with layers , 1994, IEEE Trans. Image Process..

[4] Takeo Kanade,et al. A multi-body factorization method for motion analysis , 1995, Proceedings of IEEE International Conference on Computer Vision.

[5] Yair Weiss,et al. Smoothness in layers: Motion segmentation using nonparametric mixture estimation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7] Jitendra Malik,et al. Finding Boundaries in Natural Images: A New Method Using Point Descriptors and Area Completion , 1998, ECCV.

[8] Jitendra Malik,et al. Motion segmentation and tracking using normalized cuts , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[9] Bernd Neumann,et al. Computer Vision — ECCV’98 , 1998, Lecture Notes in Computer Science.

[10] Michael I. Jordan,et al. On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[11] Paul Smith,et al. Layered motion segmentation and depth ordering by tracking edges , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] Daniel Cremers,et al. Motion Competition: A variational framework for piecewise parametric motion segmentation , 2005 .

[13] Mubarak Shah,et al. Motion layer extraction in the presence of occlusion using graph cuts , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Daniel Cremers,et al. Motion Competition: A Variational Approach to Piecewise Parametric Motion Segmentation , 2005, International Journal of Computer Vision.

[15] Andrew Zisserman,et al. Object Level Grouping for Video Shots , 2004, International Journal of Computer Vision.

[16] Andrew Zisserman,et al. Learning Layered Motion Segmentations of Video , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[17] W. Eric L. Grimson,et al. Learning Semantic Scene Models by Trajectory Analysis , 2006, ECCV.

[18] Marc Pollefeys,et al. A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid, Degenerate and Non-degenerate , 2006, ECCV.

[19] Axel Pinz,et al. Computer Vision – ECCV 2006 , 2006, Lecture Notes in Computer Science.

[20] Roberto Cipolla,et al. Unsupervised Bayesian Detection of Independent Motion in Crowds , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21] Seth J. Teller,et al. Particle Video: Long-Range Motion Estimation Using Point Trajectories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[22] René Vidal,et al. A Benchmark for the Comparison of 3-D Motion Segmentation Algorithms , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[23] René Vidal,et al. Motion segmentation via robust subspace separation in the presence of outlying, incomplete, or corrupted trajectories , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[24] Ehsan Elhamifar,et al. Sparse subspace clustering , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[25] Patrick Pérez,et al. Clustering Point Trajectories with Various Life-Spans , 2009, 2009 Conference for Visual Media Production.

[26] Anil M. Cheriyadat,et al. Non-negative matrix factorization of partial track data for motion segmentation , 2010, 2009 IEEE 12th International Conference on Computer Vision.

[27] Kurt Keutzer,et al. Dense Point Trajectories by GPU-Accelerated Large Displacement Optical Flow , 2010, ECCV.

[28] Jitendra Malik,et al. Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.