论文信息 - Ieee Transactions on Pattern Analysis and Machine Intelligence Segmentation of Moving Objects by Long Term Video Analysis

Ieee Transactions on Pattern Analysis and Machine Intelligence Segmentation of Moving Objects by Long Term Video Analysis

Motion is a strong cue for unsupervised object-level grouping. In this paper, we demonstrate that motion will be exploited most effectively, if it is regarded over larger time windows. Opposed to classical two-frame optical flow, point trajectories that span hundreds of frames are less susceptible to short-term variations that hinder separating different objects. As a positive side effect, the resulting groupings are temporally consistent over a whole video shot, a property that requires tedious post-processing in the vast majority of existing approaches. We suggest working with a paradigm that starts with semi-dense motion cues first and that fills up textureless areas afterwards based on color. This paper also contributes the Freiburg-Berkeley motion segmentation (FBMS) dataset, a large, heterogeneous benchmark with 59 sequences and pixel-accurate ground truth annotation of moving objects.

[1] Christoph Schnörr,et al. Spectral clustering of linear subspaces for motion segmentation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[2] Antonin Chambolle,et al. A First-Order Primal-Dual Algorithm for Convex Problems with Applications to Imaging , 2011, Journal of Mathematical Imaging and Vision.

[3] Jan-Michael Frahm,et al. Fast gain-adaptive KLT tracking on the GPU , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[4] Gareth Funka-Lea,et al. Graph Cuts and Efficient N-D Image Segmentation , 2006, International Journal of Computer Vision.

[5] Stefano Soatto,et al. Sparse Occlusion Detection with Optical Flow , 2012, International Journal of Computer Vision.

[6] Scott Cohen,et al. LIVEcut: Learning-based interactive video segmentation by evaluation of multiple propagated cues , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[7] Nahum Kiryati,et al. Piecewise-Smooth Dense Optical Flow via Level Sets , 2006, International Journal of Computer Vision.

[8] O. Reiser,et al. Principles Of Gestalt Psychology , 1936 .

[9] Zhixun Su,et al. Fixed-rank representation for unsupervised visual learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Philip H. S. Torr,et al. Combining Appearance and Structure from Motion Features for Road Scene Understanding , 2009, BMVC.

[11] Edward H. Adelson,et al. Representing moving images with layers , 1994, IEEE Trans. Image Process..

[12] D. Cremers. Convex Relaxation Techniques for Segmentation , Stereo and Multiview Reconstruction , 2010 .

[13] C. V. Jawahar,et al. Scene Text Recognition using Higher Order Language Priors , 2009, BMVC.

[14] Michael I. Jordan,et al. On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[15] Thomas Brox,et al. Higher order motion models and spectral clustering , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16] William Brendel,et al. Video object segmentation by tracking regions , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[17] Jitendra Malik,et al. Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[19] Jitendra Malik,et al. Motion segmentation and tracking using normalized cuts , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[20] René Vidal,et al. Multiframe Motion Segmentation with Missing Data Using PowerFactorization and GPCA , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[21] Jitendra Malik,et al. Finding Boundaries in Natural Images: A New Method Using Point Descriptors and Area Completion , 1998, ECCV.

[22] Carlo Tomasi,et al. Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[23] Katerina Fragkiadaki,et al. Detection free tracking: Exploiting motion and topology for segmenting and tracking under entanglement , 2011, CVPR 2011.

[24] Nikos Komodakis,et al. Performance vs computational efficiency for optimizing single and dynamic MRFs: Setting the state of the art with primal-dual strategies , 2008, Comput. Vis. Image Underst..

[25] Anil M. Cheriyadat,et al. Non-negative matrix factorization of partial track data for motion segmentation , 2010, 2009 IEEE 12th International Conference on Computer Vision.

[26] Cordelia Schmid,et al. Learning object class detectors from weakly annotated video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[27] Yong Yu,et al. Robust Subspace Segmentation by Low-Rank Representation , 2010, ICML.

[28] Nanning Zheng,et al. Video object segmentation by clustering region trajectories , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[29] Ivan Laptev,et al. Track to the future: Spatio-temporal video segmentation with long-range motion cues , 2011, CVPR 2011.

[30] Guillermo Sapiro,et al. A Geodesic Framework for Fast Interactive Image and Video Segmentation and Matting , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[31] Daniel Cremers,et al. On Local Region Models and a Statistical Interpretation of the Piecewise Smooth Mumford-Shah Functional , 2009, International Journal of Computer Vision.

[32] Daniel Cremers,et al. Motion Competition: A Variational Approach to Piecewise Parametric Motion Segmentation , 2005, International Journal of Computer Vision.

[33] Joaquim Salvi,et al. Enhanced Local Subspace Affinity for feature-based motion segmentation , 2011, Pattern Recognit..

[34] Daniel Cremers,et al. TVSeg - Interactive Total Variation Based Image Segmentation , 2008, BMVC.

[35] Christoph Schnörr,et al. Convex optimization for multi-class image labeling with a novel family of total variation based regularizers , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[36] Jan-Michael Frahm,et al. Feature tracking and matching in video using programmable graphics hardware , 2007, Machine Vision and Applications.

[37] René Vidal,et al. A Benchmark for the Comparison of 3-D Motion Segmentation Algorithms , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[38] Thomas Brox,et al. Object segmentation in video: A hierarchical variational approach for turning point trajectories into dense regions , 2011, 2011 International Conference on Computer Vision.

[39] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[40] Leo Grady,et al. A Seeded Image Segmentation Framework Unifying Graph Cuts And Random Walker Which Yields A New Algorithm , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[41] René Vidal,et al. Motion segmentation via robust subspace separation in the presence of outlying, incomplete, or corrupted trajectories , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[42] Joachim M. Buhmann,et al. Seeing the Objects Behind the Dots: Recognition in Videos from a Moving Camera , 2009, International Journal of Computer Vision.

[43] Katerina Fragkiadaki,et al. Video segmentation by tracing discontinuities in a trajectory embedding , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[44] Katerina Fragkiadaki,et al. Two-Granularity Tracking: Mediating Trajectory and Detection Graphs for Tracking under Occlusions , 2012, ECCV.

[45] Takeo Kanade,et al. A multi-body factorization method for motion analysis , 1995, Proceedings of IEEE International Conference on Computer Vision.

[46] Mubarak Shah,et al. Motion layer extraction in the presence of occlusion using graph cuts , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47] Andrew Zisserman,et al. Learning Layered Motion Segmentations of Video , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[48] Daniel Cremers,et al. Motion Competition: A variational framework for piecewise parametric motion segmentation , 2005 .

[49] Nikos Komodakis,et al. Approximate Labeling via Graph Cuts Based on Linear Programming , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50] Roberto Cipolla,et al. Segmentation and Recognition Using Structure from Motion Point Clouds , 2008, ECCV.

[51] Seth J. Teller,et al. Particle Video: Long-Range Motion Estimation Using Point Trajectories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[52] Mei Han,et al. Efficient hierarchical graph-based video segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[53] Leo Grady,et al. Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[54] Thomas Brox,et al. Variational Motion Segmentation with Level Sets , 2006, ECCV.

[55] René Vidal,et al. Projective Factorization of Multiple Rigid-Body Motions , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[56] Kurt Keutzer,et al. Dense Point Trajectories by GPU-Accelerated Large Displacement Optical Flow , 2010, ECCV.

[57] David Suter,et al. A Model-Selection Framework for Multibody Structure-and-Motion of Image Sequences , 2007, International Journal of Computer Vision.

[58] Daniel Cremers,et al. A Convex Approach to Minimal Partitions , 2012, SIAM J. Imaging Sci..

[59] Bernt Schiele,et al. Monocular Visual Scene Understanding: Understanding Multi-Object Traffic Scenes , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60] Roberto Cipolla,et al. Unsupervised Bayesian Detection of Independent Motion in Crowds , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[61] Shrinivas J. Pundlik,et al. Joint tracking of features and edges , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[62] Jitendra Malik,et al. Object Segmentation by Long Term Analysis of Point Trajectories , 2010, ECCV.

[63] Jitendra Malik,et al. A real-time computer vision system for measuring traffic parameters , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[64] Marc Pollefeys,et al. A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid, Degenerate and Non-degenerate , 2006, ECCV.

[65] Andrew Zisserman,et al. Object Level Grouping for Video Shots , 2004, International Journal of Computer Vision.

[66] T. Boult,et al. Factorization-based segmentation of motions , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[67] René Vidal,et al. Sparse Subspace Clustering: Algorithm, Theory, and Applications , 2012, IEEE transactions on pattern analysis and machine intelligence.

[68] C. W. Gear,et al. Multibody Grouping from Motion Images , 1998, International Journal of Computer Vision.

[69] Eric L. Miller,et al. Multiple Hypothesis Video Segmentation from Superpixel Flows , 2010, ECCV.

[70] Patrick Pérez,et al. Clustering Point Trajectories with Various Life-Spans , 2009, 2009 Conference for Visual Media Production.

[71] Daniel Cremers,et al. Interactive Motion Segmentation , 2010, DAGM-Symposium.

[72] Charless C. Fowlkes,et al. Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[73] Ethan M. Meyers,et al. Visual Parsing After Recovery From Blindness , 2009, Psychological science.

[74] Michael J. Black,et al. Layered segmentation and optical flow estimation over time , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[75] Mila Nikolova,et al. Algorithms for Finding Global Minimizers of Image Segmentation and Denoising Models , 2006, SIAM J. Appl. Math..