Paper information - Video Segmentation by Non-Local Consensus voting

We address the problem of Foreground/Background segmentation of “unconstrained” video. By “unconstrained” we mean that the moving objects and the background scene may be highly non-rigid (e.g., waves in the sea); the camera may undergo a complex motion with 3D parallax; moving objects may suffer from motion blur, large scale and illumination changes, etc. Most existing segmentation methods fail on such unconstrained videos, especially in the presence of highly non-rigid motion and low resolution. We propose a computationally efficient algorithm which is able to produce accurate results on a large variety of unconstrained videos. This is obtained by casting the video segmentation problem as a voting scheme on the graph of similar (‘re-occurring’) regions in the video sequence. We start from crude saliency votes at each pixel, and iteratively correct those votes by ‘consensus voting’ of re-occurring regions across the video sequence. The power of our consensus voting comes from the non-locality of the region re-occurrence, both in space and in time – enabling propagation of diverse and rich information across the entire video sequence. Qualitative and quantitative experiments indicate that our approach outperforms current state-of-the-art methods.
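The voting scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `consensus_voting`, the region descriptors in `features`, the neighbour count `k`, the iteration count `n_iters`, and the plain averaging rule are all assumptions made for illustration. Each region starts with a crude saliency vote, and its vote is repeatedly replaced by the average of its own vote and the votes of its most similar regions, wherever those regions occur in the video.

```python
import numpy as np


def consensus_voting(features, initial_votes, k=3, n_iters=10):
    """Iteratively average each region's vote with its k nearest neighbours.

    features      : (n, d) array, one descriptor per region (hypothetical input)
    initial_votes : (n,) array of crude foreground likelihoods in [0, 1]
    k, n_iters    : illustrative parameters, not values from the paper
    """
    features = np.asarray(features, dtype=float)
    votes = np.asarray(initial_votes, dtype=float).copy()

    # Pairwise squared distances between region descriptors.
    diffs = features[:, None, :] - features[None, :, :]
    dists = np.sum(diffs ** 2, axis=-1)
    np.fill_diagonal(dists, np.inf)  # a region is not its own neighbour

    # Indices of the k most similar regions; similarity ignores where or when
    # a region occurs, which is what makes the neighbourhood non-local.
    neighbours = np.argsort(dists, axis=1)[:, :k]

    for _ in range(n_iters):
        # New vote = mean of the region's own vote and its neighbours' votes.
        votes = (votes + votes[neighbours].sum(axis=1)) / (k + 1)
    return votes
```

In this sketch a region whose initial saliency vote disagrees with the regions that look like it is pulled toward their consensus, and thresholding the final votes (for example at 0.5) gives a foreground/background label per region. A brute-force distance matrix is used only to keep the example short; it would not scale to the number of regions in a real video.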

Michal Irani | Alon Faktor
