Semantic Multi-body Motion Segmentation

This paper presents a method to deal with the multi-body segmentation problem using a set of 2D points matches between two views. The key feature of our approach is the explicit inclusion of a higher semantic information as given by general purpose object detectors that boost the segmentation of the moving objects. In the classical formulation of the problem, only 2D matched points between views are used to identify independently moving objects based on the principle that a set of points belonging to a moving object would satisfy some given multi-view relations (e.g. multi-body epipolar constraints). We improve and speedup such process by including the information that a set of 2D matches may belong to the same object given the output of a detector. As such, instead of sampling points uniformly with a RANSAC based strategy, the selection of the matches is driven by the position and score confidence of the object detectors. Evaluation on challenging synthetic and real datasets shows a remarkable improvement in respect to previous approaches, regarding both the number of iterations required to segment a scene and the effectiveness of the segmentation itself, often making the difference between satisfying segmentation and almost complete failure.

[1]  David Suter,et al.  Robust adaptive-scale parametric model estimation for computer vision , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  D. Freedman,et al.  Fast Mean Shift by compact density representation , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Gerard Medioni,et al.  StaRSaC: Stable random sample consensus for parameter estimation , 2009, CVPR.

[4]  Andrew Owens,et al.  SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels , 2013, 2013 IEEE International Conference on Computer Vision.

[5]  René Vidal,et al.  Multiframe Motion Segmentation with Missing Data Using PowerFactorization and GPCA , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[6]  Jan-Michael Frahm,et al.  RECON: Scale-adaptive robust estimation via Residual Consensus , 2011, 2011 International Conference on Computer Vision.

[7]  Tat-Jun Chin,et al.  The Random Cluster Model for robust geometric fitting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Tat-Jun Chin,et al.  Efficient Multi-structure Robust Fitting with Incremental Top-k Lists Comparison , 2010, ACCV.

[9]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  João Paulo Costeira,et al.  The Normalized Subspace Inclusion: Robust clustering of motion subspaces , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[12]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2016, Texts in Computer Science.

[13]  Tat-Jun Chin,et al.  Accelerated Hypothesis Generation for Multistructure Data via Preference Analysis , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Tat-Jun Chin,et al.  Robust fitting of multiple structures: The statistical learning approach , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[15]  Anil M. Cheriyadat,et al.  Non-negative matrix factorization of partial track data for motion segmentation , 2010, 2009 IEEE 12th International Conference on Computer Vision.

[16]  Sang Wook Lee,et al.  Deterministic Fitting of Multiple Structures Using Iterative MaxFS with Inlier Scale Estimation , 2013, 2013 IEEE International Conference on Computer Vision.

[17]  Robert T. Collins,et al.  Mean-shift blob tracking through scale space , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[18]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Tat-Jun Chin,et al.  Dynamic and hierarchical multi-structure geometric model fitting , 2011, 2011 International Conference on Computer Vision.

[20]  B. S. Manjunath,et al.  The multiRANSAC algorithm and its application to detect planar homographies , 2005, IEEE International Conference on Image Processing 2005.

[21]  Jana Kosecka,et al.  Nonparametric Estimation of Multiple Structures with Outliers , 2006, WDV.

[22]  René Vidal,et al.  Motion segmentation via robust subspace separation in the presence of outlying, incomplete, or corrupted trajectories , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Sunglok Choi,et al.  Performance Evaluation of RANSAC Family , 2009, BMVC.

[24]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  David Suter,et al.  A Model-Selection Framework for Multibody Structure-and-Motion of Image Sequences , 2007, International Journal of Computer Vision.

[26]  T. Kanade,et al.  A multi-body factorization method for motion analysis , 1995, ICCV 1995.

[27]  Tat-Jun Chin,et al.  Simultaneously Fitting and Segmenting Multiple-Structure Data with Outliers , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Marc Pollefeys,et al.  Joint 3D Scene Reconstruction and Class Segmentation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Tat-Jun Chin,et al.  Accelerated Hypothesis Generation for Multi-structure Robust Fitting , 2010, ECCV.

[30]  Tat-Jun Chin,et al.  The Random Cluster Model for robust geometric fitting , 2012, CVPR.

[31]  Henrik Aanæs,et al.  Interesting Interest Points , 2011, International Journal of Computer Vision.

[32]  Nikolas P. Galatsanos,et al.  Generalized likelihood ratio test based algorithms for object recognition in photon-limited images , 2005, IEEE International Conference on Image Processing 2005.

[33]  Andrew W. Fitzgibbon,et al.  Multibody Structure and Motion: 3-D Reconstruction of Independently Moving Objects , 2000, ECCV.

[34]  Joaquim Salvi,et al.  Enhanced Local Subspace Affinity for feature-based motion segmentation , 2011, Pattern Recognit..

[35]  Christoph Schnörr,et al.  Spectral clustering of linear subspaces for motion segmentation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[36]  Joaquim Salvi,et al.  Adaptive Motion Segmentation Algorithm Based on the Principal Angles Configuration , 2010, ACCV.

[37]  Roberto Tron RenVidal A Benchmark for the Comparison of 3-D Motion Segmentation Algorithms , 2007 .

[38]  Luc Van Gool,et al.  Multibody Structure-from-Motion in Practice , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  René Vidal,et al.  Three-View Multibody Structure from Motion , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[41]  Silvio Savarese,et al.  Semantic structure from motion , 2011, CVPR 2011.

[42]  Andrea Fusiello,et al.  Robust Multiple Structures Estimation with J-Linkage , 2008, ECCV.

[43]  René Vidal,et al.  Sparse subspace clustering , 2009, CVPR.