What to Select: Pursuing Consistent Motion Segmentation from Multiple Geometric Models

Motion segmentation aims at separating motions of different moving objects in a video sequence. Facing the complicated real-world scenes, recent studies reveal that combining multiple geometric models would be a more effective way than just employing a single one. This motivates a new wave of model-fusion based motion segmentation methods. However, the vast majority of models of this kind merely seek consensus in spectral embeddings. We argue that a simple consensus might be insufficient to filter out the harmful information which is either unreliable or semantically unrelated to the segmentation task. Therefore, how to automatically select valuable patterns across multiple models should be regarded as a key challenge here. In this paper, we present a novel geometric-model-fusion framework for motion segmentation, which targets at constructing a consistent affinity matrix across all the geometric models. Specifically, it incorporates the structural information shared by affinity matrices to select those semantically consistent entries. Meanwhile, a multiplicative decomposition scheme is adopted to ensure structural consistency among multiple affinities. To solve this problem, an alternative optimization scheme is proposed, together with a proof of its global convergence. Experiments on four real-world benchmarks show the superiority of the proposed method.

[1]  Andrea Fusiello,et al.  Fitting Multiple Heterogeneous Models by Multi-Class Cascaded T-Linkage , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Michael J. Black,et al.  Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Tao Mei,et al.  Subspace Clustering by Block Diagonal Representation , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  K. Fan On a Theorem of Weyl Concerning Eigenvalues of Linear Transformations I. , 1949, Proceedings of the National Academy of Sciences of the United States of America.

[5]  René Vidal,et al.  Multiframe Motion Segmentation with Missing Data Using PowerFactorization and GPCA , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[6]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[7]  David Suter,et al.  Motion Segmentation of RGB-D Sequences: Combining Semantic and Motion Information Using Statistical Inference , 2020, IEEE Transactions on Image Processing.

[8]  Konrad Schindler,et al.  Perspective n-View Multibody Structure-and-Motion Through Model Selection , 2006, ECCV.

[9]  Andrea Fusiello,et al.  Multiple Models Fitting as a Set Coverage Problem , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Shi-Min Hu,et al.  ClusterSLAM: A SLAM backend for simultaneous rigid body clustering and motion estimation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Xun Xu,et al.  Motion Segmentation by Exploiting Complementary Geometric Models , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12]  René Vidal,et al.  Motion Segmentation in the Presence of Outlying, Incomplete, or Corrupted Trajectories , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jiri Matas,et al.  Multi-Class Model Fitting by Energy Minimization and Mode-Seeking , 2017, ECCV.

[14]  Zhuwen Li,et al.  Perspective Motion Segmentation via Collaborative Clustering , 2013, 2013 IEEE International Conference on Computer Vision.

[15]  Xun Xu,et al.  3D Rigid Motion Segmentation with Mixed and Unknown Number of Models , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Guillermo Sapiro,et al.  Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Nassir Navab,et al.  Joint motion boundary detection and CNN-based feature visualization for video object segmentation , 2019, Neural Computing and Applications.

[18]  René Vidal,et al.  Sparse Subspace Clustering: Algorithm, Theory, and Applications , 2012, IEEE transactions on pattern analysis and machine intelligence.

[19]  Yong Yu,et al.  Robust Recovery of Subspace Structures by Low-Rank Representation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Eric Brachmann,et al.  CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Xiaochun Cao,et al.  Task-Feature Collaborative Learning with Application to Personalized Attribute Prediction , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Yan Yan,et al.  Motion Segmentation Via a Sparsity Constraint , 2017, IEEE Transactions on Intelligent Transportation Systems.

[23]  David Suter,et al.  Hypergraph Optimization for Multi-Structural Geometric Model Fitting , 2019, AAAI.

[24]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[25]  Sandeep Singh Sengar,et al.  Motion segmentation-based surveillance video compression using adaptive particle swarm optimization , 2019, Neural Computing and Applications.

[26]  Junmo Kim,et al.  Randomized Voting-Based Rigid-Body Motion Segmentation , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Pei Chen,et al.  Optimization Algorithms on Subspaces: Revisiting Missing Data Problem in Low-Rank Matrix , 2008, International Journal of Computer Vision.

[28]  Andrea Fusiello,et al.  T-Linkage: A Continuous Relaxation of J-Linkage for Multi-model Fitting , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[30]  René Vidal,et al.  A Benchmark for the Comparison of 3-D Motion Segmentation Algorithms , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Qingming Huang,et al.  Split Multiplicative Multi-View Subspace Clustering , 2019, IEEE Transactions on Image Processing.

[32]  Tat-Jun Chin,et al.  The Ordered Residual Kernel for Robust Motion Subspace Clustering , 2009, NIPS.

[33]  Mohan S. Kankanhalli,et al.  Unsupervised Online Video Object Segmentation With Motion Property Understanding , 2018, IEEE Transactions on Image Processing.

[34]  Jiri Matas,et al.  Progressive-X: Efficient, Anytime, Multi-Model Fitting Algorithm , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).