Inlier Estimation for Moving Camera Motion Segmentation

In moving camera videos, motion segmentation is often performed on the optical flow. However, there exist two challenges: (1) Camera motions lead to three primary flows in optical flow: translation, rotation, and radial flow. They are not all solved in existing frameworks under Cartesian coordinate system; (2) A moving camera introduces 3D motion, the depth discontinuities cause the motion discontinuities that severely confuse the motion segmentation. Meanwhile, the mixture of the camera motion and moving objects’ motions make indistinctness between foreground and background. In this work, our solution is to find a low order polynomial to model the background flow field due to its coherence. To this end, we first amend the Helmholts-Hodge Decomposition by adding coherence constraints, which can handle translation, rotation, and radial flow fields under a unified framework. Secondly, we introduce an Incoherence Map and a progressive Quad-Tree partition to reject moving objects and motion discontinuities. Finally, the low order polynomial is achieved from the rest flow samples on two potentials in HHD. We present results on more than twenty videos from four benchmarks. Extensive experiments demonstrate a better performance in dealing with challenging scenes with complex backgrounds. Our method improves the segmentation accuracy of state-of-the-art by \(10\%\sim 30\%\).

[1]  Michal Irani,et al.  Recovery of ego-motion using image stabilization , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Ming-Ting Sun,et al.  Global motion estimation from coarsely sampled motion vector field and the applications , 2003, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Valerio Pascucci,et al.  The Helmholtz-Hodge Decomposition—A Survey , 2013, IEEE Transactions on Visualization and Computer Graphics.

[4]  Jitendra Malik,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence Segmentation of Moving Objects by Long Term Video Analysis , 2022 .

[5]  Ivan V. Bajic,et al.  Motion Vector Outlier Rejection Cascade for Global Motion Estimation , 2010, IEEE Signal Processing Letters.

[6]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[7]  PascucciValerio,et al.  The Helmholtz-Hodge Decomposition—A Survey , 2013 .

[8]  Ahmed M. Elgammal,et al.  Online Moving Camera Background Subtraction , 2012, ECCV.

[9]  Ivan V. Bajic,et al.  Global motion estimation under translation-zoom ambiguity , 2013, 2013 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM).

[10]  Eyke Hüllermeier,et al.  Optimizing the F-Measure in Multi-Label Classification: Plug-in Rule Approach versus Structured Loss Minimization , 2013, ICML.

[11]  H. Helmholtz LXIII. On Integrals of the hydrodynamical equations, which express vortex-motion , 1858 .

[12]  Alan Johnston,et al.  Biologically inspired framework for spatial and spectral velocity estimations. , 2011, Journal of the Optical Society of America. A, Optics, image science, and vision.

[13]  James M. Rehg,et al.  Motion Coherent Tracking with Multi-label MRF optimization , 2010, BMVC.

[14]  Ivan V. Bajic,et al.  A Joint Approach to Global Motion Estimation and Motion Segmentation From a Coarsely Sampled Motion Vector Field , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  René Vidal,et al.  A Benchmark for the Comparison of 3-D Motion Segmentation Algorithms , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Jitendra Malik,et al.  Object Segmentation by Long Term Analysis of Point Trajectories , 2010, ECCV.

[17]  Bohyung Han,et al.  Generalized background subtraction based on hybrid inference by belief propagation and Bayesian filtering , 2011, 2011 International Conference on Computer Vision.

[18]  Allen R. Hanson,et al.  Coherent Motion Segmentation in Moving Camera Videos Using Optical Flow Orientations , 2013, 2013 IEEE International Conference on Computer Vision.