Globally optimal solution to exploit rigidity when recovering structure from motion under occlusion

Widely used SVD-based matrix factorization approaches to the recovery of 3D rigid structure from motion (SFM), require a set of feature points to be visible in a set of images. When there is occlusion, several feature points disappear, the observation matrix misses some entries, there is not equivalent to the SVD, and only suboptimal solutions have been proposed to exploit rigidity. In this paper, we propose a method to complete the trajectories that correspond to a rigid scene, in an optimal way. Our algorithm is not iterative (thus avoiding problems like sensitivity to initialization and local optima); it rather computes in a finite number of steps the globally optimal completion of the observation matrix. We describe experiments that illustrate the gain in accuracy of SFM.

[1]  José M. F. Moura,et al.  Rank 1 Weighted Factorization for 3D Structure Recovery: Algorithms and Performance Analysis , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Sebastian Thrun,et al.  Affine Structure From Sound , 2005, NIPS.

[3]  Takayuki Okatani,et al.  On the Wiberg Algorithm for Matrix Factorization in the Presence of Missing Components , 2007, International Journal of Computer Vision.

[4]  P. Aguiar,et al.  On singular values of partially prescribed matrices , 2008 .

[5]  Andrew W. Fitzgibbon,et al.  Damped Newton algorithms for matrix factorization with missing data , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[7]  João M. F. Xavier,et al.  Spectrally optimal factorization of incomplete matrices , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  David W. Jacobs,et al.  Linear fitting with missing data: applications to structure-from-motion and to characterizing intensity images , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[10]  Pedro M. Q. Aguiar,et al.  3D structure from video streams with partially overlapping images , 2002, Proceedings. International Conference on Image Processing.

[11]  Joshua B. Tenenbaum,et al.  Learning bilinear models for two-factor problems in vision , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.