论文信息 - Rigid Structure from Motion from a Blind Source Separation Perspective

Rigid Structure from Motion from a Blind Source Separation Perspective

We present an information theoretic approach to define the problem of structure from motion (SfM) as a blind source separation one. Given that for almost all practical joint densities of shape points, the marginal densities are non-Gaussian, we show how higher-order statistics can be used to provide improvements in shape estimates over the methods of factorization via Singular Value Decomposition (SVD), bundle adjustment and Bayesian approaches. Previous techniques have either explicitly or implicitly used only second-order statistics in models of shape or noise. A further advantage of viewing SfM as a blind source problem is that it easily allows for the inclusion of noise and shape models, resulting in Maximum Likelihood (ML) or Maximum a Posteriori (MAP) shape and motion estimates. A key result is that the blind source separation approach has the ability to recover the motion and shape matrices without the need to explicitly know the motion or shape pdf. We demonstrate that it suffices to know whether the pdf is sub- or super-Gaussian (i.e., semi-parametric estimation) and derive a simple formulation to determine this from the data. We provide extensive experimental results on synthetic and real tracked points in order to quantify the improvement obtained from this technique.

Aleix M. Martínez | Jeff Fortuna

[1] P. Anandan,et al. Factorization with Uncertainty , 2000, International Journal of Computer Vision.

[2] Takeo Kanade,et al. Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[3] Takeo Kanade,et al. Robust L/sub 1/ norm factorization in the presence of outliers and missing data by alternative convex programming , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4] Jun Fujiki,et al. Nonlinear PCA/ICA for the Structure from Motion Problem , 2004, ICA.

[5] William Adams. L. Le Cam, The central limit theorem around 1935 , 2009 .

[6] Takeo Kanade,et al. A unified factorization algorithm for points, line segments and planes with uncertainty models , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[7] Richard Szeliski,et al. Vision Algorithms: Theory and Practice , 2002, Lecture Notes in Computer Science.

[8] Aleix M. Martínez,et al. A Blind Source Separation Approach to Structure from Motion , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[9] Visa Koivunen,et al. Identifiability, separability, and uniqueness of linear ICA models , 2004, IEEE Signal Processing Letters.

[10] Eric Moulines,et al. A blind source separation technique using second-order statistics , 1997, IEEE Trans. Signal Process..

[11] Aleix M. Martínez,et al. Low-Rank Matrix Fitting Based on Subspace Perturbation Analysis with Applications to Structure from Motion , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] Michael Weber,et al. Independent Component Analysis of Incomplete Data , 1999 .

[13] Erkki Oja,et al. Independent Component Analysis , 2001 .

[14] David Suter,et al. Recovering the missing components in a large noisy low-rank matrix: application to SFM , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Andrzej Cichocki,et al. Robust techniques for independent component analysis (ICA) with noisy data , 1998, Neurocomputing.

[16] Andrew W. Fitzgibbon,et al. Automatic 3D model acquisition and generation of new images from video sequences , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[17] Matthew Brand,et al. A direct method for 3D factorization of nonrigid motion observed in 2D , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[18] Aapo Hyvärinen,et al. Independent component analysis in the presence of Gaussian noise by maximizing joint likelihood , 1998, Neurocomputing.

[19] Bradley J. Nelson,et al. A Deformable Object Tracking Algorithm Based on the Boundary Element Method that is Robust to Occlusions and Spurious Edges , 2008, International Journal of Computer Vision.

[20] Michael J. Black,et al. A Framework for Robust Subspace Learning , 2003, International Journal of Computer Vision.

[21] Terrence J. Sejnowski,et al. Independent Component Analysis Using an Extended Infomax Algorithm for Mixed Subgaussian and Supergaussian Sources , 1999, Neural Computation.

[22] Anders Heyden,et al. Affine Structure and Motion from Points, Lines and Conics , 1999, International Journal of Computer Vision.

[23] Aleix M. Martínez,et al. A weighted probabilistic approach to face recognition from multiple images and video sequences , 2006, Image Vis. Comput..

[24] S. Haykin. Unsupervised adaptive filtering, vol. 1: Blind source separation , 2000 .

[25] David A. Forsyth,et al. Bayesian structure from motion , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[26] Andrew W. Fitzgibbon,et al. Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[27] Andrew W. Fitzgibbon,et al. Damped Newton algorithms for matrix factorization with missing data , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28] Henning Biermann,et al. Recovering non-rigid 3D shape from image streams , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[29] Li Zhang,et al. Shape and motion under varying illumination: unifying structure from motion, photometric stereo, and multiview stereo , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[30] Terrence J. Sejnowski,et al. Blind source separation of more sources than mixtures using overcomplete representations , 1999, IEEE Signal Processing Letters.

[31] Yun Fu,et al. Image-Based Human Age Estimation by Manifold Learning and Locally Adjusted Robust Regression , 2008, IEEE Transactions on Image Processing.

[32] Anders Heyden,et al. Euclidean reconstruction and auto-calibration from continuous motion , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[33] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[34] Liya Ding,et al. Precise detailed detection of faces and facial features , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[35] Olivier Faugeras,et al. Three-Dimensional Computer Vision , 1993 .

[36] L. L. Cam,et al. The Central Limit Theorem Around 1935 , 1986 .

[37] O. Faugeras. Three-dimensional computer vision: a geometric viewpoint , 1993 .