Statistical bias in 3-D reconstruction from a monocular video

The present state-of-the-art in computing the error statistics in three-dimensional (3-D) reconstruction from video concentrates on estimating the error covariance. A different source of error which has not received much attention is the fact that the reconstruction estimates are often significantly statistically biased. In this paper, we derive a precise expression for the bias in the depth estimate, based on the continuous (differentiable) version of structure from motion (SfM). Many SfM algorithms, or certain portions of them, can be posed in a linear least-squares (LS) framework Ax=b. Examples include initialization procedures for bundle adjustment or algorithms that alternately estimate depth and camera motion. It is a well-known fact that the LS estimate is biased if the system matrix A is noisy. In SfM, the matrix A contains point correspondences, which are always difficult to obtain precisely; thus, it is expected that the structure and motion estimates in such a formulation of the problem would be biased. Existing results on the minimum achievable variance of the SfM estimator are extended by deriving a generalized Cramer-Rao lower bound. A detailed analysis of the effect of various camera motion parameters on the bias is presented. We conclude by presenting the effect of bias compensation on reconstructing 3-D face models from rendered images.

[1]  Rama Chellappa,et al.  Estimating the Kinematics and Structure of a Rigid Object from a Sequence of Monocular Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Rama Chellappa,et al.  Performance bounds for estimating three-dimensional motion parameters from a sequence of noisy images , 1989 .

[3]  R. Nelson,et al.  Visual Navigation , 1996 .

[4]  Vishvjit S. Nalwa,et al.  A guided tour of computer vision , 1993 .

[5]  S. B. Kang,et al.  Recovering 3 D Shape and Motion from Image Streams using Non-Linear Least Squares , 1993 .

[6]  Kostas Daniilidis,et al.  Understanding noise sensitivity in structure from motion , 1996 .

[7]  A. Murat Tekalp,et al.  Error Characterization of the Factorization Method , 2001, Comput. Vis. Image Underst..

[8]  Alex Pentland,et al.  Recursive Estimation of Motion, Structure, and Focal Length , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Narendra Ahuja,et al.  Optimal Motion and Structure Estimation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Rama Chellappa,et al.  Estimation of Object Motion Parameters from Noisy Images , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Thomas Kailath,et al.  Linear Systems , 1980 .

[12]  Cornelia Fermüller,et al.  Uncertainty in 3D Shape Estimation , 2003 .

[13]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[14]  R. F.,et al.  Mathematical Statistics , 1944, Nature.

[15]  Kenichi Kanatani,et al.  Unbiased Estimation and Statistical Analysis of 3-D Rigid Motion from Two Views , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Yakup Genc,et al.  Fast and Accurate Algorithms for Projective Multi-Image Structure from Motion , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Rama Chellappa,et al.  3-D Motion Estimation Using a Sequence of Noisy Stereo Images: Models, Estimation, and Uniqueness Results , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Rama Chellappa,et al.  Face reconstruction from monocular video using uncertainty analysis and a generic model , 2003, Comput. Vis. Image Underst..

[19]  Richard Szeliski,et al.  Recovering 3D Shape and Motion from Image Streams Using Nonlinear Least Squares , 1994, J. Vis. Commun. Image Represent..

[20]  Rama Chellappa,et al.  Statistical Error Propagation in 3D Modeling From Monocular Video , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[21]  S. Shankar Sastry,et al.  c ○ 2000 Kluwer Academic Publishers. Manufactured in The Netherlands. Linear Differential Algorithm for Motion Recovery: A Geometric Approach , 2022 .

[22]  John Oliensis,et al.  A Multi-Frame Structure-from-Motion Algorithm under Perspective Projection , 1999, International Journal of Computer Vision.

[23]  Kenichi Kanatani,et al.  Statistical optimization for geometric computation - theory and practice , 1996, Machine intelligence and pattern recognition.

[24]  E L Thomas,et al.  Movements of the eye. , 1968, Scientific American.

[25]  Rama Chellappa,et al.  Statistical Analysis of Inherent Ambiguities in Recovering 3-D Motion from a Noisy Flow Field , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Hans-Hellmut Nagel,et al.  The coupling of rotation and translation in motion estimation of planar surfaces , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Yiannis Aloimonos,et al.  The Statistics of Optical Flow , 2001, Comput. Vis. Image Underst..

[28]  W. Rudin Principles of mathematical analysis , 1964 .

[29]  Olivier Faugeras,et al.  3D Dynamic Scene Analysis , 1992 .

[30]  Sridhar Srinivasan,et al.  Extracting Structure from Optical Flow Using the Fast Error Search Technique , 2000, International Journal of Computer Vision.

[31]  Stefano Soatto,et al.  Optimal Structure from Motion: Local Ambiguities and Global Estimates , 2004, International Journal of Computer Vision.

[32]  James T. Todd,et al.  Theoretical and biological limitations on the visual perception of three-dimensional structure from , 1998 .

[33]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[34]  金谷 健一 Statistical optimization for geometric computation : theory and practice , 2005 .

[35]  Rama Chellappa,et al.  Stochastic Approximation and Rate-Distortion Analysis for Robust Structure and Motion Estimation , 2003, International Journal of Computer Vision.

[36]  Sabine Van Huffel,et al.  The total least squares problem , 1993 .

[37]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[38]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[39]  Zhengyou Zhang,et al.  Determining the Epipolar Geometry and its Uncertainty: A Review , 1998, International Journal of Computer Vision.

[40]  John Oliensis,et al.  A Critique of Structure-from-Motion Algorithms , 2000, Comput. Vis. Image Underst..

[41]  Amnon Shashua,et al.  Direct estimation of motion and extended scene structure from a moving stereo rig , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).