Deterministic and Statistical Properties of Multi-Resolution 3 D Modeling

Multi-resolution schemes for 3D modeling from an input video sequence are becoming very popular. However, multi-resolution techniques may not be the best solution strategy in many scenarios and it is important to understand the characteristics of such algorithms. In this paper, we present a multiresolution structure from motion algorithm, using monocular video as input, that exploits the bilinear relationship between depth and translation parameters to propagate the estimates through a coarseto-fine reconstruction framework. We present a detailed analysis of the deterministic and statistical properties of such an algorithm. We show that our optimization procedure is guaranteed to converge to the local minimum at each resolution. We also derive analytical expressions for the error covariances of depth and motion estimates, obtained using a multi-resolution structure from motion reconstruction algorithm, as a function of the error covariance of the feature tracks in the the input video. The method is based on using the implicit function theorem for deriving the error covariance at each resolution, and propagating the statistics from coarse to fine resolution, again taking advantage of the bilinear nature of the problem. The statistical calculations do not require the assumptions of Gaussianity of the error distributions and are valid for dense depth reconstruction estimates. Simulations have been carried out using real-life video sequences. It is shown how multi-resolution techniques can either succeed or fail in reconstructing the same scene depending on the quality of the input video of that scene, thus justifying the need for a theoretical analysis of error propagation in the 3D modeling from video.

[1]  金谷 健一 Statistical optimization for geometric computation : theory and practice , 2005 .

[2]  Rama Chellappa,et al.  Stochastic Approximation and Rate-Distortion Analysis for Robust Structure and Motion Estimation , 2003, International Journal of Computer Vision.

[3]  John Oliensis,et al.  A Multi-Frame Structure-from-Motion Algorithm under Perspective Projection , 1999, International Journal of Computer Vision.

[4]  Zhengyou Zhang,et al.  Determining the Epipolar Geometry and its Uncertainty: A Review , 1998, International Journal of Computer Vision.

[5]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[6]  S. Shankar Sastry,et al.  c ○ 2000 Kluwer Academic Publishers. Manufactured in The Netherlands. Linear Differential Algorithm for Motion Recovery: A Geometric Approach , 2022 .

[7]  Rama Chellappa,et al.  Statistical Error Propagation in 3D Modeling From Monocular Video , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[8]  Azriel Rosenfeld,et al.  A hierarchical approach for obtaining structure from two-frame optical flow , 2002, Workshop on Motion and Video Computing, 2002. Proceedings..

[9]  Jeffrey A. Fessler,et al.  Mean and variance of implicitly defined biased estimators (such as penalized maximum likelihood): applications to tomography , 1996, 5th IEEE EMBS International Summer School on Biomedical Imaging, 2002..

[10]  Takeo Kanade,et al.  Gauge fixing for accurate 3D estimation , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[11]  Yakup Genc,et al.  Fast and Accurate Algorithms for Projective Multi-Image Structure from Motion , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  A. Murat Tekalp,et al.  Error Characterization of the Factorization Method , 2001, Comput. Vis. Image Underst..

[13]  Yiannis Aloimonos,et al.  The Statistics of Optical Flow , 2001, Comput. Vis. Image Underst..

[14]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[15]  John Oliensis,et al.  A Critique of Structure-from-Motion Algorithms , 2000, Comput. Vis. Image Underst..

[16]  John Oliensis,et al.  Dealing with Noise in Multiframe Structure from Motion , 1999, Comput. Vis. Image Underst..

[17]  Harpreet S. Sawhney,et al.  Correlation-based estimation of ego-motion and structure from motion and stereo , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[18]  R. Brockett,et al.  Optimal Structure from Motion: Local Ambiguities and Global Estimates , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[19]  Alex Pentland,et al.  Recursive Estimation of Motion, Structure, and Focal Length , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[21]  Richard Szeliski,et al.  Recovering 3D shape and motion from image streams using nonlinear least squares , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Hans-Hellmut Nagel,et al.  The coupling of rotation and translation in motion estimation of planar surfaces , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Vishvjit S. Nalwa,et al.  A guided tour of computer vision , 1993 .

[24]  Olivier Faugeras,et al.  3D Dynamic Scene Analysis , 1992 .

[25]  K. Hanna Direct multi-resolution estimation of ego-motion and structure from motion , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[26]  Rama Chellappa,et al.  Estimating the Kinematics and Structure of a Rigid Object from a Sequence of Monocular Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Rama Chellappa,et al.  Statistical analysis of inherent ambiguities in recovering 3-D motion from a noisy flow field , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[28]  Narendra Ahuja,et al.  Optimal motion and structure estimation , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29]  Rama Chellappa,et al.  Performance bounds for estimating three-dimensional motion parameters from a sequence of noisy images , 1989 .

[30]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[31]  P. Kumar,et al.  Theory and practice of recursive identification , 1985, IEEE Transactions on Automatic Control.

[32]  Thomas S. Huang,et al.  Correction to "Estimating 3-D motion parameters of a rigid planar patch,II: Singular value decomposition" , 1983 .

[33]  Donald B. Gennery,et al.  Tracking Known Three-Dimensional Objects , 1982, AAAI.

[34]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[35]  W. Rudin Principles of mathematical analysis , 1964 .