Linear and bilinear subspace methods for structure from motion

Structure from Motion (SFM), which is recovering camera motion and scene structure from image sequences, has various applications, such as scene modeling, robot navigation and object recognition. Most of previous research on SFM requires simplifying assumptions on the camera or the scene. Common assumptions are (a) the camera intrinsic parameters, such as focal lengths, are known or unchanged throughout the sequence, and/or (b) the scene does not contain moving objects. In practice, these are unrealistic assumptions. In this thesis we present a collection of reconstruction methods for dealing with image sequences taken with uncalibrated cameras and/or of multiple motion scenes. The methods produce Euclidean reconstruction directly from feature point locations and are based on the bilinear relationship of camera motion and scene structure. For uncalibrated image sequences, we embed the camera intrinsic parameters within the camera motion representation. For image sequences of multiple motion scenes, we incorporate multiple motions into the scene structure representation. In this way, we derive linear and bilinear subspace constraints on the large amount of information integrated over the entire image sequences. By taking advantage of this redundant information we can achieve accurate and reliable reconstruction. Firstly, we propose a uncalibrated Euclidean reconstruction method from multiple uncalibrated views. This method first performs a projective reconstruction using a bilinear factorization algorithm, and then converts the projective solution to a Euclidean one by enforcing metric constraints. We present three normalization algorithms to generate the Euclidean reconstruction and the intrinsic parameters. The first two algorithms are linear, one for dealing with the case that only the focal lengths are unknown, and another for the case that the focal lengths and the constant principal point are unknown. The third algorithm is bilinear, dealing with the case that the focal lengths, the principal points and the aspect ratios are all unknown. Secondly, we present a linear method to reconstruct a scene containing multiple moving objects together with the camera motion. The number of the moving objects is automatically detected without prior motion segmentation. Assuming that the objects are moving linearly with constant speeds, we propose a unified geometrical representation of the static scene and the moving objects. This representation enables the embedding of the linear motion constraints into the scene structure, which naturally leads to a factorization-based method. Thirdly, we describe a method for multiple motion scene reconstruction from uncalibrated views. The method recovers the scene structure, the trajectories of the moving objects and the camera intrinsic (except skews) and extrinsic parameters simultaneously assuming that the objects are moving with constant velocities. We embed the assumptions within the scene representation and therefore propose a bilinear factorization algorithm to generate a projective reconstruction, and then impose metric constraints to compute the Euclidean reconstruction and the camera intrinsic parameters. We also discuss other issues related to the accuracy and reliability of these reconstruction methods, such as minimum data requirement and gauge selection. The reconstruction methods have been tested on a series of synthetic sequences to evaluate the quality of the methods, and real image sequences to demonstrate their applicability.

[1]  Bill Triggs,et al.  Matching constraints and the joint image , 1995, Proceedings of IEEE International Conference on Computer Vision.

[2]  Mei Han,et al.  Homography-Based 3D Scene Analysis of Video Sequences * , 1999 .

[3]  Olivier D. Faugeras,et al.  What can be seen in three dimensions with an uncalibrated stereo rig , 1992, ECCV.

[4]  Mei Han,et al.  Multiple Motion Scene Reconstruction from Uncalibrated Views , 2001, ICCV.

[5]  Kerfichi Kanatani,et al.  Statistical Optimization and Geometric Visual Inference , 1997, AFPAC.

[6]  Peter F. Sturm,et al.  Critical motion sequences for monocular self-calibration and uncalibrated Euclidean reconstruction , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Michal Irani,et al.  Multi-frame optical flow estimation using subspace constraints , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[8]  Bill Triggs,et al.  Critical motions in euclidean structure from motion , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[9]  Richard Szeliski,et al.  Video mosaics for virtual environments , 1996, IEEE Computer Graphics and Applications.

[10]  Bill Triggs,et al.  Factorization methods for projective structure and motion , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Martial Hebert,et al.  Iterative projective reconstruction from multiple views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[12]  Long Quan,et al.  Relative 3D Reconstruction Using Multiple Uncalibrated Images , 1995, Int. J. Robotics Res..

[13]  Mei Han,et al.  Scene Reconstruction from Multiple Uncalibrated Views , 2000 .

[14]  Michael Werman,et al.  Trilinearity of three perspective views and its associated tensor , 1995, Proceedings of IEEE International Conference on Computer Vision.

[15]  David W. Jacobs,et al.  Linear fitting with missing data: applications to structure-from-motion and to characterizing intensity images , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Takeo Kanade,et al.  Uncertainty Modeling for Optimal Structure from Motion , 1999, Workshop on Vision Algorithms.

[17]  B. Triggs,et al.  Projective Geometry for Image Analysis , 1996 .

[18]  Philip F. McLauchlan,et al.  Gauge Independence in Optimization Algorithms for 3D Vision , 1999, Workshop on Vision Algorithms.

[19]  Narendra Ahuja,et al.  Dense shape and motion from region correspondences by factorization , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[20]  Amnon Shashua,et al.  Trajectory Triangulation: 3D Reconstruction of Moving Points from a Monocular Image Sequence , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Bill Triggs,et al.  Autocalibration and the absolute quadric , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[23]  Michael Werman,et al.  Trajectory triangulation over conic section , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[24]  Michal Irani,et al.  Detecting and Tracking Multiple Moving Objects Using Temporal Integration , 1992, ECCV.

[25]  Henning Biermann,et al.  Recovering non-rigid 3D shape from image streams , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[26]  Lihi Zelnik-Manor,et al.  Multi-view subspace constraints on homographies , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[27]  T. Boult,et al.  Factorization-based segmentation of motions , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[28]  P. Anandan,et al.  Parallax Geometry of Pairs of Points for 3D Scene Analysis , 1996, ECCV.

[29]  Michal Irani,et al.  Recovery of Ego-Motion Using Region Alignment , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[31]  Peter F. Sturm,et al.  Critical motion sequences for the self-calibration of cameras and stereo systems with variable focal length , 1999, Image Vis. Comput..

[32]  Amnon Shashua,et al.  The Rank 4 Constraint in Multiple (>=3) View Geometry , 1996, ECCV.

[33]  P. McLauchlan Gauge invariance in projective 3D reconstruction , 1999, Proceedings IEEE Workshop on Multi-View Modeling and Analysis of Visual Scenes (MVIEW'99).

[34]  Peter F. Sturm,et al.  A Factorization Based Algorithm for Multi-Image Projective Structure and Motion , 1996, ECCV.

[35]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Steven A. Shafer,et al.  Dense Structure from a Dense Optical Flow Sequence , 1998, Comput. Vis. Image Underst..

[37]  Olivier D. Faugeras,et al.  Variational principles, surface evolution, PDEs, level set methods, and the stereo problem , 1998, IEEE Trans. Image Process..

[38]  Philip H. S. Torr,et al.  Outlier detection and motion segmentation , 1993, Other Conferences.

[39]  Daphna Weinshall,et al.  From Reference Frames to Reference Planes: Multi-View Parallax Geometry and Applications , 1998, ECCV.

[40]  Stéphane Christy,et al.  Euclidean Shape and Motion from Multiple Perspective Views by Affine Iterations , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Richard Szeliski,et al.  Creating full view panoramic image mosaics and texture-mapped models , 1997, International Conference on Computer Graphics and Interactive Techniques.

[42]  P. Anandan,et al.  Direct recovery of shape from multiple views: a parallax based approach , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[43]  Mei Han,et al.  Interactive 3D Modeling from Multiple Images Using Scene Regularities , 1998, SMILE.

[44]  Roger Y. Tsai,et al.  A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses , 1987, IEEE J. Robotics Autom..

[45]  Kenichi Kanatani,et al.  Gauges and Gauge Transformations in 3-D Reconstruction from a Sequence of Images , 2000 .

[46]  Takeo Kanade,et al.  A unified factorization algorithm for points, line segments and planes with uncertainty models , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[47]  P. Anandan,et al.  Direct Recovery of Planar-Parallax from Multiple Frames , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[48]  Richard Szeliski,et al.  A layered approach to stereo reconstruction , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[49]  Amnon Shashua,et al.  On the synthesis of dynamic scenes from reference views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[50]  Amnon Shashua,et al.  Trajectory triangulation of lines: reconstruction of a 3D point moving along a line from a monocular image sequence , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[51]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[52]  Stéphane Christy,et al.  Euclidean Reconstruction: From Paraperspective to Perspective , 1996, ECCV.

[53]  Richard I. Hartley,et al.  Computation of the Quadrifocal Tensor , 1998, ECCV.

[54]  Hua Yu,et al.  3D shape and motion by SVD under higher-order approximation of perspective projection , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[55]  Mei Han,et al.  Interactive construction of 3D models from panoramic mosaics , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[56]  Anders Heyden,et al.  Euclidean reconstruction from image sequences with varying and unknown focal length and principal point , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[57]  Richard I. Hartley,et al.  Euclidean Reconstruction from Uncalibrated Views , 1993, Applications of Invariance in Computer Vision.

[58]  Paul A. Beardsley,et al.  3D Model Acquisition from Extended Image Sequences , 1996, ECCV.

[59]  Václav Hlavác,et al.  Projective Reconstruction from N Views Having One View in Common , 1999, Workshop on Vision Algorithms.

[60]  Harry Shum,et al.  Principal Component Analysis with Missing Data and Its Application to Polyhedral Object Modeling , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[61]  José M. F. Moura,et al.  Factorization as a rank 1 problem , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[62]  Olivier D. Faugeras,et al.  Level Set Methods and the Stereo Problem , 1997, Scale-Space.

[63]  P. Anandan,et al.  A unified approach to moving object detection in 2D and 3D scenes , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[64]  T. Kanade,et al.  Perspective Factorization Methods for Euclidean Reconstruction , 1999 .

[65]  Richard I. Hartley,et al.  Linear self-calibration of a rotating and zooming camera , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[66]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[67]  Mei Han,et al.  Creating 3D models with uncalibrated cameras , 2000, Proceedings Fifth IEEE Workshop on Applications of Computer Vision.

[68]  Olivier D. Faugeras,et al.  Complete Dense Stereovision Using Level Set Methods , 1998, ECCV.

[69]  Lior Wolf,et al.  Homography Tensors: On Algebraic Entities that Represent Three Views of Static or Moving Planar Points , 2000, ECCV.

[70]  Lund UniversityBox Projective Structure and Motion from Image Sequences Using Subspace Methods , 1997 .

[71]  Peter F. Sturm,et al.  Algorithms for plane-based pose estimation , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[72]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[73]  Rakesh Kumar,et al.  Shape Recovery from Multiple Views: A Parallax Based Approach , 1994 .

[74]  Peter Sturm Critical Motion Sequences and Conjugacy of Ambiguous Euclidean Reconstructions , 1997 .

[75]  The Factorization Method with Linear Motions , 1999 .

[76]  Harpreet S. Sawhney,et al.  Independent motion detection in 3D scenes , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.