Dynamic structure from motion using uncalibrated cameras and unsegmented scenes (חישובי מבנה של סצנה דינאמית מתנועת מצלמה, עבור מצלמות לא ידועות וללא סגמנטציה נתונה.)

Much work has been done in the last decade by the Computer Vision community in understanding the geometry of images of a rigid scene taken by a moving camera. The case of a scene containing motion has been largely ignored. Since most video footage aim at capturing events, hence motion, the need for handling dynamic scenes became apparent. Our work deals with discovering the geometrical models and the mathematical tools that we can use to analyze views of such scenes. In particular, we focus on the extraction of information about multiple independently moving objects. Unlike previous work which at best tried to ignore such moving objects, we show that valuable information can be extracted from such motions. The main mathematical tools that we have used are projective algebra and multi-linear tensors. Projective algebra has long been used to model the process of imaging of rigid scenes. From this model multiple views’ invariants can be derived to describe for example stereo vision. It was shown that these invariants can be described most generally by using multi-linear tensors. In our work, we further use multi-linear tensors to model dynamic scenes. In order to handle dynamic scenes we have often lifted the model of the scene to higher projective spaces. In these higher spaces, we were able to derive novel multi-linear invariants. We then found ways to decompose these invariants to unravel underlying information such as scene structure, motion in the scene and the motion of the camera. The research presented in this thesis appears in the following papers [46, 57, 33, 56, 58, 61]. These papers are reprinted as chapters of this thesis: Chapter 2 describes the recovery of structure and motion of a scene containing points moving along coplanar lines, appeared in ECCV2000 European Conference on Computer Vision [46]. Chapter 3 describes the recovery of structure and motion of a scene containing points in 3D moving in constant velocity, and other scenarios, appeared in International Journal on Computer Vision (IJCV) 48(1), 2002. [57]. Chapter 4 describes the derivation multi linear constraints used to recognize a moving object from a single view appeared in CVPR 2001 IEEE Conf. on Computer Vision and Pattern Recognition [33]

[1]  Michal Irani,et al.  Multi-frame optical flow estimation using subspace constraints , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[2]  Amnon Shashua,et al.  On Representation Theory in Computer Vision Problems , 2002 .

[3]  Kun Huang,et al.  Generalized Rank Conditions in Multiple View Geometry with Applications to Dynamical Scenes , 2002, ECCV.

[4]  Olivier D. Faugeras,et al.  On the geometry and algebra of the point and line correspondences between N images , 1995, Proceedings of IEEE International Conference on Computer Vision.

[5]  Andrew W. Fitzgibbon,et al.  Multibody Structure and Motion: 3-D Reconstruction of Independently Moving Objects , 2000, ECCV.

[6]  Stefan Carlsson,et al.  The Dou8ble Algebra: An Effective Tool for Computing Invariants in Computer Vision , 1993, Applications of Invariance in Computer Vision.

[7]  Daphna Weinshall,et al.  Dual Computation of Projective Shape and Camera Positions from Multiple Images , 1998, International Journal of Computer Vision.

[8]  S. P. Mudur,et al.  Three-dimensional computer vision: a geometric viewpoint , 1993 .

[9]  Carsten Rother,et al.  Linear Multi View Reconstruction and Camera Recovery , 2001, ICCV.

[10]  Michal Irani,et al.  Computing occluding and transparent motions , 1994, International Journal of Computer Vision.

[11]  Richard I. Hartley,et al.  In Defense of the Eight-Point Algorithm , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Kenichi Kanatani,et al.  Motion segmentation by subspace separation and model selection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[13]  Minas E. Spetsakis,et al.  A multi-frame approach to visual motion perception , 2004, International Journal of Computer Vision.

[14]  Stefan Carlsson,et al.  Duality of reconstruction and positioning from projective views , 1995, Proceedings IEEE Workshop on Representation of Visual Scenes (In Conjunction with ICCV'95).

[15]  Lior Wolf,et al.  Affine 3-D reconstruction from two projective images of independently translating planes , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[16]  Joe Harris,et al.  Representation Theory: A First Course , 1991 .

[17]  Lior Wolf,et al.  Homography Tensors: On Algebraic Entities that Represent Three Views of Static or Moving Planar Points , 2000, ECCV.

[18]  Gian-Carlo Rota,et al.  On the Exterior Calculus of Invariant Theory , 1985 .

[19]  Lihi Zelnik-Manor,et al.  Degeneracies, dependencies and their implications in multi-body and multi-sequence factorizations , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[20]  Takeo Kanade,et al.  A Multibody Factorization Method for Independently Moving Objects , 1998, International Journal of Computer Vision.

[21]  Daphna Weinshall,et al.  From Reference Frames to Reference Planes: Multi-View Parallax Geometry and Applications , 1998, ECCV.

[22]  Lior Wolf,et al.  Correspondence-free Synchronization and Reconstruction in a Non-rigid Scene , 2004 .

[23]  Amnon Shashua,et al.  Trilinear Tensor: The Fundamental Construct of Multiple-view Geometry and Its Applications , 1997, AFPAC.

[24]  Michael Werman,et al.  Trajectory triangulation over conic section , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[25]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[26]  Andrew Zisserman,et al.  Applications of Invariance in Computer Vision , 1993, Lecture Notes in Computer Science.

[27]  Amnon Shashua,et al.  Trajectory Triangulation: 3D Reconstruction of Moving Points from a Monocular Image Sequence , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Adrien Bartoli The Geometry of Dynamic Scenes - On Coplanar and Convergent Linear Motions Embedded in 3D Static Scenes , 2002, BMVC.

[29]  Mei Han,et al.  Multiple Motion Scene Reconstruction from Uncalibrated Views , 2001, ICCV.

[30]  Nassir Navab,et al.  Relative Affine Structure: Canonical Model for 3D From 2D Geometry and Applications , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  P. Anandan,et al.  Parallax Geometry of Pairs of Points for 3D Scene Analysis , 1996, ECCV.

[32]  Peter F. Sturm Structure and Motion for Dynamic Scenes - The Case of Points Moving in Planes , 2002, ECCV.

[33]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[34]  S. Shankar Sastry,et al.  Two-View Multibody Structure from Motion , 2005, International Journal of Computer Vision.

[35]  Henning Biermann,et al.  Recovering non-rigid 3D shape from image streams , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[36]  Ian D. Reid,et al.  Duality, Rigidity and Planar Parallax , 1998, ECCV.

[37]  Lior Wolf,et al.  On the Structure and Properties of the Quadrifocal Tensor , 2000, ECCV.

[38]  Lior Wolf,et al.  Time-varying shape tensors for scenes with multiply moving points , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[39]  Michael Werman,et al.  Duality of Multi-Point and Multi-Frame Geometry: Fundamental Shape Matrices and Tensors , 1996, ECCV.

[40]  Denis Simakov,et al.  Feature-Based Sequence-to-Sequence Matching , 2006, International Journal of Computer Vision.

[41]  Amnon Shashua,et al.  3D Reconstruction from Tangent-of-Sight Measurements of a Moving Object Seen from a Moving Camera , 2000, ECCV.

[42]  Marc Pollefeys,et al.  Multiple view geometry , 2005 .

[43]  Lior Wolf,et al.  Two-body segmentation from two perspective views , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[44]  Mei Han,et al.  Reconstruction of a Scene with Multiple Linearly Moving Objects , 2004, International Journal of Computer Vision.

[45]  Amnon Shashua,et al.  Threading Fundamental Matrices , 1998, ECCV.

[46]  Lior Wolf,et al.  Sequence-to-Sequence Self Calibration , 2002, ECCV.

[47]  Amnon Shashua,et al.  The Rank 4 Constraint in Multiple (>=3) View Geometry , 1996, ECCV.

[48]  Matthew Brand,et al.  Morphable 3D models from video , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[49]  O. Faugeras Stratification of three-dimensional vision: projective, affine, and metric representations , 1995 .

[50]  Lior Wolf,et al.  Join tensors: On 3D-to-3D alignment of dynamic sets , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[51]  P. Torr Geometric motion segmentation and model selection , 1998, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[52]  Charles R. Dyer,et al.  Interpolating view and scene motion by dynamic view morphing , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[53]  Amnon Shashua,et al.  On the synthesis of dynamic scenes from reference views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).