Structure from multiple 2D affine correspondences without camera calibration

Image motion induced by camera or object motion can be approximated locally by an affine coordinate transformation. We extract 3D information directly from the affine parameters, without camera calibration. The derivation relies on the following assumptions: the object is rigid locally planar, and its local 3D motion is translation. These assumptions enable complete recovery of 3D structure, whereas it is impossible to compute the direction (and magnitude) of the motion. Still, it is possible to distinguish between objects moving differently. Explicit expressions for the structure and the motion indicators are given in terms of the 6 affine parameters, computed for each image patch. Results of experiments on data with known ground truth are described.

[1]  Haim Schweitzer Occam Algorithms for Computing Visual Motion , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Roberto Cipolla,et al.  Extracting the Affine Transformation from Texture Moments , 1994, ECCV.

[3]  R. Manmatha Measuring the Affine Transform Using Gaussian Filters , 1994, ECCV.

[4]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[5]  Tony Lindeberg,et al.  Direct estimation of affine image deformations using visual front-end operations with automatic scale selection , 1995, Proceedings of IEEE International Conference on Computer Vision.

[6]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[7]  Gilad Adiv,et al.  Determining Three-Dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Olivier D. Faugeras,et al.  What can be seen in three dimensions with an uncalibrated stereo rig , 1992, ECCV.

[9]  Andrew Zisserman,et al.  Motion From Point Matches Using Affine Epipolar Geometry , 1994, ECCV.

[10]  Shmuel Peleg,et al.  A Three-Frame Algorithm for Estimating Two-Component Image Motion , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Paul A. Beardsley,et al.  Active visual navigation using non-metric structure , 1995, Proceedings of IEEE International Conference on Computer Vision.