Model-based Invariant Functions and Their Use for Recognition

Using three dimensional invariant representations, we address the problem of changes in appearance that result from a change in camera orientation (or change of viewpoint). This approach is based on a Euclidean invariant representation of three dimensional objects, where the metric information is kept using the Gramian of 4 basis points and the affine coordinates of the remaining points, or using the generalized inverse Gramian of all the object points. We describe functions which operate on two dimensional images of three dimensional objects, and which are invariant under changes of viewpoint. These functions can be used to improve and extend various existing recognition approaches, including alignment, linear combination, and indexing. The invariant representation can be computed with a linear algorithm from a sequence of images.

[1]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[2]  David W. Jacobs,et al.  Space and Time Bounds on Indexing 3D Models from 2D Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Yehezkel Lamdan,et al.  Object recognition by affine invariant matching , 2011, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  David A. Forsyth,et al.  Invariant Descriptors for 3D Object Recognition and Pose , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Azriel Rosenfeld,et al.  Human and Machine Vision , 1983 .

[6]  Daphna Weinshall,et al.  Distance metric between 3D models and 2D images for recognition and classification , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Olivier D. Faugeras,et al.  What can be seen in three dimensions with an uncalibrated stereo rig , 1992, ECCV.

[8]  Daphna Weinshall,et al.  Linear and incremental acquisition of invariant shape models from image sequences , 1993, 1993 (4th) International Conference on Computer Vision.

[9]  Shimon Ullman,et al.  Computational Studies in the Interpretation of Structure and Motion: Summary and Extension , 1983 .

[10]  Amnon Shashua,et al.  Projective depth: A geometric invariant for 3D reconstruction from two perspective/orthographic views and for visual recognition , 1993, 1993 (4th) International Conference on Computer Vision.

[11]  Allen R. Hanson,et al.  Description and reconstruction from image trajectories of rotational motion , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[12]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Andrew P. Witkin,et al.  Scale-Space Filtering , 1983, IJCAI.

[14]  Daphna Weinshall,et al.  3D object recognition by indexing structural invariants from multiple views , 1993, 1993 (4th) International Conference on Computer Vision.

[15]  S Ullman,et al.  Maximizing Rigidity: The Incremental Recovery of 3-D Structure from Rigid and Nonrigid Motion , 1984, Perception.

[16]  Daphna Weinshall Model-based invariants for 3D vision , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[17]  J.B. Burns,et al.  View Variation of Point-Set and Line-Segment Features , 1993, IEEE Trans. Pattern Anal. Mach. Intell..