We consider the problem of learning the mapping between the image coordinates of unknown affine views of an object and the parameters of the affine transformation that can align a known view of the same object with them. A single layer neural network (SL-NN) is used to learn the mapping. Although the proposed approach is conceptually similar to other approaches in the literature, its practical advantages are more profound. The views used to train the SL-NN are not obtained by taking different pictures of the object but by sampling the space of its affine transformed views. This space is constructed by estimating the range of values that the parameters of affine transformation can assume using a single view and a methodology based on singular value decomposition and interval arithmetic. The proposed scheme is as accurate as traditional least-squares approaches but faster. A front-end stage to the SL-NN, based on principal components analysis increases its noise tolerance dramatically and guides us in deciding how many training views are necessary in order for it to learn a good mapping.
[1]
T. Poggio,et al.
A network that learns to recognize three-dimensional objects
,
1990,
Nature.
[2]
E. Hansen,et al.
Interval Arithmetic in Matrix Computations, Part II
,
1965
.
[3]
R. Palmer,et al.
Introduction to the theory of neural computation
,
1994,
The advanced book program.
[4]
Amnon Shashua,et al.
Algebraic Functions For Recognition
,
1995,
IEEE Trans. Pattern Anal. Mach. Intell..
[5]
D. W. Thompson,et al.
Three-dimensional model matching from an unconstrained viewpoint
,
1987,
Proceedings. 1987 IEEE International Conference on Robotics and Automation.
[6]
Yehezkel Lamdan,et al.
Affine invariant model-based object recognition
,
1990,
IEEE Trans. Robotics Autom..
[7]
Ronen Basri,et al.
Recognition by Linear Combinations of Models
,
1991,
IEEE Trans. Pattern Anal. Mach. Intell..
[8]
E. Hansen.
Interval Arithmetic in Matrix Computations, Part I
,
1965
.