An algorithm is presented in which a computer is shown a sequence of views of a solid planar object as the object is rotated in space, and from these views automatically forms a three-dimensional description of the object. The description consists of a deterministic account of the object's surfaces and how they are interconnected, along with a measure of each surface's shape that is invariant to three-dimensional rotation. From this self-learned model, the object can later be recognized from any viewing angle. The basis of the algorithm is the program's ability to determine, in a specific view: "What do I see now that I have seen before?" This is accomplished by generating two sets of mappings from one object description to another.
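As a rough illustration of the idea of a rotation-invariant surface measure, the sketch below describes each planar face by its area and its sorted edge lengths, then asks which faces in a new view match faces already seen. The abstract does not say which measure or mapping procedure the algorithm actually uses, so the signature, the helper names (`face_signature`, `match_faces`, `rotate`), and the assumption that full 3-D vertex coordinates are available are all hypothetical choices made for this example.

```python
import math

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def norm(a):
    return math.sqrt(sum(x * x for x in a))

def face_signature(vertices, ndigits=6):
    """Hypothetical invariant: (area, sorted edge lengths) of a planar polygon.

    Both quantities depend only on distances within the face, so they are
    unchanged by any rigid rotation of the object.
    """
    n = len(vertices)
    edges = sorted(
        round(norm(sub(vertices[(i + 1) % n], vertices[i])), ndigits)
        for i in range(n)
    )
    # Vector area of a closed planar polygon: half the sum of v_i x v_{i+1}.
    total = (0.0, 0.0, 0.0)
    for i in range(n):
        c = cross(vertices[i], vertices[(i + 1) % n])
        total = (total[0] + c[0], total[1] + c[1], total[2] + c[2])
    area = round(0.5 * norm(total), ndigits)
    return (area, tuple(edges))

def rotate(p, ax, az):
    """Rotate point p by angle ax about the x-axis, then az about the z-axis."""
    x, y, z = p
    y, z = (y * math.cos(ax) - z * math.sin(ax),
            y * math.sin(ax) + z * math.cos(ax))
    x, y = (x * math.cos(az) - y * math.sin(az),
            x * math.sin(az) + y * math.cos(az))
    return (x, y, z)

def match_faces(seen, current):
    """Map each face in the current view to a previously seen face, if any.

    `seen` maps face names to stored signatures; `current` maps face names
    to vertex lists. Faces with no matching signature are left out.
    """
    mapping = {}
    for name, vertices in current.items():
        sig = face_signature(vertices)
        for seen_name, seen_sig in seen.items():
            if sig == seen_sig:
                mapping[name] = seen_name
                break
    return mapping

# Faces learned from an earlier view.
square = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0)]
rect = [(0, 0, 0), (2, 0, 0), (2, 1, 0), (0, 1, 0)]
triangle = [(0, 0, 0), (3, 0, 0), (0, 4, 0)]
seen = {"A": face_signature(square), "B": face_signature(rect)}

# The same faces after the object has been rotated, plus one new face.
view = {
    "p": [rotate(v, 0.7, 1.3) for v in rect],
    "q": [rotate(v, 0.7, 1.3) for v in square],
    "r": [rotate(v, 0.7, 1.3) for v in triangle],
}
matches = match_faces(seen, view)
```

Area and sorted edge lengths do not identify a face uniquely (distinct polygons can share both), so a fuller description would also need the connectivity between surfaces that the abstract mentions; this sketch covers only the invariance property.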
[1] Lawrence G. Roberts, et al. Machine Perception of Three-Dimensional Solids. 1963. Outstanding Dissertations in the Computer Sciences.
[2] David L. Waltz, et al. Generating Semantic Descriptions From Drawings of Scenes With Shadows. 1972.
[3] Patrick Henry Winston, et al. Learning structural descriptions from examples. 1970.
[4] Gunnar Rutger Grape. Model based (intermediate-level) computer-vision. 1973.
[5] P. Wortman, et al. Pattern Recognition, Learning, and Thought. 1974.
[6] Stephen Alan Underwood. Visual learning and recognition by computer. 1972.