Paper Information - Visual Learning from Multiple Views

An algorithm is presented in which a computer is visually shown a sequence of views of a solid planar object as the object is rotated in space. The computer automatically forms a three-dimensional description of the object. The description consists of a deterministic description of the object's surfaces and how they are interconnected to form the object, along with a measure of each surface's shape which is invariant to three-dimensional rotation. From this self-learned model of the object, the object can later be recognized from any viewing angle. The basis of the algorithm is the ability of the program to determine in a specific visual view: "What do I see now that I have seen before?" This is accomplished by generating two sets of mappings of one object description to another object description.
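The two ideas in the abstract, a surface shape measure that is invariant to three-dimensional rotation and a mapping from one object description to another, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not say which invariant measure the authors use, and the names `face_signature`, `signatures_match`, `rotate`, and `match_faces` are hypothetical. The sketch also assumes the three-dimensional vertex coordinates of each planar face are already known, whereas the paper builds its description from a sequence of views.

```python
import math

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def _norm(v):
    return math.sqrt(_dot(v, v))

def face_signature(vertices):
    """Sorted edge lengths and sorted vertex angles of a planar polygon in 3D.

    Rotation preserves lengths and angles, so this stand-in measure does not
    change when the object is rotated. It is NOT the measure from the paper.
    """
    n = len(vertices)
    lengths, angles = [], []
    for i in range(n):
        to_prev = _sub(vertices[i - 1], vertices[i])
        to_next = _sub(vertices[(i + 1) % n], vertices[i])
        lengths.append(_norm(to_next))
        # Angle between the two edges that meet at vertex i.
        cos_a = _dot(to_prev, to_next) / (_norm(to_prev) * _norm(to_next))
        angles.append(math.acos(max(-1.0, min(1.0, cos_a))))
    return sorted(lengths), sorted(angles)

def signatures_match(sig_a, sig_b, tol=1e-9):
    """Compare two signatures element by element within a tolerance."""
    for values_a, values_b in zip(sig_a, sig_b):
        if len(values_a) != len(values_b):
            return False
        if any(abs(x - y) > tol for x, y in zip(values_a, values_b)):
            return False
    return True

def rotate(point, about_z, about_x):
    """Rotate a point about the z axis, then about the x axis (radians)."""
    x, y, z = point
    cz, sz = math.cos(about_z), math.sin(about_z)
    x, y = cz * x - sz * y, sz * x + cz * y
    cx, sx = math.cos(about_x), math.sin(about_x)
    y, z = cx * y - sx * z, sx * y + cx * z
    return (x, y, z)

def match_faces(view, model):
    """Map each face in the current view to the model faces it could be."""
    return {
        view_name: [
            model_name
            for model_name, model_face in model.items()
            if signatures_match(face_signature(view_face),
                                face_signature(model_face))
        ]
        for view_name, view_face in view.items()
    }

# Hypothetical stored model with two faces, and a view of one rotated face.
model = {
    "triangle": [(0, 0, 0), (3, 0, 0), (0, 4, 0)],
    "square": [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0)],
}
view = {"seen": [rotate(p, 0.7, 1.1) for p in model["triangle"]]}
print(match_faces(view, model))
```

Sorting the lengths and angles makes the signature independent of which vertex is listed first, at the cost of discarding their order around the face, so two different polygons can share a signature. A fuller description would also record how faces are interconnected, as the abstract describes.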

Clarence L. Coates | Stephen A. Underwood

[1] Lawrence G. Roberts, et al. Machine Perception of Three-Dimensional Solids, 1963, Outstanding Dissertations in the Computer Sciences.

[2] David L. Waltz, et al. Generating Semantic Descriptions From Drawings of Scenes With Shadows, 1972.

[3] Patrick Henry Winston, et al. Learning structural descriptions from examples, 1970.

[4] Gunnar Rutger Grape. Model based (intermediate-level) computer-vision, 1973.

[5] P. Wortman, et al. Pattern Recognition, Learning, and Thought, 1974.

[6] Stephen Alan Underwood. Visual learning and recognition by computer, 1972.