Learning recognition and segmentation of 3-D objects from 2-D images

A framework called Cresceptron is introduced for automatic algorithm design through learning of concepts and rules, thus deviating from the traditional mode in which humans specify the rules constituting a vision algorithm. With the Cresceptron, humans as designers need only to provide a good structure for learning, but they are relieved of most design details. The Cresceptron has been tested on the task of visual recognition by recognizing 3-D general objects from 2-D photographic images of natural scenes and segmenting the recognized objects from the cluttered image background. The Cresceptron uses a hierarchical structure to grow networks automatically, adaptively, and incrementally through learning. The Cresceptron makes it possible to generalize training exemplars to other perceptually equivalent items. Experiments with a variety of real-world images are reported to demonstrate the feasibility of learning in the Cresceptron.<<ETX>>

[1]  P. Thompson,et al.  Margaret Thatcher: A New Illusion , 1980, Perception.

[2]  Takayuki Ito,et al.  Neocognitron: A neural network model for a mechanism of visual pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[3]  P. A. Kolers,et al.  Size in the visual processing of faces and words. , 1985 .

[4]  P. A. Kolers,et al.  Size in the visual processing of faces and words. , 1985, Journal of experimental psychology. Human perception and performance.

[5]  R. Kesner,et al.  Learning and memory : a biological view , 1986 .

[6]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[7]  V S Ramachandran,et al.  Perceiving shape from shading. , 1988, Scientific American.

[8]  Dean Pomerleau,et al.  ALVINN, an autonomous land vehicle in a neural network , 2015 .

[9]  D. Hubel Eye, brain, and vision , 1988 .

[10]  J. O'Regan,et al.  Some results on translation invariance in the human visual system. , 1990, Spatial vision.

[11]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[12]  Theodosios Pavlidis,et al.  Why progress in machine vision is so slow , 1992, Pattern Recognit. Lett..