Neural Object Recognition by Hierarchical Learning and Extraction of Essential Shapes

We present a hierarchical system for object recognition that models neural mechanisms of visual processing identified in the mammalian ventral stream. The system is composed of neural units organized in a hierarchy of layers with increasing complexity. A key feature of the system is that the neural units learn their preferred patterns from visual input alone. Through this "soft wiring" of neural units the system becomes tuned for target object classes through pure visual experience and with no prior labeling. Object labels are only introduced to train a classifier on the system's output. The system's tuning takes place in a feedforward path. We also present a neural mechanism for back projection of the learned image patterns down the hierarchical layers. This feedback mechanism could serve as a starting point for integration of what- and where-information processed by the ventral and dorsal stream. We test the neural system with natural images from publicly available datasets of natural scenes and handwritten digits.

[1]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[2]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[3]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Gustavo Deco,et al.  Computational neuroscience of vision , 2002 .

[5]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[6]  David G. Lowe,et al.  Multiclass Object Recognition with Sparse, Localized Features , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  Dariu Gavrila,et al.  An Experimental Study on Pedestrian Classification , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[9]  E. Rolls,et al.  A Neurodynamical cortical model of visual attention and invariant object recognition , 2004, Vision Research.

[10]  Tomaso Poggio,et al.  Fast Readout of Object Identity from Macaque Inferior Temporal Cortex , 2005, Science.

[11]  R. Mansfield,et al.  Analysis of visual behavior , 1982 .

[12]  I. Guyon,et al.  Handwritten digit recognition: applications of neural network chips and automatic learning , 1989, IEEE Communications Magazine.

[13]  Leslie G. Ungerleider Two cortical visual systems , 1982 .

[14]  Thomas Serre,et al.  A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex , 2005 .