Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position

A neural network model for a mechanism of visual pattern recognition is proposed in this paper. The network is self-organized by “learning without a teacher”, and acquires an ability to recognize stimulus patterns based on the geometrical similarity (Gestalt) of their shapes without affected by their positions. This network is given a nickname “neocognitron”. After completion of self-organization, the network has a structure similar to the hierarchy model of the visual nervous system proposed by Hubel and Wiesel. The network consits of an input layer (photoreceptor array) followed by a cascade connection of a number of modular structures, each of which is composed of two layers of cells connected in a cascade. The first layer of each module consists of “S-cells”, which show characteristics similar to simple cells or lower order hypercomplex cells, and the second layer consists of “C-cells” similar to complex cells or higher order hypercomplex cells. The afferent synapses to each S-cell have plasticity and are modifiable. The network has an ability of unsupervised learning: We do not need any “teacher” during the process of self-organization, and it is only needed to present a set of stimulus patterns repeatedly to the input layer of the network. The network has been simulated on a digital computer. After repetitive presentation of a set of stimulus patterns, each stimulus pattern has become to elicit an output only from one of the C-cell of the last layer, and conversely, this C-cell has become selectively responsive only to that stimulus pattern. That is, none of the C-cells of the last layer responds to more than one stimulus pattern. The response of the C-cells of the last layer is not affected by the pattern's position at all. Neither is it affected by a small change in shape nor in size of the stimulus pattern.

[1]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[2]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[3]  É. D. L. Tour,et al.  Nouvelles observations concernant l’action du laurylsulfate de sodium sur la paroi et la membrane d’E. coli , 1965 .

[4]  D H HUBEL,et al.  RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE IN TWO NONSTRIATE VISUAL AREAS (18 AND 19) OF THE CAT. , 1965, Journal of neurophysiology.

[5]  Ray S. Snider A Proposed Model for Visual Information Processing in the Human Brain , 1967, Neurology.

[6]  H. Giebel,et al.  Feature Extraction and Recognition of Handwritten Characters by Homogeneous Layers , 1971 .

[7]  D. B. Bender,et al.  Visual properties of neurons in inferotemporal cortex of the Macaque. , 1972, Journal of neurophysiology.

[8]  D. Hubel,et al.  Ferrier lecture - Functional architecture of macaque monkey visual cortex , 1977, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[9]  T. Wiesel,et al.  Functional architecture of macaque monkey visual cortex , 1977 .

[10]  Shuichi Sato,et al.  (Invited) An Advanced MOS-IC Process Technology Using Oxidation of Oxygen-Doped Polycrystalline Silicon Films , 1978 .

[11]  Kunihiko Fukushima Self-Organization of a Neural Network which Gives Position-Invariant Response , 1979, IJCAI.

[12]  P. Couturier Japan , 1988, The Lancet.

[13]  Kunihiko Fukushima,et al.  Cognitron: A self-organizing multilayered neural network , 1975, Biological Cybernetics.