A neural network model for a mechanism of visual pattern recognition is proposed in this paper. The network is self-organized by “learning without a teacher”, and acquires an ability to recognize stimulus patterns based on the geometrical similarity (Gestalt) of their shapes without affected by their positions. This network is given a nickname “neocognitron”. After completion of self-organization, the network has a structure similar to the hierarchy model of the visual nervous system proposed by Hubel and Wiesel. The network consits of an input layer (photoreceptor array) followed by a cascade connection of a number of modular structures, each of which is composed of two layers of cells connected in a cascade. The first layer of each module consists of “S-cells”, which show characteristics similar to simple cells or lower order hypercomplex cells, and the second layer consists of “C-cells” similar to complex cells or higher order hypercomplex cells. The afferent synapses to each S-cell have plasticity and are modifiable. The network has an ability of unsupervised learning: We do not need any “teacher” during the process of self-organization, and it is only needed to present a set of stimulus patterns repeatedly to the input layer of the network. The network has been simulated on a digital computer. After repetitive presentation of a set of stimulus patterns, each stimulus pattern has become to elicit an output only from one of the C-cell of the last layer, and conversely, this C-cell has become selectively responsive only to that stimulus pattern. That is, none of the C-cells of the last layer responds to more than one stimulus pattern. The response of the C-cells of the last layer is not affected by the pattern's position at all. Neither is it affected by a small change in shape nor in size of the stimulus pattern.
[1]
D. Hubel,et al.
Receptive fields, binocular interaction and functional architecture in the cat's visual cortex
,
1962,
The Journal of physiology.
[2]
A. A. Mullin,et al.
Principles of neurodynamics
,
1962
.
[3]
É. D. L. Tour,et al.
Nouvelles observations concernant l’action du laurylsulfate de sodium sur la paroi et la membrane d’E. coli
,
1965
.
[4]
D H HUBEL,et al.
RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE IN TWO NONSTRIATE VISUAL AREAS (18 AND 19) OF THE CAT.
,
1965,
Journal of neurophysiology.
[5]
Ray S. Snider.
A Proposed Model for Visual Information Processing in the Human Brain
,
1967,
Neurology.
[6]
H. Giebel,et al.
Feature Extraction and Recognition of Handwritten Characters by Homogeneous Layers
,
1971
.
[7]
D. B. Bender,et al.
Visual properties of neurons in inferotemporal cortex of the Macaque.
,
1972,
Journal of neurophysiology.
[8]
D. Hubel,et al.
Ferrier lecture - Functional architecture of macaque monkey visual cortex
,
1977,
Proceedings of the Royal Society of London. Series B. Biological Sciences.
[9]
T. Wiesel,et al.
Functional architecture of macaque monkey visual cortex
,
1977
.
[10]
Shuichi Sato,et al.
(Invited) An Advanced MOS-IC Process Technology Using Oxidation of Oxygen-Doped Polycrystalline Silicon Films
,
1978
.
[11]
Kunihiko Fukushima.
Self-Organization of a Neural Network which Gives Position-Invariant Response
,
1979,
IJCAI.
[12]
P. Couturier.
Japan
,
1988,
The Lancet.
[13]
Kunihiko Fukushima,et al.
Cognitron: A self-organizing multilayered neural network
,
1975,
Biological Cybernetics.