An evaluation of the neocognitron

We describe a sequence of experiments investigating the strengths and limitations of Fukushima's neocognitron as a handwritten digit classifier. Using the results of these experiments as a foundation, we propose and evaluate improvements to Fukushima's original network in an effort to obtain higher recognition performance. The neocognitron performance is shown to be strongly dependent on the choice of selectivity parameters and we present two methods to adjust these variables. Performance of the network under the more effective of the two new selectivity adjustment techniques suggests that the network fails to exploit the features that distinguish different classes of input data. To avoid this shortcoming, the network's final layer cells were replaced by a nonlinear classifier (a multilayer perceptron) to create a hybrid architecture. Tests of Fukushima's original system and the novel systems proposed in this paper suggest that it may be difficult for the neocognitron to achieve the performance of existing digit classifiers due to its reliance upon the supervisor's choice of selectivity parameters and training data.

[1]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[2]  K. Johnson,et al.  Feature extraction in the Neocognitron , 1988, IEEE 1988 International Conference on Neural Networks.

[3]  Ah Chung Tsoi,et al.  Comments on 'Optimal training of thresholded linear correlation classifiers' [with reply] , 1993, IEEE Trans. Neural Networks.

[4]  Kunihiko Fukushima,et al.  Analysis of the process of visual pattern recognition by the neocognitron , 1989, Neural Networks.

[5]  K Fukushima,et al.  Handwritten alphanumeric character recognition by the neocognitron , 1991, IEEE Trans. Neural Networks.

[6]  Ching Y. Suen,et al.  Historical review of OCR research and development , 1992, Proc. IEEE.

[7]  Hayaru Shouno,et al.  Training neocognitron to recognize handwritten digits in the real world , 1997, Proceedings of IEEE International Symposium on Parallel Algorithms Architecture Synthesis.

[8]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[9]  John S. Denker,et al.  Improving Rejection Performance on Handwritten Digits by Training with Rubbish , 1993, Neural Computation.

[10]  Masashi Tanigawa,et al.  Use of different thresholds in learning and recognition , 1996, Neurocomputing.

[11]  Takayuki Ito,et al.  Neocognitron: A neural network model for a mechanism of visual pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[12]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[13]  David Lovell,et al.  The neocognitron as a system for handwritten character recognition : limitations and improvements , 1994 .

[14]  Kunihiko Fukushima,et al.  Improved neocognitron with bend-detecting cells , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[15]  Kunihiko Fukushima,et al.  Neocognitron: A hierarchical neural network capable of visual pattern recognition , 1988, Neural Networks.

[16]  Kunihiko Fukushima,et al.  Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position , 1982, Pattern Recognit..

[17]  T. H. Hildebrandt,et al.  Optimal training of thresholded linear correlation classifiers , 1991, IEEE Trans. Neural Networks.

[18]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[19]  A. M. Chiang,et al.  A CCD programmable image processor and its neural network applications , 1991 .

[20]  Jonathan J. Hull,et al.  Concerns in Creation of Image Databases * , 1993 .

[21]  Nabil Jean Naccache,et al.  SPTA: A proposed algorithm for thinning binary patterns , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[22]  Sargur N. Srihari,et al.  Understanding Handwritten Text in a Structured Environment: Determining ZIP Codes from Addresses , 1991, Int. J. Pattern Recognit. Artif. Intell..

[23]  Ah Chung Tsoi,et al.  TRAINED USING FUKUSHIMA'S METHOD (NC-F) AND HILDEBRANDT'S CLOSED-FORM TRAINING ALGORITHM (NC-H) Comments on "Optimal Training of Thresholded Linear Correlation Classifiers" Network %CE?zy % Misclassified % Unclassified , 1993 .

[24]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[25]  Etienne Barnard,et al.  Shift invariance and the neocognitron , 1990, Neural Networks.

[26]  Philip D. Wasserman,et al.  Neural computing - theory and practice , 1989 .