Classification of Japanese Kanji using principal component analysis as a preprocessor to an artificial neural network

Applies principal component analysis (PCA) to the problem of classifying handwritten Kanji characters. PCA is a statistical tool which can yield substantial data reduction by representing each pattern in terms of a relatively small subset of orthonormal features (principal components) extracted from the input set. A PCA preprocessor to an artificial neural network has been used to reduce the dimensionality of a set of handwritten Kanji patterns to less than 5% of that of the original images. Reconstructions of the patterns from the preprocessed versions are quite impressive. Preliminary results yield nearly 90% correct classification of exemplars of 40 different Kanji characters, and also indicate that reconstruction requires more information than classification. These results demonstrate the effectiveness of PCA as a preprocessor for neural networks.<<ETX>>

[1]  J. Rubner,et al.  Development of feature detectors by self-organization , 2004, Biological Cybernetics.

[2]  J. Rubner,et al.  A Self-Organizing Network for Principal-Component Analysis , 1989 .

[3]  Ralph Linsker,et al.  Self-organization in a perceptual network , 1988, Computer.

[4]  E. Oja Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[5]  Garth S. Barbour,et al.  Classification of handwritten digits and Japanese Kanji , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[6]  T. P. Vogl,et al.  Dynamically stable associative learning (DYSTAL): a biologically motivated artificial neural network , 1989, International 1989 Joint Conference on Neural Networks.