A preclassification Method for Handwritten Chinese Character Recognition via Fuzzy Rules and Seart Neural Net

In this paper, a method of character preclassification for handwritten Chinese character recognition is proposed. Since the number of Chinese characters is very large (at least 5401s for daily use), we employ two stages to reduce the candidates of an input character. In stage I, we extract the first set of primitive features from handwritten Chinese characters and use fuzzy rules to create four preclassification groups. The purpose in stage I is to reduce the candidates roughly. In stage II, we extract the second set of primitive features from handwritten Chinese characters and then use the Supervised Extended ART (SEART) as the classifier to generate preclassification classes for each preclassification group created in stage I. Since the number of characters in each preclassification class is smaller than that in the whole character set, the problem becomes simpler. In order to evaluate the proposed preclassification system, we use 605 Chinese character categories in the textbooks of elementary school as our training and testing data. The database used is HCCRBASE (provided by CCL, ITRI, Taiwan). In samples 1–100, we select the even samples as the training set, and the odd samples as the testing set. The characters of the testing set can be distributed into correct preclassification classes at a rate of 98.11%.

[1]  Zen Chen,et al.  Preclassification of handwritten Chinese characters based on basic stroke substructures , 1995, Pattern Recognit. Lett..

[2]  Fang-Hsuan Cheng,et al.  Research on Chinese OCR in Taiwan , 1991, Int. J. Pattern Recognit. Artif. Intell..

[3]  D. Hammerstrom,et al.  Neural networks at work , 1993, IEEE Spectrum.

[4]  Gang Bai Multifont Chinese character recognition using side-stroke-end feature , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[5]  Lin-Yu Tseng,et al.  A heuristic algorithm for the recognition of printed Chinese characters , 1995, IEEE Trans. Syst. Man Cybern..

[6]  Stephen Grossberg,et al.  Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system , 1991, Neural Networks.

[7]  Jhing-Fa Wang,et al.  Preclassification for handwritten chinese character recognition by a peripheral shape coding method , 1993, Pattern Recognit..

[8]  Gail A. Carpenter,et al.  Neural network models for pattern recognition and associative memory , 1989, Neural Networks.

[9]  Yung-Sheng Chen,et al.  A modified fast parallel algorithm for thinning digital patterns , 1988, Pattern Recognit. Lett..

[10]  Lin-Yu Tseng,et al.  A stroke extraction method for multifont Chinese characters based on the reduced special interval graph , 1995, IEEE Trans. Syst. Man Cybern..

[11]  K Fukushima,et al.  Handwritten alphanumeric character recognition by the neocognitron , 1991, IEEE Trans. Neural Networks.