An Approach to Off-Line Handwritten Chinese Character Recognition Based on Hierarchical Radical Decomposition

Off-line recognition of handwritten Chinese characters is of considerable practical importance as well as representing a very hard pattern recognition problem. A popular approach is to decompose characters into their component or ‘primitive’ parts – most usually strokes. Here, however, we take the less usual approach of decomposing into radicals. Active shape modelling is applied and developed into active radical modelling. In training, 60 examples of each radical are represented by ‘landmark’ points, labelled semi-automatically, with radicals in different characteristic positions treated as distinctly different radicals. Principal component analysis then captures the main variation around the mean radical. In recognition, the dynamic tunnelling algorithm is incorporated with gradient descent to search for optimal shape parameters in terms of chamfer distance minimisation. Although prior landmark labelling is time-consuming and gradient descent search during recognition is computationally expensive, the method is theoretically well motivated, incorporates prior knowledge about the structure of Chinese characters in an appropriate way, and avoids problems implicit in stroke extraction. Experiments are conducted on 280,000 loosely constrained characters from 200 writers. There are 98 different categories of radical included in 1400 character categories, and approximately 590,000 radicals in total. The matching rate on this large test set is 94.2% radicals correct (writer-independent), greatly superior to existing radical approaches. Assuming character composition to be a Markov process in which up to four radicals are combined in some assumed sequential order, we can recognise complete, hierarchically composed characters using the Viterbi algorithm. This results in a character recognition rate of 92.6%.

[1]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[2]  Shi-Kuo Chang,et al.  An Interactive System for Chinese Character Generation and Retrieval , 1973, IEEE Trans. Syst. Man Cybern..

[3]  Qiang Huo,et al.  A Discrete Contextual Stochastic Model for the Offline Recognition of Handwritten Chinese Characters , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Daming Shi,et al.  Recognition rule acquisition by an advanced extension matrix algorithm , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[5]  Daming Shi,et al.  A radical approach to handwritten Chinese character recognition using active handwriting models , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  Kunihiko Fukushima,et al.  Character recognition with selective attention , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[7]  Yuan Yan Tang,et al.  Offline Recognition of Chinese Handwriting by Multifeature and Multilevel Classification , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[9]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[10]  Robert C. Bolles,et al.  Parametric Correspondence and Chamfer Matching: Two New Techniques for Image Matching , 1977, IJCAI.

[11]  Chien-Cheng Tseng,et al.  On-line chinese character recognition with effective candidate radical and candidate character selections , 1996, Pattern Recognit..

[12]  Daming Shi,et al.  Active radical modeling for handwritten Chinese characters , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[13]  Hong Yan,et al.  Recognition of handprinted Chinese characters by constrained graph matching , 1998, Image Vis. Comput..

[14]  John F. Kolen,et al.  Backpropagation is Sensitive to Initial Conditions , 1990, Complex Syst..

[15]  Gunilla Borgefors,et al.  Hierarchical Chamfer Matching: A Parametric Edge Matching Algorithm , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Hang Joon Kim,et al.  On-line Chinese character recognition using ART-based stroke classification , 1996, Pattern Recognit. Lett..

[17]  Yashwant Prasad Singh,et al.  Hybridization of gradient descent algorithms with dynamic tunneling methods for global optimization , 2000, IEEE Trans. Syst. Man Cybern. Part A.

[18]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[19]  Wentai Liu,et al.  Optical recognition of handwritten Chinese characters: Advances since 1980 , 1993, Pattern Recognit..

[20]  Suh-Yin Lee,et al.  On-Line Chinese Character Recognition via A Representation of Spatial Relationships between Strokes , 1997, Int. J. Pattern Recognit. Artif. Intell..

[21]  Jun S. Huang,et al.  A transformation invariant matching algorithm for handwritten chinese character recognition , 1990, Pattern Recognit..

[22]  Hang Joon Kim,et al.  On-line recognition of cursive Korean characters using graph representation , 2000, Pattern Recognit..

[23]  David L. Neuhoff,et al.  The Viterbi algorithm as an aid in text recognition (Corresp.) , 1975, IEEE Trans. Inf. Theory.

[24]  Fu-Lai Chung,et al.  Complex character decomposition using deformable model , 2001 .

[25]  Korris Fu-Lai Chung,et al.  Offline handwritten Chinese character recognition via radical extraction and recognition , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[26]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[27]  Michael Isard,et al.  Active Contours: The Application of Techniques from Graphics, Vision, Control Theory and Statistics to Visual Tracking of Shapes in Motion , 2000 .

[28]  Daming Shi,et al.  Neocognitron's Parameter Tuning by Genetic Algorithms , 1999, Int. J. Neural Syst..

[29]  Kuo-Chin Fan,et al.  Optical recognition of handwritten Chinese characters by hierarchical radical matching method , 2001, Pattern Recognit..

[30]  Yong Yao,et al.  Dynamic tunneling algorithm for global optimization , 1989, IEEE Trans. Syst. Man Cybern..

[31]  Roland T. Chin,et al.  One-Pass Parallel Thinning: Analysis, Properties, and Quantitative Evaluation , 1992, IEEE Trans. Pattern Anal. Mach. Intell..