Chinese character recognition: history, status and prospects

Chinese character recognition (CCR) is an important branch of pattern recognition. It was considered as an extremely difficult problem due to the very large number of categories, complicated structures, similarity between characters, and the variability of fonts or writing styles. Because of its unique technical challenges and great social needs, the last four decades witnessed the intensive research in this field and a rapid increase of successful applications. However, higher recognition performance is continuously needed to improve the existing applications and to exploit new applications. This paper first provides an overview of Chinese character recognition and the properties of Chinese characters. Some important methods and successful results in the history of Chinese character recognition are then summarized. As for classification methods, this article pays special attention to the syntactic-semantic approach for online Chinese character recognition, as well as the metasynthesis approach for discipline crossing. Finally, the remaining problems and the possible solutions are discussed.

[1]  Michio Umeda Advances in Recognition Methods for Handwritten Kanji Characters (Special issue on Character Recognition and Document Understanding) , 1996 .

[2]  Kazuhiko Yamamoto,et al.  Research on Machine Recognition of Handprinted Characters , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Patrick Suppes,et al.  Syntactic Methods in Pattern Recognition (K. S. Fu) , 1977 .

[4]  Kohji Fukunaga,et al.  Introduction to Statistical Pattern Recognition-Second Edition , 1990 .

[5]  Chunheng Wang,et al.  Parallel compact integration in handwritten Chinese character recognition , 2007, Science in China Series F: Information Sciences.

[6]  Wentai Liu,et al.  Optical recognition of handwritten Chinese characters: Advances since 1980 , 1993, Pattern Recognit..

[7]  J. Tsukumo Handprinted Kanji character recognition based on flexible template matching , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[8]  Sargur N. Srihari,et al.  Gradient-based contour encoding for character recognition , 1996, Pattern Recognit..

[9]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[11]  Masaki Nakagawa,et al.  Evaluation of prototype learning algorithms for nearest-neighbor classifier in application to handwritten character recognition , 2001, Pattern Recognit..

[12]  King-Sun Fu,et al.  A Syntactic Approach to Shape Recognition Using Attributed Grammars , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  A. Tanaka,et al.  Online recognition of freely handwritten Japanese characters using directional feature densities , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[14]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[15]  Y. J. Liu,et al.  CHINESE CHARACTER RECOGNITION , 1990 .

[16]  Fumitaka Kimura,et al.  Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Donald E. Knuth Semantics of context-free languages: Correction , 2005, Mathematical systems theory.

[18]  Tetsushi Wakabayashi,et al.  Improvement of handwritten Japanese character recognition using weighted direction code histogram , 1997, Pattern Recognit..

[19]  Rui Zhang,et al.  Adaptive confidence transform based classifier combination for Chinese character recognition , 1998, Pattern Recognit. Lett..

[20]  King-Sun Fu,et al.  Error-Correcting Isomorphisms of Attributed Relational Graphs for Pattern Analysis , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  Masaki Nakagawa,et al.  'Online recognition of Chinese characters: the state-of-the-art , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Ryoji Haruki,et al.  Two-dimensional extension of nonlinear normalization method using line density for character recognition , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[23]  Hongwei Hao,et al.  Handwritten Chinese character recognition by metasynthetic approach , 1997, Pattern Recognit..

[24]  Chunheng Wang,et al.  Adaptive combination of classifiers and its application to handwritten Chinese character recognition , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[25]  Ching Y. Suen,et al.  Analysis and Design of a Decision Tree Based on Entropy Reduction and Its Application to Large Character Set Recognition , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Nei Kato,et al.  High Accuracy Recognition of ETL9B Using Exclusive Learning Neural Network-II : ELNET-II (Special Issue on Character Recognition and Document Understanding) , 1996 .

[27]  Cheng-Lin Liu,et al.  Handwritten digit recognition: benchmarking of state-of-the-art techniques , 2003, Pattern Recognit..

[28]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[29]  J. Tsukumo,et al.  Classification of handprinted Chinese characters using nonlinear normalization and correlation methods , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[30]  Hiromitsu Yamada,et al.  A nonlinear normalization method for handprinted kanji character recognition - line density equalization , 1990, Pattern Recognit..

[31]  Cheng-Lin Liu,et al.  Pseudo two-dimensional shape normalization methods for handwritten Chinese character recognition , 2005, Pattern Recognit..

[32]  William Stallings,et al.  Approaches to chinese character recognition , 1976, Pattern Recognit..

[33]  Xiaoqing Ding,et al.  Handwritten character recognition using gradient feature and quadratic classifier with multiple discrimination schemes , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[34]  Taylor L. Booth,et al.  Grammatical Inference: Introduction and Survey-Part I , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Jin Hyung Kim,et al.  Statistical Character Structure Modeling and Its Application to Handwritten Chinese Character Recognition , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Kazuhiro Sawa,et al.  Accuracy Improvement by Gradient Feature and Variance Absorbing Covariance Matrix in Handwritten Chinese Character Recognition , 1999 .

[37]  George Nagy,et al.  Recognition of Printed Chinese Characters , 1966, IEEE Trans. Electron. Comput..

[38]  Ching Y. Suen,et al.  Application of a Multilayer Decision Tree in Computer Recognition of Chinese Characters , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Nei Kato,et al.  A Handwritten Character Recognition System Using Directional Element Feature and Asymmetric Mahalanobis Distance , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[41]  Yoshiyuki Yamashita,et al.  Classification of handprinted Kanji characters by the structured segment matching method , 1983, Pattern Recognit. Lett..

[42]  King-Sun Fu,et al.  Attributed Grammar-A Tool for Combining Syntactic and Statistical Approaches to Pattern Recognition , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[43]  Yuan Yan Tang,et al.  Offline Recognition of Chinese Handwriting by Multifeature and Multilevel Classification , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  K. S. Fu,et al.  Syntactic Pattern Recognition and its Applications to Signal Processing , 1978 .

[45]  Dai Ruwei,et al.  A new discipline of science — The study of open complex giant system and its methodology , 1993 .

[46]  Robert P. W. Duin,et al.  Linear dimensionality reduction via a heteroscedastic extension of LDA: the Chernoff criterion , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Cheng-Lin Liu,et al.  High Accuracy Handwritten Chinese Character Recognition Using Quadratic Classifiers with Discriminative Feature Extraction , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[48]  King-Sun Fu,et al.  A Pattern Deformational Model and Bayes Error-Correcting Recognition System , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[49]  Taizo Iijima,et al.  A Theory of Character Recognition by Pattern Matching Method , 1974 .

[50]  George Nagy,et al.  Pattern Recognition 1966 IEEE Workshop , 1967, IEEE Spectrum.