A new scheme for rectifying recognition results of printed chinese characters

Abstract Chinese character recognition error occurs when the correct character is not the highest rank character in a candidate set or, more seriously, is excluded from the set. This paper presents a new scheme aimed at rectifying the above error in printed Chinese character recognition. The rectification scheme employs a line-segment cancellation method with low sensitivity to misshapen line segments caused by noise and hence powerful discriminating ability to inspect character images. In addition, the scheme can determine possibly invalid candidate sets, which may exclude the correct characters, and then, for the invalid sets, retrieve additional possibly correct characters to be inspected with the cancellation method. Therefore, the scheme is not only able to pick out the correct character not only in the highest rank position but can also recover the correct character originally excluded from a candidate set. Experimental results showed that 75% of the recognition errors for normal quality documents and 48% for low quality ones were rectified by our new rectification scheme.

[1]  Kim-Teng Lua,et al.  Recognizing chinese characters through interactive activation and competition , 1990, Pattern Recognit..

[2]  Tao Hong,et al.  Image-based keyword recognition in oriental language document images , 1997, Pattern Recognit..

[3]  Makoto Nagao Shape Recognition by Human-Like Trial and Error Random Processes , 1996, Int. J. Pattern Recognit. Artif. Intell..

[4]  Jiangying Zhou,et al.  Discrimination of characters by a multi-stage recognition process , 1994, Pattern Recognit..

[5]  V. K. Govindan,et al.  Character recognition - A review , 1990, Pattern Recognit..

[6]  Hsi-Jian Lee,et al.  A language model based on semantically clustered words in a Chinese character recognition system , 1997, Pattern Recognit..

[7]  Yung-Sheng Chen,et al.  A comparison of some one-pass parallel thinnings , 1990, Pattern Recognit. Lett..

[8]  Bin Chen,et al.  Recognition of handwritten Chinese characters via short line segments , 1992, Pattern Recognit..

[9]  Gary Geunbae Lee,et al.  Multi-level post-processing for Korean character recognition using morphological analysis and linguistic evaluation , 1997, Pattern Recognit..

[10]  Chao-Huang Chang Simulated annealing clustering of Chinese words for contextual text recognition , 1996, Pattern Recognit. Lett..

[11]  Hsi-Jian Lee,et al.  Increasing character recognition accuracy by detection and correction of erroneously identified characters , 1994, Pattern Recognit..

[12]  Raymond W. Smith,et al.  Computer processing of line images: A survey , 1987, Pattern Recognit..

[13]  Kazuhiko Yamamoto,et al.  Research on Machine Recognition of Handprinted Characters , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.