A candidate reduction method for handwritten Kanji character recognition

This paper describes a method for reducing the number of candidates in Kanji character recognition. Such reduction is vital to increasing the speed of such applications as Kanji address recognition. Further, it also reduces the probability of misreadings in linguistic postprocessing. First, we define a confidence value by which we can express the potential correctness of recognition candidates. Next, we define "accumulated confidence value" and use it as a measure to express a threshold for candidate acceptance in Kanji character recognition. The efficiency of the proposed method is evaluated in an experiment using IPTP CD-ROM2 Japanese address images. Results show that a roughly 31% reduction in the number of candidates was obtained without reducing the number of correct candidates.

[1]  Atsushi Sato,et al.  A formulation of learning vector quantization using a new misclassification measure , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[2]  Masaki Nakagawa,et al.  Precise Candidate Selection for Large Character Set Recognition by Confidence Evaluation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  K. Yamada,et al.  On-line Japanese character recognition experiments by an off-line method based on normalization-cooperated feature extraction , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).