Unconstrained Japanese address recognition using a combination of spatial information and word knowledge

We describe a new handwritten address recognition method which can correct the errors occurring in line extraction, character segmentation, and character recognition as a possible means of avoiding the error accumulation which occurs during the recognition sequence in conventional methods. We formulate the address recognition method as a minimum cost search problem. We define the character recognition cost which estimates the reliability of the character recognition result, the arrangement cost which estimates the plausibility of the character string's spatial arrangement, and the word knowledge cost which estimates the plausibility of the linguistic conditions. By using a combination of these costs, the proposed method can recognize an address which has not been extracted as a single line from input images by a conventional method. The efficiency of the proposed method is evaluated through an experiment using 600 Japanese mail images. An address recognition rate of 79.38% was obtained.

[1]  Noboru Babaguchi,et al.  Constraint Satisfaction Approach to Extraction of Japanese Character Regions from Unformatted Document Image , 1995, IEICE Trans. Inf. Syst..

[2]  Keiji Yamada,et al.  Analysis of address layout on Japanese handwritten mail-a hierarchical process of hypothesis verification , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[3]  Keiji Kobayashi,et al.  Text recognition system for Japanese documents , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[4]  Hiroshi Murase Online recognition of free-format Japanese handwritings , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.