Recognition of Handwritten Korean Address Strings by Tight-Coupling of Minimum Distance Classification and Dictionary-Based Post-Processing

In this paper, we propose a system for recognizing handwritten Korean ad- dresses. The input address is composed of four field images, and each field is first segmented into individual characters and then recognized by a character classifier in cooperation with a dictionary-based post-processing. In contrast to conventional approaches to word recogni- tion, we have combined the character classifier and post-processor as tightly-coupled as pos- sible to compensate the weakness of each other and to get a high recognition performance in the address interpretation. Higher performance of the proposed system has been proven by an experiment with a database of handwritten Korean addresses, in which more than 80% of 1,107 addresses are processed with less than 1% of reading error. Its processing speed is about two times faster than those of conventional approaches without post-processing. The proposed system can be useful for the interpretation of some constrained handwritten Korean addresses in forms, and can be extended easily to the recognition of unconstrained ones in general forms and mail envelops.

[1]  Seong-Whan Lee,et al.  Hidden Markov mesh random field: theory and its application to handwritten character recognition , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[2]  Seong-Whan Lee,et al.  Efficient postprocessing algorithms for error correction in handwritten Hangul address and human name recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[3]  Hsi-Jian Lee,et al.  Multi-stage pre-candidate selection in handwritten chinese character recognition systems , 1994, Pattern Recognit..

[4]  Yoshiyuki Yamashita,et al.  Classification of handprinted Kanji characters by the structured segment matching method , 1983, Pattern Recognit. Lett..

[5]  Daehwan Kim,et al.  Handwritten Korean character image database PE92 , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[6]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[7]  M. Berthod,et al.  Automatic recognition of handprinted characters—The state of the art , 1980, Proceedings of the IEEE.

[8]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[9]  Yann LeCun,et al.  Handwritten zip code recognition with multilayer networks , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[10]  李幼升,et al.  Ph , 1989 .

[11]  J. Tsukumo Handprinted Kanji character recognition based on flexible template matching , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[12]  Patrick J. Grother,et al.  The First Census Optical Character Recognition Systems Conference | NIST , 1992 .

[13]  Jorma Laaksonen,et al.  LVQ_PAK: The Learning Vector Quantization Program Package , 1996 .

[14]  Sargur N. Srihari,et al.  Integration of hand-written address interpretation technology into the United States Postal Service Remote Computer Reader system , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.