A two-stage handwritten character segmentation approach in mail address recognition

Character segmentation has become a crucial step for mail address recognition in the automatic post mail sorting system. In this paper, a two-stage character segmentation algorithm according to the characteristics of handwritten mail address characters is proposed. In the simple segmentation stage, the block sequence is extracted from the mail address image using the structure-based methods, including projection profile analysis, connected components analysis and stroke cross number analysis. In the precise segmentation stage, all candidate segmentation paths are created by combining the neighboring blocks and represented with a candidate segmentation graph first. Then several optimal candidate paths are selected from the graph by dynamic programming searching based on recognition confidence. Finally the best segmentation path is determined by matching these paths with the known post address database. In the experiment on more than 500 real envelop images with the this approach, the correct sorting rate of address recognition is up to 79.46% and that of address-postcode integrated recognition is up to 96.26%.

[1]  Malayappan Shridhar,et al.  A segmentation system for touching handwritten Japanese characters , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[2]  Hsi-Jian Lee,et al.  Recognition-based handwritten Chinese character segmentation using a probabilistic Viterbi algorithm , 1999, Pattern Recognit. Lett..

[3]  Yi Lu,et al.  Machine printed character segmentation --; An overview , 1995, Pattern Recognit..

[4]  Eric Lecolinet,et al.  A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Tomohiro Yoshikawa,et al.  A segmentation method for touching Japanese handwritten characters based on connecting condition of lines , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[6]  H. Fujisawa,et al.  Segmentation of Japanese handwritten characters using peripheral feature analysis , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[7]  J. Tsukumo,et al.  A segmentation method for handwritten Japanese character lines based on transitional information , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[8]  Cheng-Lin Liu,et al.  Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  S. Ariyoshi A character segmentation method for Japanese printed documents coping with touching character problems , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.