Improvement in handwritten numeral string recognition by slant normalization and contextual information

This work describes a way of enhancing handwritten numeral string recognition by considering slant normalization and contextual information to train an implicit segmentation­based system. A word slant normalization method is modified in order to improve the results for handwritten numeral strings. We assume that each connected component (CC) in the string has its own slant. The slant and contour length of each CC are used for obtaining the mean slant of the string. Both the original and modified methods are evaluated by means of some interesting analyses on the NIST SD19 database. These analyses show (a) the positive impact of slant correction on the number of overlapping numerals in strings, and (b) the difference in normalizing isolated numerals based on the slant estimated from their own images and the slant estimated from their original string images. Slant normalization and contextual information regarding string slant and digit size variations within the string are used to train numeral HMMs. Preliminary string recognition results, produced by a system under construction, are shown.

[1]  Sargur N. Srihari,et al.  Off-Line Cursive Script Word Recognition , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Fumitaka Kimura,et al.  Improvements of a lexicon directed algorithm for recognition of unconstrained handwritten words , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[3]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[4]  Zsolt Miklós Kovács-Vajna,et al.  A system for reading USA census '90 hand-written fields , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[5]  John Illingworth,et al.  The recognition of handwritten digit strings of unknown length using hidden Markov models , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[6]  Ching Y. Suen,et al.  Sorting and Recognizing Cheques and Financial Documents , 1998, Document Analysis Systems.

[7]  Robert Sabourin,et al.  An HMM-Based Approach for Off-Line Unconstrained Handwritten Word Modeling and Recognition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..