Segmentation of handprinted letter strings using a dynamic programming algorithm

Segmentation of handwritten input into individual characters is a crucial step in many connected handwriting recognition systems. This paper describes a segmentation algorithm for letters in Roman alphabets, curved pre-stroke cut (CPSC) segmentation. The CPSC algorithm evaluates a large set of curved cuts through the image of the input string using dynamic programming and selects a small "optimal" subset of cuts for segmentation. It usually generates pixel accurate segmentations, indistinguishable from characters written in isolation. At four times oversegmentation, segmentation points are missed with an undetectable frequency on real-world databases. The CPSC algorithm has been used as part of a high-performance handwriting recognition system.

[1]  Kunihiko Fukushima,et al.  Character recognition in cursive handwriting with the mechanism of selective attention , 1993, Systems and Computers in Japan.

[2]  Patrick J. Grother,et al.  The Second Census Optical Character Recognition Systems Conference , 1994 .

[3]  James D. Keeler,et al.  A Self-Organizing Integrated Segmentation and Recognition Neural Net , 1991, NIPS.

[4]  Fumitaka Kimura,et al.  Improvements of a lexicon directed algorithm for recognition of unconstrained handwritten words , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[5]  Anthony J. Robinson,et al.  An Off-Line Cursive Handwriting Recognition System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Gale Martin,et al.  Recognizing Overlapping Hand-Printed Characters by Centered-Object Integrated Segmentation and Recognition , 1991, NIPS.

[7]  Michel Gilloux,et al.  Strategies for handwritten words recognition using hidden Markov models , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[8]  Yann LeCun,et al.  Off Line Recognition of Handwritten Postal Words Using Neural Networks , 1993, Int. J. Pattern Recognit. Artif. Intell..

[9]  Kunihiko Fukushima,et al.  Recognition and segmentation of characters in handwriting with selective attention , 1992 .

[10]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Malayappan Shridhar,et al.  Context-directed segmentation algorithm for handwritten numeral strings , 1987, Image Vis. Comput..

[12]  Thomas M. Breuel,et al.  A system for the off-line recognition of handwritten text , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[13]  Yann LeCun,et al.  Multi-Digit Recognition Using a Space Displacement Neural Network , 1991, NIPS.

[14]  Thomas M. Breuel Design and Implementation of a System for the Recognition of Handwritten Responses on US Census Forms , 1994 .

[15]  Malayappan Shridhar,et al.  Recognition of isolated and simply connected handwritten numerals , 1986, Pattern Recognition.

[16]  Torsten Caesar,et al.  Preprocessing and feature extraction for a handwriting recognition system , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).