Use of adaptive segmentation in handwritten phrase recognition

Abstract Research in handwriting recognition has thus far been primarily focused on recognizing words and phrases. In fact, phrases are usually treated as a concatenation of the constituent words making it in essence an enhanced word recognizer. In this paper we present a methodology that will take advantage of the spacing between the words in a phrase to aid the recognition process. The novelty of our approach lies in the fact that the determination of word breaks is made in a manner that adapts to the writing style of the individual. The parameters that decide whether a particular gap between components is an inter-word gap or an inter-character gap are computed without the necessity of generalizing over a large training set. Rather, it is tuned to the distribution of the gaps within the instance of the phrase image being examined. We compare our approach to the methods described in the literature that simply ignore the significance of gaps in a phrase. Our experiments show an improvement of about 5% in recognition rates. On a test set of about 1400 phrase images the segmentation method “misses” only 2% of the true word break points.

[1]  Geetha Srikantan,et al.  A multiple feature/resolution approach to handprinted digit and character recognition , 1996 .

[2]  Gyeonghwan Kim,et al.  A Lexicon Driven Approach to Handwritten Word Recognition for Real-Time Applications , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Gyeonghwan Kim,et al.  Handwritten phrase recognition as applied to street name images , 1998, Pattern Recognit..

[4]  Sargur N. Srihari,et al.  Off-Line Cursive Script Word Recognition , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Sargur N. Srihari,et al.  Control Structure for Interpreting Handwritten Addresses , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Sargur N. Srihari,et al.  Understanding Handwritten Text in a Structured Environment: Determining ZIP Codes from Addresses , 1991, Int. J. Pattern Recognit. Artif. Intell..

[7]  Ken Thompson,et al.  Reading Chess , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Michael Brady Toward a Computational Theory of Early Visual Processing in Reading. , 1980 .

[9]  Uma Mahadevan,et al.  Gap metrics for word separation in handwritten lines , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[10]  J.-C. Simon,et al.  Off-line cursive word recognition , 1992, Proc. IEEE.

[11]  Michael D. Garris,et al.  Unconstrained handprint recognition using a limited lexicon , 1994, Electronic Imaging.

[12]  Ishwar K. Sethi,et al.  Off-line cursive handwriting segmentation , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[13]  Gyeonghwan Kim,et al.  An architecture for handwritten text recognition systems , 1999, International Journal on Document Analysis and Recognition.

[14]  Anthony J. Robinson,et al.  An Off-Line Cursive Handwriting Recognition System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..