Optimizing the integration of a statistical language model in HMM based offline handwritten text recognition

Although handwritten text recognition has been studied for some years, only few authors have used statistical language models to increase the performance of their recognizers. In those few cases where a language model has been used, its integration has not been systematically optimized. We investigate the optimization of the integration of statistical language models into HMM based recognition systems for offline handwritten text. Based on experiments with the IAM database we show that the recognition performance of a general offline handwritten text recognizer can be substantially improved.

[1]  Geoffrey Leech,et al.  The tagged LOB Corpus : user's manual , 1986 .

[2]  Sargur N. Srihari,et al.  Off-Line Cursive Script Word Recognition , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[4]  Hynek Hermansky,et al.  Towards increasing speech recognition error rates , 1995, Speech Commun..

[5]  F. Itakura,et al.  Balancing acoustic and linguistic probabilities , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[6]  Anthony J. Robinson,et al.  An Off-Line Cursive Handwriting Recognition System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  R. Rosenfeld,et al.  Two decades of statistical language modeling: where do we go from here? , 2000, Proceedings of the IEEE.

[8]  Horst Bunke,et al.  Using a Statistical Language Model to Improve the Performance of an HMM-Based Cursive Handwriting Recognition System , 2001, Int. J. Pattern Recognit. Artif. Intell..

[9]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[10]  Horst Bunke,et al.  Automatic segmentation of the IAM off-line database for handwritten English text , 2002, Object recognition supported by user interaction for service robots.

[11]  Horst Bunke,et al.  Hidden Markov model length optimization for handwriting recognition systems , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[12]  Nikos Fakotakis,et al.  An unconstrained handwriting recognition system , 2002, International Journal on Document Analysis and Recognition.

[13]  Christian Viard-Gaudin,et al.  N-gram and N-class models for on line handwriting recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[14]  Samy Bengio,et al.  Offline recognition of large vocabulary cursive handwritten text , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..