Easily adaptable handwriting recognition in historical manuscripts

As libraries increasingly digitize their collections, a growing number of scanned manuscripts cannot be transcribed by current OCR and handwriting recognition techniques, because those systems are not trained on the scripts in which the manuscripts are written. Documents in this category range from illuminated medieval manuscripts to handwritten letters to early printed works. Without transcriptions, these documents remain unsearchable. Unfortunately, adapting an existing system to a new script requires a user to manually label large amounts of text in the target font. Some systems require the user to segment and label instances of each glyph by hand. Others allow less costly training, in which the user segments and labels entire lines of text instead of individual characters. Still, the collections we consider are extremely diverse, to the extent that in some cases almost every document is written in a different style. As a result, the cost of manually transcribing dozens of lines of text for each font is prohibitively high. In this dissertation, we introduce methods that significantly reduce the manual labor involved in adapting a character recognizer to new scripts. Rather than forcing a user to transcribe portions of each target document, our system leverages general language statistics to identify regions of the document from which it can automatically extract new training exemplars. Unlike document-specific transcriptions, these language statistics can be generated in a largely unsupervised manner, allowing our system to automate the process of building a model of a script. We demonstrate the effectiveness of the resulting model by using it to build a search engine for a medieval illuminated manuscript.
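To make the idea of "general language statistics" concrete, the following is a minimal sketch of one way such statistics might be gathered without supervision: a character bigram model estimated from plain, unlabelled text in the target language. The abstract does not say which statistics the system uses, so the bigram model, the add-one smoothing, the function names `train_bigram_model` and `log_prob`, and the sample Latin text are all assumptions made for illustration, not the dissertation's method.

```python
import math
from collections import Counter


def train_bigram_model(corpus: str):
    """Count character unigrams and bigrams in unlabelled text.

    Hypothetical helper: no transcription of the target document is needed,
    only ordinary text in the same language.
    """
    text = corpus.lower()
    unigrams = Counter(text)
    bigrams = Counter(zip(text, text[1:]))
    vocab_size = len(set(text))
    return unigrams, bigrams, vocab_size


def log_prob(candidate: str, model) -> float:
    """Log-probability of a candidate character string under the bigram model.

    Add-one smoothing keeps unseen character pairs from having zero probability.
    """
    unigrams, bigrams, vocab_size = model
    text = candidate.lower()
    total = 0.0
    for prev, cur in zip(text, text[1:]):
        total += math.log((bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size))
    return total


# Illustrative corpus only; a real model would be estimated from far more text.
corpus = "in principio erat verbum et verbum erat apud deum et deus erat verbum"
model = train_bigram_model(corpus)

plausible = log_prob("verbum", model)    # character sequence seen in the corpus
implausible = log_prob("vxrbqm", model)  # same length, unlikely character pairs
print(plausible > implausible)
```

In a sketch like this, a region whose tentative character labels score highly under the language model could be treated as a more trustworthy source of training exemplars than one whose labels score poorly. How the actual system selects regions is not specified in the abstract.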
