History of the Tesseract OCR engine: what worked and what didn't

This paper describes the development history of the Tesseract OCR engine and compares its methods with general changes in the field over the same period. Emphasis is placed on the lessons learned, with the goal of providing a primer for those interested in OCR research.
