Analysis of line structure in handwritten documents using the Hough transform

A central task in analyzing handwritten documents is determining the line structure of the text: the number of text lines, the locations of their start and end points, the line width, and so on. Simple methods can handle ideal images, but real-world documents present complexities such as overlapping line structure, variable line spacing, line skew, document skew, and noisy or degraded images. This paper explores the application of the Hough transform to handwritten documents, with the goal of automatically determining global document line structure in a top-down manner; the result can then be used in conjunction with a bottom-up method such as connected component analysis. The approach performs significantly better than other top-down methods such as the projection profile method. In addition, we evaluate the performance of Hough-transform skew analysis on handwritten documents.
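To make the top-down idea concrete, the following is a minimal sketch, not the paper's implementation. It assumes the standard normal parameterization of a line, rho = x*cos(theta) + y*sin(theta), and that foreground pixels (or component centroids) are already available as (x, y) points. The function name `estimate_line_structure`, its parameters, and the sum-of-squared-votes score used to pick the dominant angle are all illustrative assumptions.

```python
import math
from collections import Counter


def estimate_line_structure(points, max_skew_deg=5.0, angle_step_deg=1.0,
                            rho_step=1.0, min_votes=20):
    """Return (skew_degrees, sorted rho bins of detected text lines).

    Hypothetical helper: votes in (theta, rho) space and keeps the angle
    whose accumulator column is most sharply peaked.
    """
    steps = int(max_skew_deg / angle_step_deg)
    # Near-horizontal text lines have a normal angle close to 90 degrees.
    thetas = [90.0 + k * angle_step_deg for k in range(-steps, steps + 1)]

    best_theta, best_score, best_column = None, -1, None
    for theta in thetas:
        t = math.radians(theta)
        c, s = math.cos(t), math.sin(t)
        column = Counter()
        for x, y in points:
            # Each point votes for the rho bin of the line through it.
            column[round((x * c + y * s) / rho_step)] += 1
        # Assumed score: sum of squared votes rewards concentrated peaks.
        score = sum(n * n for n in column.values())
        if score > best_score:
            best_theta, best_score, best_column = theta, score, column

    # Bins with enough votes at the dominant angle are taken as text lines.
    lines = sorted(rho for rho, n in best_column.items() if n >= min_votes)
    return best_theta - 90.0, lines
```

Called on points sampled from two parallel baselines, the function returns the shared skew angle and one rho bin per baseline. A bottom-up step such as connected component analysis could then assign components to the nearest detected line; that step is not shown here.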
