Finding the Best-Fit Bounding-Boxes

The bounding-box of a geometric shape in 2D is the rectangle with the smallest area in a given orientation (usually upright) that completely contains the shape. The best-fit bounding-box is the smallest bounding-box over all possible orientations of the same shape. In the context of document image analysis, the shapes can be characters (individual components) or paragraphs (component groups). This paper presents a search algorithm for the best-fit bounding-boxes of textual component groups, whose shapes are customarily rectangular in almost all languages. One application of best-fit bounding-boxes is skew estimation from the text blocks in document images. The approach supports multi-skew estimation and location, and it can process documents with sparse text regions. The University of Washington English Document Image Database (UW-I) is used to verify the skew estimation method directly and the proposed best-fit bounding-box algorithm indirectly.
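To make the definition concrete, the following is a minimal sketch of a best-fit bounding-box search by exhaustive angle sweep. It is not the search algorithm proposed in the paper, which is not described in this abstract. The function names, the point-list representation of a component group, and the search range and step size are all assumptions chosen for illustration.

```python
import math


def bounding_box_area(points, angle_deg):
    """Area of the smallest box at orientation `angle_deg` that contains `points`.

    The points are rotated by -angle_deg so that a box at that orientation
    becomes an upright (axis-aligned) box, whose area is easy to compute.
    """
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    xs = [x * c + y * s for x, y in points]
    ys = [-x * s + y * c for x, y in points]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))


def best_fit_angle(points, lo=-45.0, hi=45.0, step=0.1):
    """Brute-force sweep (an assumed strategy): try every angle in [lo, hi]
    at the given step and keep the one with the smallest box area."""
    n = int(round((hi - lo) / step))
    best_angle, best_area = lo, float("inf")
    for i in range(n + 1):
        angle = lo + i * step
        area = bounding_box_area(points, angle)
        if area < best_area:
            best_angle, best_area = angle, area
    return best_angle, best_area


def rotate(points, angle_deg):
    """Helper for the example: rotate points counter-clockwise about the origin."""
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    return [(x * c - y * s, x * s + y * c) for x, y in points]


# A 10 x 2 "text block" skewed by 5 degrees.
block = rotate([(0, 0), (10, 0), (10, 2), (0, 2)], 5.0)
angle, area = best_fit_angle(block)
```

For a rectangular text block, the orientation of the best-fit box serves as an estimate of the block's skew, which is the application the abstract describes. The sweep costs one pass over the points per candidate angle, so a practical implementation would likely use a smarter search.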
