Local skew angle estimation from background space in text regions

Almost all document analysis approaches perform a global analysis of the page orientation as a separate process at an early stage. It would be preferable to estimate the orientation locally, after page segmentation and classification, when more knowledge about the different regions is available. A novel local skew estimation method is presented that takes advantage of the information available after flexible and efficient page segmentation and classification methods have been applied to the document image. The proposed method accurately estimates the orientation of individual text regions by efficiently analysing the arrangement of the background space contained in them. No assumption is made about the existence of a uniform or dominant orientation in the document. The whole process is very efficient, as only the text regions are considered and the points used for the angle estimation are already available as by-products of previous document analysis stages.
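The abstract does not say how an angle is derived from the background points, so the following is only an illustrative sketch under stated assumptions, not the method of the paper. It assumes that each inter-line white gap in a text region yields a list of (x, y) points, fits the principal axis through each list, and takes the median of the per-gap angles as the region's skew. The names `line_angle_degrees` and `region_skew_degrees` are hypothetical.

```python
import math
from statistics import median


def line_angle_degrees(points):
    """Return the orientation, in degrees, of the principal axis through points.

    Assumption: `points` is a list of (x, y) pairs sampled along one
    inter-line white gap. The result lies in (-90, 90]. In image
    coordinates, where y grows downward, the sign is mirrored relative
    to the usual mathematical convention.
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points to estimate an angle")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Second-order central moments of the point set.
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    syy = sum((y - mean_y) ** 2 for _, y in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    # Principal-axis orientation (equivalent to a total least squares fit).
    return math.degrees(0.5 * math.atan2(2.0 * sxy, sxx - syy))


def region_skew_degrees(gaps):
    """Combine per-gap angles into one estimate for a text region.

    Assumption: `gaps` is a list of point lists, one per white gap.
    The median is used so that a single irregular gap does not
    dominate the estimate.
    """
    return median(line_angle_degrees(gap) for gap in gaps)
```

Because each region is handled independently, a page containing regions at different orientations would simply produce a different value per region, which matches the abstract's point that no dominant page orientation is assumed.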
