An improved document skew angle estimation technique

When a document is fed to a scanner for digitization, either mechanically or by a human operator, it usually suffers from some degree of skew or tilt. Skew angle detection is therefore an important component of any Optical Character Recognition (OCR) and document analysis system. In this letter we consider skew estimation for documents in Roman script. The method takes the lowermost and uppermost pixels of some selected characters of the text, which may then be subjected to the Hough transform for skew angle detection. A fast approach is also proposed which works almost as accurately as the Hough transform. Experimental results are presented and compared with those of several other skew detection methods.
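To make the Hough step concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the selected lowermost or uppermost character pixels have already been extracted as (x, y) points; the abstract does not say how characters are selected, so that step is left out. The function name `estimate_skew_hough` and the parameters `max_angle`, `angle_step` and `rho_step` are hypothetical names chosen for this illustration.

```python
import math
from collections import Counter


def estimate_skew_hough(points, max_angle=15.0, angle_step=0.1, rho_step=1.0):
    """Estimate the skew angle (degrees) of roughly collinear point sets.

    Assumptions (not from the source): `points` is a non-empty list of
    (x, y) pairs in a Cartesian frame with y growing upward. With image
    rows growing downward, the sign of the returned angle flips.
    """
    if not points:
        raise ValueError("points must not be empty")

    best_angle, best_votes = 0.0, -1
    n_steps = int(round(2 * max_angle / angle_step))
    for i in range(n_steps + 1):
        angle = -max_angle + i * angle_step
        t = math.radians(angle)
        cos_t, sin_t = math.cos(t), math.sin(t)
        # For a line inclined at `angle`, the offset y*cos(t) - x*sin(t) is
        # the same for every point on it, so such points vote for one bin.
        votes = Counter(
            round((y * cos_t - x * sin_t) / rho_step) for x, y in points
        )
        peak = max(votes.values())
        # Keep the angle whose accumulator row has the strongest peak.
        if peak > best_votes:
            best_angle, best_votes = angle, peak
    return best_angle


if __name__ == "__main__":
    # Synthetic baselines: three text lines tilted by 3 degrees.
    slope = math.tan(math.radians(3.0))
    demo = [(float(x), 100.0 * k + x * slope)
            for k in (1, 2, 3) for x in range(0, 601, 20)]
    print(estimate_skew_hough(demo))
```

Restricting the search to a small angular range around zero keeps the accumulator small, which is reasonable when scanned pages are only slightly tilted; the range and step sizes here are illustrative choices, not values from the letter.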
