Automated page orientation and skew angle detection for binary document images

Abstract We describe the development and implementation of algorithms for detecting the page orientation (portrait/landscape) and the degree of skew for documents available as binary images. A new and fast approach is advanced herein whereby skew angle detection takes advantage of information found using the page orientation algorithm. Page orientation is accomplished using local analysis, while skew angle detection is implemented based on the processing of pixels of last black run-lengths of binary image objects. The experiments carried out on a variety of medical journals show the feasibility of the new approach and indicate that detection accuracy can be improved by minimizing the effects of non-textual data.

[1]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[2]  Anil K. Jain Fundamentals of Digital Image Processing , 2018, Control of Color Imaging Systems.

[3]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  S.C. Hinds,et al.  A document skew detection method using run-length encoding and the Hough transform , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[5]  Norihiro Hagita,et al.  Automated entry system for printed documents , 1990, Pattern Recognit..

[6]  Daniel X. Le,et al.  Document skew-angle detection algorithm , 1993, Defense, Security, and Sensing.

[7]  Henry S. Baird,et al.  The skew angle of printed documents , 1995 .