A novel approach for skew estimation of document images in OCR system

Optical character recognition (OCR) is an area which has always received special attention. OCR systems are typically built on the strategy of divide and conquer, rather than recognizing documents at one go. They utilize several stages during the course of recognition. There have been many stages in a typical OCR system, preprocessing stage in considered to be indispensable. An input image or information need to be normalized and converted into format acceptable by OCR system. OCR systems typically assume that documents were printed with a single direction of the text and that the acquisition process did not introduce a relevant skew. Practically this assumption is not very strong and printed document could be skewed at some angle with horizontal axis. In this paper, we have proposed a new technique for skew estimation of image document. In the proposed scheme, multiscale properties of an image are utilized together with principal component analysis to estimate the orientation of principal axis of clustered data.

[1]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Muhammad Sarfraz,et al.  Offline Arabic text recognition system , 2003, 2003 International Conference on Geometric Modeling and Graphics, 2003. Proceedings.

[3]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[4]  Nikos Fakotakis,et al.  Skew angle estimation in document processing using Cohen's class distributions , 1999, Pattern Recognit. Lett..

[5]  Adnan Amin,et al.  Page Segmentation and Classification Utilizing Bottom-Up Approach , 2001, Int. J. Image Graph..

[6]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[7]  Jean-Michel Poggi,et al.  Micronde: a Matlab Wavelet Toolbox for Signals and Images , 1995 .

[8]  Sung-Bae Cho,et al.  A data reduction method for efficient document skew estimation based on Hough transformation , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[9]  Huang Jian Offline Arabic character recognition system , 2003 .

[10]  Robert M. Haralick,et al.  An automatic algorithm for text skew estimation in document images using recursive morphological transforms , 1994, Proceedings of 1st International Conference on Image Processing.

[11]  Matti Pietikäinen,et al.  Robust skew estimation on low-resolution document images , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[12]  Lindsay I. Smith,et al.  A tutorial on Principal Components Analysis , 2002 .

[13]  Jonathan J. Hull Document Image skew Detection: Survey and Annotated Bibliography , 1996, DAS.

[14]  Yue Lu,et al.  Improved nearest neighbor based approach to accurate document skew estimation , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[15]  Norihiro Hagita,et al.  Automated entry system for printed documents , 1990, Pattern Recognit..

[16]  Palaiahnakote Shivakumara,et al.  Skew estimation of binary document images using static and dynamic thresholds useful for document image mosaicing. , 2003 .

[17]  Neil W. Bergmann,et al.  Implementation of a statistical based Arabic character recognition system , 1997, TENCON '97 Brisbane - Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162).