Application of the wavelet and the Hough transform for detecting the skew angle in Arabic printed documents

This work presents a new technique for detecting the skew angle of document images, based on wavelet transform (WT) analysis and the Hough transform (HT), and applies it to Arabic printed documents. The Hough transform is an effective solution for skew angle detection, but it requires a large amount of memory and a long computing time. We therefore propose using the WT to reduce the number of points passed to the HT and, with it, the computing time. The technique relies principally on the high-frequency band of the wavelet transform. To evaluate the performance of the proposed method, a test set of 100 different documents was used. The results indicate that the proposed approach performs well: the skew angle is estimated accurately, and the computing time is substantially lower than that of the alternative methods. The technique has been applied to Arabic document images, but it can be generalized to any document.
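The idea of feeding only high-frequency wavelet coefficients to the Hough transform can be illustrated with a minimal sketch. It is not the authors' implementation: the choice of a one-level Haar wavelet, the detail band used, the relative threshold of 0.25, the angle range and step, and the sum-of-squared-votes peak criterion are all assumptions made for illustration, as are the function names `haar_horizontal_detail` and `estimate_skew`.

```python
import numpy as np


def haar_horizontal_detail(img):
    """One-level Haar detail band that responds to horizontal edges.

    Assumption: a plain Haar wavelet; the abstract does not name the wavelet.
    """
    img = np.asarray(img, dtype=float)
    h, w = img.shape
    img = img[: h - h % 2, : w - w % 2]  # crop to even dimensions
    # High-pass in the vertical direction: difference of vertically adjacent pixels.
    high = (img[0::2, :] - img[1::2, :]) / 2.0
    # Low-pass in the horizontal direction: average of horizontally adjacent values.
    return (high[:, 0::2] + high[:, 1::2]) / 2.0


def estimate_skew(img, max_angle=15.0, step=0.5, rel_threshold=0.25):
    """Estimate the skew angle in degrees.

    A positive angle means text lines descend to the right in array
    coordinates (row index grows with column index).
    """
    detail = np.abs(haar_horizontal_detail(img))
    # Keep only strong high-frequency coefficients: far fewer points than
    # the foreground pixels of the full-resolution image.
    ys, xs = np.nonzero(detail > rel_threshold * detail.max())
    if ys.size == 0:
        raise ValueError("no high-frequency points found")

    offset = int(np.ceil(np.hypot(*detail.shape)))  # shifts rho to be >= 0
    best_angle, best_score = 0.0, -1.0
    for angle in np.arange(-max_angle, max_angle + step / 2, step):
        a = np.deg2rad(angle)
        # Signed distance of each point from a line through the origin
        # whose direction makes the angle `a` with the column axis.
        rho = ys * np.cos(a) - xs * np.sin(a)
        votes = np.bincount(np.rint(rho).astype(int) + offset,
                            minlength=2 * offset + 1)
        # Assumed peak criterion: the accumulator column is most sharply
        # peaked when the candidate angle matches the text-line direction.
        score = float(np.sum(votes.astype(float) ** 2))
        if score > best_score:
            best_angle, best_score = float(angle), score
    return best_angle
```

Calling `estimate_skew(page)` on a binarized page stored as a 2-D array (text as 1, background as 0) returns the candidate angle with the most concentrated votes. Because the detail band has half the resolution of the input in each direction and only edge coefficients survive the threshold, the Hough stage processes a small fraction of the original points, which is the saving the abstract attributes to the wavelet step.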
