Handwritten text documents binarization and skew normalization approaches

Handwritten text recognition has been an active research area for many years. Handwritten text recognition needs to perform some preprocessing steps for better recognition. First, we find binary image of given handwritten text document and then after performing the line segmentation task, we normalize it to the segmented lines. There are various normalization tasks such as skew normalization, slant normalization and size normalization. This paper, focuses on the handwritten document binarization and skew normalization and proposes a novel global binarization approach, which is very cost effective. We also propose a new skew normalization approach which is based on orthogonal projection of the segmented line with respect to x-axis. The method has been experimented on various styles of handwritten text documents, and it is found that it detects the exact skew angle, and corrects it efficiently. A comparative study has also been reported to provide a detailed analysis of the proposed methods together with some other existing methods in the literature.

[1]  A. Harvey,et al.  Skew detection in handwritten scripts , 1997, TENCON '97 Brisbane - Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162).

[2]  Muhammad Sarfraz,et al.  On Skew Estimation and Correction of Text , 2007, Computer Graphics, Imaging and Visualisation (CGIV 2007).

[3]  Venu Govindaraju,et al.  Analysis of textual images using the Hough transform , 1989, Machine Vision and Applications.

[4]  Victor Wu Document Image Clean-up and Binarization , 1998 .

[5]  Sargur N. Srihari,et al.  Document Image Binarization Based on Texture Features , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  A. Trotta,et al.  Application of Wigner-Ville distribution to measurements on transient signals , 1993, 1993 IEEE Instrumentation and Measurement Technology Conference.

[7]  Jonathan J. Hull Document Image skew Detection: Survey and Annotated Bibliography , 1996, DAS.

[8]  M. Sarfraz,et al.  Skew Estimation and Correction of Text Using Bounding Box , 2008, 2008 Fifth International Conference on Computer Graphics, Imaging and Visualisation.

[9]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[10]  B. Kapralos,et al.  I An Introduction to Digital Image Processing , 2022 .

[11]  R. Manmatha,et al.  Document image cleanup and binarization , 1998, Electronic Imaging.

[12]  Andrew D. Bagdanov,et al.  Projection profile based skew estimation algorithm for JBIG compressed images , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[13]  Venu Govindaraju,et al.  Skew detection for complex document images using fuzzy runlength , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..