e-PCP: A robust skew detection method for scanned document images

We present here an enhanced algorithm (e-PCP) for skew detection in scanned documents, based on the work on Piecewise Covering by Parallelogram (PCP) for robust determination of skew angles [C.-H. Chou, S.-Y. Chu, F. Chang, Estimation of skew angles for scanned documents based on piecewise covering by parallelograms, Pattern Recognition 40 (2007) 443-455]. Our algorithm achieves even better robustness for detection of skew angle than the original PCP algorithm. We have shown accurate determination of skew angles in document images where the original PCP algorithm fails. Further, the increased robustness of performance is achieved with reduced number of computation compared to the originally proposed PCP algorithm. The e-PCP algorithm also outputs a confidence measure which is important in automated systems to filter cases where the estimated skew angle may not be very accurate and thus can be handled by manual intervention. The proposed algorithm was tested extensively on all categories of real time documents and comparisons with PCP method is also provided. Useful details regarding faster execution of the proposed algorithm is provided in Appendix.

[1]  Jun Sun,et al.  Skew detection using wavelet decomposition and projection profile analysis , 2007, Pattern Recognit. Lett..

[2]  Amandeep Kaur,et al.  Hough transform based fast skew detection and accurate skew correction methods , 2008, Pattern Recognit..

[3]  Azriel Rosenfeld,et al.  A method of detecting the orientation of aligned components , 1986, Pattern Recognit. Lett..

[4]  Hong Yan,et al.  Skew Correction of Document Images Using Interline Cross-Correlation , 1993, CVGIP Graph. Model. Image Process..

[5]  Yasuto Ishitani,et al.  Document skew detection based on local region complexity , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[6]  Bidyut Baran Chaudhuri,et al.  An improved document skew angle estimation technique , 1996, Pattern Recognit. Lett..

[7]  Xiaoyi Jiang,et al.  Skew detection of document images by focused nearest-neighbor clustering , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[8]  Chien-Hsing Chou,et al.  Estimation of skew angles for scanned documents based on piecewise covering by parallelograms , 2007, Pattern Recognit..

[9]  Nikos Fakotakis,et al.  Skew angle estimation for printed and handwritten documents using the Wigner-Ville distribution , 2002, Image Vis. Comput..

[10]  Chin-Chuan Han,et al.  A fast approach to the detection and correction of skew documents , 1997, Pattern Recognit. Lett..

[11]  Norihiro Hagita,et al.  Automated entry system for printed documents , 1990, Pattern Recognit..