A New Method for Text-Line Segmentation for Warped Documents

Bound documents, whether scanned or captured with digital cameras, often exhibit a geometrical warp that curls the text lines. Identifying the text lines is one of the steps in document de-warping when only a single image is available. This paper presents a new method for text-line segmentation. It builds on the simple but effective skew detector proposed by Avila-Lins and simplifies the coupled-snakes idea introduced by Bukhari into a moving parallel-line regression. The proposed method performed better than the best of the similar algorithms in the literature.
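As a rough illustration of the regression idea only, the sketch below fits ordinary least-squares lines inside a window that slides along a curled text line. It is not the paper's algorithm: the function names `fit_line` and `moving_line_fit`, the `window` and `step` parameters, and the assumption that a text line is already available as a list of (x, y) points are all hypothetical, and the coupling of parallel lines is not modelled.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def fit_line(points: List[Point]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = a*x + b; returns (a, b)."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        raise ValueError("all points share the same x; slope undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    a = sxy / sxx
    return a, mean_y - a * mean_x


def moving_line_fit(points: List[Point], window: float,
                    step: float) -> List[Tuple[float, float, float]]:
    """Slide a window along x and fit one line per window position.

    Returns a list of (window_start_x, slope, intercept) tuples.
    Hypothetical helper: window and step sizes are illustrative choices.
    """
    if not points:
        return []
    pts = sorted(points)
    x_min, x_max = pts[0][0], pts[-1][0]
    segments = []
    x0 = x_min
    while x0 <= x_max:
        inside = [p for p in pts if x0 <= p[0] < x0 + window]
        # Skip windows that cannot define a slope.
        if len({x for x, _ in inside}) >= 2:
            a, b = fit_line(inside)
            segments.append((x0, a, b))
        x0 += step
    return segments


if __name__ == "__main__":
    # Synthetic curled baseline: y grows quadratically with x.
    curled = [(float(x), 0.01 * x * x) for x in range(100)]
    for start, slope, intercept in moving_line_fit(curled, 20.0, 20.0):
        print(f"x0={start:5.1f}  slope={slope:.3f}  intercept={intercept:.3f}")
```

On a curled line the local slope changes from window to window, which is what a single global skew estimate cannot capture; the window width trades robustness to noise against fidelity to the curl.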

[1] Syed Saqib Bukhari, et al. Ridges Based Curled Textline Region Detection from Grayscale Camera-Captured Document Images, 2009, CAIP.

[2] Wenxin Li, et al. A Model-based Book Dewarping Method Using Text Line Detection, 2007.

[3] Syed Saqib Bukhari, et al. Textline information extraction from grayscale camera-captured document images, 2009, 16th IEEE International Conference on Image Processing (ICIP).

[4] Ioannis Pratikakis, et al. A Two-Step Dewarping of Camera Document Images, 2008, The Eighth IAPR International Workshop on Document Analysis Systems.

[5] Syed Saqib Bukhari, et al. Coupled Snakelet Model for Curled Textline Segmentation of Camera-Captured Document Images, 2009, 10th International Conference on Document Analysis and Recognition.

[6] L. M. Mestetskiy, et al. Usage of continuous skeletal image representation for document images de-warping, 2007.

[7] Rafael Dueire Lins, et al. A fast orientation and skew detection algorithm for monochromatic document images, 2005, DocEng '05.

[8] Rafael Dueire Lins, et al. Correcting Book Binding Distortion in Scanned Documents, 2010, ICIAR.

[9] P. Diehl, et al. Least-Squares Fitting, 1972.

[10] Thomas M. Breuel, et al. Performance Evaluation and Benchmarking of Six-Page Segmentation Algorithms, 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Syed Saqib Bukhari, et al. Segmentation of Curled Textlines Using Active Contours, 2008, The Eighth IAPR International Workshop on Document Analysis Systems.