An Adaptive Script-Independent Block-Based Text Line Extraction

In this paper, a novel script-independent, block-based text line extraction technique is proposed for multi-skewed document images. Three parameters are defined to adapt the method to different kinds of writing. Extensive experiments on different datasets demonstrate that the proposed algorithm outperforms previous methods.
