论文信息 - Grayscale-Projection Based Optimal Character Segmentation for Camera-Captured Faint Text Recognition

Grayscale-Projection Based Optimal Character Segmentation for Camera-Captured Faint Text Recognition

The faint text document images possess shallow characters inherently and the camera-captured form introduces more degradations such as low-resolution, non-uniform illumination and out-of-focus blur, which make the text binarization very difficult. In this paper, we propose a grayscale-projection based optimal character segmentation method for camera-captured faint text recognition. Instead of extracting the character candidates, we use the gradient projection to extract a series of segmentation candidates which contain inter-character gaps and intra-character gaps as well. In order to select the optimal segmentation path from all possible situations, we construct a segmentation tree and set a evaluation score for each path. The score integrates the information of single point projection, overall distribution and recognition probability. Finally the optimal segmentation path is obtained by selecting the path with the highest score. We collect a faint text recognition dataset and evaluate our method on it. Experimental results show that our method outperforms the binary-projection method and the convolutional recurrent neural network approach in terms of text segmentation and recognition accuracy.

Chunheng Wang | Baihua Xiao | Cunzhao Shi | Yanna Wang | Fuxi Jia

[1] David G. Lowe,et al. Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[2] I. Kaneko,et al. Character segmentation of address reading/letter sorting machine for the ministry of posts and telecommunications of Japan , 1993 .

[3] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[4] Hartmut Neven,et al. PhotoOCR: Reading Text in Uncontrolled Conditions , 2013, 2013 IEEE International Conference on Computer Vision.

[5] Xiang Bai,et al. An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Kai Wang,et al. End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.

[7] Palaiahnakote Shivakumara,et al. A New Gradient Based Character Segmentation Method for Video Text Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[8] Francine Chen,et al. SmartDCap: semi-automatic capture of higher quality document images from a smartphone , 2013, IUI '13.

[9] Rajjan Shinghal,et al. An Algorithm for Segmenting Handwritten Postal Codes , 1990, Int. J. Man Mach. Stud..

[10] Seong-Whan Lee,et al. A New Methodology for Gray-Scale Character Segmentation and Recognition , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[11] Wenyu Liu,et al. Strokelets: A Learned Multi-scale Representation for Scene Text Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Ernest Valveny,et al. Word Spotting and Recognition with Embedded Attributes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[14] Giovanni Seni,et al. External word segmentation of off-line handwritten text lines , 1994, Pattern Recognit..

[15] N. Otsu. A threshold selection method from gray level histograms , 1979 .

[16] Raja Bala,et al. Mobile Video Capture of Multi-page Documents , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[17] Wayne Niblack,et al. An introduction to digital image processing , 1986 .

[18] Alan L. Yuille,et al. Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[19] Eric Lecolinet,et al. A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[20] Matti Pietikäinen,et al. Adaptive document image binarization , 2000, Pattern Recognit..

[21] Shijian Lu,et al. Accurate Scene Text Recognition Based on Recurrent Neural Network , 2014, ACCV.