Scene text detection using sequential nontext filtering

We present a scene text detection method based on sequential nontext filtering. First, candidate components are extracted by multi-channel maximally stable extremal region (MSER) detection. Nontext components are then eliminated by a four-stage sequential filtering strategy consisting of inner-channel MSER pruning, between-channel MSER pruning, unary feature-based nontext filtering, and binary feature-based nontext filtering. Finally, the remaining text components are grouped into words and false positives are eliminated. The proposed method achieves state-of-the-art performance on the ICDAR 2013 database when compared with several existing methods.
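The four-stage cascade can be illustrated with a minimal sketch. This is not the authors' implementation. The `Component` record, the overlap-based pruning rule, the aspect-ratio test, the neighbor-compatibility test, and every threshold are hypothetical stand-ins chosen only to show how cheap filters run in sequence on the candidates that survive earlier stages. Real MSER detection is omitted, and boxes are assumed to have positive width and height.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Component:
    box: Tuple[int, int, int, int]  # (x, y, width, height), all assumed > 0 for w, h
    channel: str                    # color channel the candidate came from
    score: float                    # hypothetical stability score (higher is better)


Stage = Callable[[List[Component]], List[Component]]


def iou(a: Component, b: Component) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ax, ay, aw, ah = a.box
    bx, by, bw, bh = b.box
    iw = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    ih = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = iw * ih
    union = aw * ah + bw * bh - inter
    return inter / union if union else 0.0


def prune_duplicates(comps: List[Component], same_channel: bool,
                     thresh: float = 0.8) -> List[Component]:
    """Among heavily overlapping candidates, keep the highest-scoring one.

    same_channel=True compares only candidates from the same channel
    (inner-channel pruning); False compares only across channels.
    """
    kept: List[Component] = []
    for c in sorted(comps, key=lambda c: c.score, reverse=True):
        clash = any((k.channel == c.channel) == same_channel
                    and iou(k, c) >= thresh for k in kept)
        if not clash:
            kept.append(c)
    return kept


def unary_filter(comps: List[Component]) -> List[Component]:
    """Toy single-component test: reject extreme aspect ratios."""
    return [c for c in comps if 0.1 <= c.box[2] / c.box[3] <= 10]


def binary_filter(comps: List[Component]) -> List[Component]:
    """Toy pairwise test: keep a candidate only if a similar one is nearby."""
    def compatible(a: Component, b: Component) -> bool:
        ha, hb = a.box[3], b.box[3]
        return (a is not b
                and max(ha, hb) / min(ha, hb) <= 2
                and abs(a.box[0] - b.box[0]) <= 3 * max(ha, hb))
    return [c for c in comps if any(compatible(c, o) for o in comps)]


def run_cascade(comps: List[Component], stages: List[Stage]):
    """Apply stages in order; also return the candidate count after each."""
    counts = [len(comps)]
    for stage in stages:
        comps = stage(comps)
        counts.append(len(comps))
    return comps, counts


STAGES: List[Stage] = [
    lambda cs: prune_duplicates(cs, same_channel=True),   # inner-channel
    lambda cs: prune_duplicates(cs, same_channel=False),  # between-channel
    unary_filter,
    binary_filter,
]
```

Calling `run_cascade(candidates, STAGES)` returns the surviving components together with the per-stage counts, which makes it easy to see how many candidates each filter removes. In this sketch the pairwise test runs last, so it only sees candidates that passed the cheaper single-component checks.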
