Robust seed-based stroke width transform for text detection in natural images

Text detection in natural scene images is challenging due to the significant variations of the appearance of the text itself and its interaction with the context. The popular stroke width transform (SWT) algorithm is highly efficient but sensitive to the defects of the edges extracted from the input image when searching for the matching edge pixels for computing potential stroke width. In this paper, we propose a novel seed-based variant of SWT that enhances significantly the robustness of the original algorithm to complicated image contextual interference and varied text appearance. We first search for the seed segment of strokes, which is defined as a consecutive sequence of neighbouring rays (pairs of edge pixels) with regular length and satisfying certain constraints, and grow from them to localize more stroke segments. We then exploit the principal width and direction information of stroke captured by the stroke segments detected to rectify inaccurate stroke width and recover missed stroke parts, which are resulted from erroneous and noisy edges in complex natural images. The stroke segments detected are also exploited to improve the accuracy of candidate character localization. The experimental results on public datasets demonstrated the effectiveness of the proposed method.

[1]  Jing Zhang,et al.  A Novel Text Detection System Based on Character and Link Energies , 2014, IEEE Transactions on Image Processing.

[2]  Hyung Il Koo,et al.  Scene Text Detection via Connected Component Clustering and Nontext Filtering , 2013, IEEE Transactions on Image Processing.

[3]  Jiri Matas,et al.  Scene Text Localization and Recognition with Oriented Stroke Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[4]  Kai Wang,et al.  End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.

[5]  Feng Su,et al.  Scene Text Detection Based on Robust Stroke Width Transform and Deep Belief Network , 2014, ACCV.

[6]  C. V. Jawahar,et al.  Top-down and bottom-up cues for scene text recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Andreas Dengel,et al.  ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[8]  Yang Liu,et al.  Text detection in natural scene with edge analysis , 2013, 2013 IEEE International Conference on Image Processing.

[9]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Jiřı́ Matas,et al.  Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Huchuan Lu,et al.  Scene text detection via stroke width , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).