Smooth Stroke Width Transform for Text Detection

The stroke width transform (SWT) is a generic operation for detecting text in natural images, because characters intrinsically have an elongated shape of nearly uniform width. The edge pairing technique developed by Epshtein et al. is widely used for its simplicity and effectiveness. However, because natural images are noisy and edge pairing is sensitive to variations, it produces many artifacts that hinder the subsequent stages of text detection. This paper reformulates the SWT problem as a search for an optimal solution in a 3-D space. We present an effective search algorithm, called the aggregation approach, borrowed from the depth image reconstruction domain. Experiments show that the algorithm produces a smooth SWT map that is better suited to subsequent processing.
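To make the idea of aggregation in a 3-D space concrete, here is a minimal sketch modeled on local cost aggregation in stereo matching. It is not the paper's algorithm: the abstract does not specify the cost function, the axes of the 3-D space, or the aggregation window. The sketch assumes the space is (stroke width, y, x), that a per-pixel cost volume is already available, and that a box filter followed by a per-pixel minimum is an acceptable stand-in. The names `box_filter`, `smooth_swt`, `cost_volume`, `widths`, and `radius` are all hypothetical.

```python
import numpy as np


def box_filter(img, radius):
    """Mean over a (2*radius+1)^2 window, replicating edge pixels."""
    k = 2 * radius + 1
    padded = np.pad(img.astype(float), radius, mode="edge")
    # Summed-area table with a leading row and column of zeros.
    sat = np.zeros((padded.shape[0] + 1, padded.shape[1] + 1))
    sat[1:, 1:] = padded.cumsum(axis=0).cumsum(axis=1)
    h, w = img.shape
    total = (sat[k:k + h, k:k + w] - sat[:h, k:k + w]
             - sat[k:k + h, :w] + sat[:h, :w])
    return total / (k * k)


def smooth_swt(cost_volume, widths, radius=1):
    """Pick, per pixel, the candidate width with the lowest aggregated cost.

    cost_volume: array of shape (len(widths), H, W); lower means a better
                 match between that candidate width and the pixel
                 (assumed input, how it is computed is out of scope here).
    widths:      candidate stroke widths, one per slice of cost_volume.
    radius:      half-size of the aggregation window; 0 disables aggregation.
    """
    # Aggregate each width slice spatially so neighbors vote together.
    aggregated = np.stack([box_filter(s, radius) for s in cost_volume])
    # Winner-take-all along the width axis.
    best = aggregated.argmin(axis=0)
    return np.asarray(widths)[best]
```

With `radius=0` the function reduces to an independent per-pixel choice, so an isolated noisy pixel keeps its wrong width. With `radius=1` or larger, the neighbors' costs outweigh the outlier and the resulting map is smoother, which is the effect the abstract attributes to aggregation.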

[1]  Yang Liu,et al.  Text detection in natural scene with edge analysis , 2013, 2013 IEEE International Conference on Image Processing.

[2]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[3]  Zhuowen Tu,et al.  Detecting Texts of Arbitrary Orientations in Natural Images , 2012 .

[4]  Ruigang Yang,et al.  A Performance Study on Different Cost Aggregation Approaches Used in Real-Time Stereo Matching , 2007, International Journal of Computer Vision.

[5]  Yu Zhou,et al.  Text Detection in Natural Scene Images with Stroke Width Clustering and Superpixel , 2014, PCM.

[6]  B. S. Manjunath,et al.  Learning bottom-up text attention maps for text detection using stroke width transform , 2013, 2013 IEEE International Conference on Image Processing.

[7]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[8]  Wenyan Dong,et al.  Text Detection in Natural Images Using Localized Stroke Width Transform , 2015, MMM.

[9]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Weilin Huang,et al.  Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors , 2013, 2013 IEEE International Conference on Computer Vision.

[11]  Hyung Il Koo,et al.  Scene Text Detection via Connected Component Clustering and Nontext Filtering , 2013, IEEE Transactions on Image Processing.

[12]  Jing Zhang,et al.  Character Energy and Link Energy-Based Text Extraction in Scene Images , 2010, ACCV.

[13]  Kaizhu Huang,et al.  Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Chucai Yi,et al.  Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[15]  Nizar Bouguila,et al.  Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform , 2012, BMVC.

[16]  Jon Almazán,et al.  ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.