Uyghur Text Localization with Fast Component Detection

Text localization in image often is an important part of image content analysis and has broad application prospects. Even though there have been many researches focus on it, fast Uyghur text localization in complex background images is still a challenging task. The obstacles mainly come from the huge extracted candidates and the heavy computation of non-text classification. In this paper, we propose a fast framework for Uyghur text localization which handle above obstacles with two effective measures. One is that we propose a stroke-specific detector based candidate extraction scheme. Compared with the common used I-MSER detector, the presented scheme not only produces 2 times less components but also runs in twice faster. The other is a component similarity based clustering is raised, which neither need the component-level classification nor the extra computations. The experiments confirm that our method has achieved the state-of-the-art on UICBI-500 benchmark dataset and runs in near real-time. The localization results also prove that the proposed method is robust to Chinese and English.

[1]  Hartmut Neven,et al.  PhotoOCR: Reading Text in Uncontrolled Conditions , 2013, 2013 IEEE International Conference on Computer Vision.

[2]  Jiri Matas,et al.  A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.

[3]  Xiaoyan Gu,et al.  Detecting Uyghur text in complex background images with convolutional neural network , 2017, Multimedia Tools and Applications.

[4]  Jiřı́ Matas,et al.  Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Chuan Zhou,et al.  Uyghur Language Text Detection in Complex Background Images Using Enhanced MSERs , 2017, MMM.

[6]  Xi Chen,et al.  Robust Uyghur Text Localization in Complex Background Images , 2016, PCM.

[7]  Jean-Michel Jolion,et al.  Object count/area graphs for the evaluation of object detection and segmentation algorithms , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[8]  Pan He,et al.  Detecting Text in Natural Image with Connectionist Text Proposal Network , 2016, ECCV.

[9]  Jiri Matas,et al.  Text Localization in Real-World Images Using Efficiently Pruned Exhaustive Search , 2011, 2011 International Conference on Document Analysis and Recognition.

[10]  Andrew Y. Ng,et al.  Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning , 2011, 2011 International Conference on Document Analysis and Recognition.

[11]  Shin'ichi Satoh,et al.  [Foreword] Welcome to the Special Section on Invited Papers of Media Technology and Applications , 2018 .

[12]  Kaizhu Huang,et al.  Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Xi Chen,et al.  Robust and parallel Uyghur text localization in complex background images , 2017, Machine Vision and Applications.

[14]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Sheng Tang,et al.  Robust common visual pattern discovery using graph matching , 2013, J. Vis. Commun. Image Represent..

[16]  Tom Drummond,et al.  Faster and Better: A Machine Learning Approach to Corner Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Yongdong Zhang,et al.  Local geometric consistency constraint for image retrieval , 2011, 2011 18th IEEE International Conference on Image Processing.

[18]  Yan Li,et al.  Uyghur Text Matching in Graphic Images for Biomedical Semantic Analysis , 2017, Neuroinformatics.

[19]  Shijian Lu,et al.  Text Flow: A Unified Text Detection System in Natural Scene Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[20]  Daijin Kim,et al.  Scene text detection with robust character candidate extraction method , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).