Text detection approach based on confidence map and context information

Abstract Text information plays a significant role in many applications for providing more descriptive and abstract information than other objects. In this paper, an approach based on the confidence map and context information is proposed to robustly detect texts in natural scenes. Most of the conventional methods design sophisticated texture features to describe the text regions, while we focus on building a confidence map model by integrating the seed candidate appearance and the relationships with its adjacent candidates to highlight the texts from the backgrounds, and the candidates with low confidence value will be removed. In order to improve the recall rate, the text context information is adopted to regain the missing text regions. Finally, the text lines are formed and further verified, and the words are obtained by calculating the threshold to separate the intra-word letters from the inter-word letters. Experimental results on the three public benchmark datasets, i.e., ICDAR 2005, ICDAR 2011 and ICDAR 2013, show that the proposed approach has achieved the competitive performances by comparing with the other state-of-the-art methods.

[1]  Jean-Michel Jolion,et al.  Object count/area graphs for the evaluation of object detection and segmentation algorithms , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[2]  Matthieu Cord,et al.  Text segmentation in natural scenes using Toggle-Mapping , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[3]  Chew Lim Tan,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .

[4]  Jiri Matas,et al.  A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.

[5]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[6]  Jiřı́ Matas,et al.  Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yi-Ping Yang,et al.  Locating text based on connected component and SVM , 2007, 2007 International Conference on Wavelet Analysis and Pattern Recognition.

[8]  Jun Guo,et al.  Text extraction from natural scene image: A survey , 2013, Neurocomputing.

[9]  Hyung Il Koo,et al.  Scene Text Detection via Connected Component Clustering and Nontext Filtering , 2013, IEEE Transactions on Image Processing.

[10]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[11]  Cheng-Lin Liu,et al.  A Hybrid Approach to Detect and Localize Texts in Natural Scene Images , 2011, IEEE Transactions on Image Processing.

[12]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Tao Wang,et al.  End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[14]  Yonghong Song,et al.  Text Detection in Natural Scenes with Salient Region , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[15]  Jiri Matas,et al.  Text Localization in Real-World Images Using Efficiently Pruned Exhaustive Search , 2011, 2011 International Conference on Document Analysis and Recognition.

[16]  Nong Sang,et al.  A hybrid approach for text detection in natural scenes , 2013, Other Conferences.

[17]  Jing Zhang,et al.  Text Detection Using Edge Gradient and Graph Spectrum , 2010, 2010 20th International Conference on Pattern Recognition.

[18]  SongYi-Zhe,et al.  Text extraction from natural scene image , 2013 .

[19]  Kaizhu Huang,et al.  Accurate and robust text detection: a step-in for text retrieval in natural scene images , 2013, SIGIR.

[20]  Xu-Cheng Yin,et al.  Effective text localization in natural scene images with MSER, geometry-based grouping and AdaBoost , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[21]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Palaiahnakote Shivakumara,et al.  Accurate video text detection through classification of low and high contrast images , 2010, Pattern Recognit..

[23]  Chunheng Wang,et al.  Scene text detection using graph model built upon maximally stable extremal regions , 2013, Pattern Recognit. Lett..

[24]  Jorge Stolfi,et al.  T-HOG: An effective gradient-based descriptor for single line text regions , 2013, Pattern Recognit..

[25]  Tatiana Novikova,et al.  Image Binarization for End-to-End Text Understanding in Natural Images , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[26]  Wen Gao,et al.  Fast and robust text detection in images and video frames , 2005, Image Vis. Comput..

[27]  Ujjwal Bhattacharya,et al.  Scene text detection using sparse stroke information and MLP , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[28]  Lionel Prevost,et al.  2009 10th International Conference on Document Analysis and Recognition Text Detection and Localization in Complex Scene Images using Constrained AdaBoost Algorithm , 2022 .

[29]  Simon M. Lucas,et al.  ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[30]  Matthieu Cord,et al.  Text detection in street level images , 2013, Pattern Analysis and Applications.

[31]  Andreas Dengel,et al.  ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[32]  Kaizhu Huang,et al.  Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Jiri Matas,et al.  Scene Text Localization and Recognition with Oriented Stroke Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[34]  Weilin Huang,et al.  Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors , 2013, 2013 IEEE International Conference on Computer Vision.

[35]  Chucai Yi,et al.  Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[36]  Yingli Tian,et al.  Text extraction from scene images by character appearance and structure modeling , 2013, Comput. Vis. Image Underst..

[37]  Luis Miguel Bergasa,et al.  Text location in complex images , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[38]  Jon Almazán,et al.  ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[39]  Yao Li,et al.  Leveraging surrounding context for scene text detection , 2013, 2013 IEEE International Conference on Image Processing.

[40]  Xinbo Gao,et al.  Chinese text location under complex background using Gabor filter and SVM , 2011, Neurocomputing.

[41]  Huchuan Lu,et al.  Scene text detection via stroke width , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[42]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..