A Robust Approach for Scene Text Localization Using Rule-Based Confidence Map and Grouping

This study presents a robust algorithm to localize both Farsi/Arabic and Latin scene texts with different sizes, fonts and orientations even the low luminance contrast and poor quality ones. First, a new region detector is proposed to extract the candidate text regions. It is an integration of the weighted median filtering, contrast preserving decolorization and MSER techniques. It is robust to the low luminance contrast and poor quality scene texts. Afterwards, a novel method based on the fuzzy inference systems (FIS) is proposed to build a confidence map. This map indicates the likelihood of being text for the extracted candidate regions. Therefore it is exploited to filter the nontext candidates. Finally, a new fuzzy-based approach is proposed to create the single arbitrarily oriented text lines. It is based on the clustering, FIS, minimum area rectangle as well as radon transform techniques. It could also retrieve some of the discarded isolated characters or subwords in the filtering stage. To validate the proposed algorithm, we created a collection of natural images containing both Farsi/Arabic and Latin texts. Compared with the state-of-the-art methods, the proposed method achieves the best performance on our and Epshtein datasets and competitive performances on the ICDAR dataset.

[1]  Jun Guo,et al.  Text extraction from natural scene image: A survey , 2013, Neurocomputing.

[2]  Yao Li,et al.  Characterness: An Indicator of Text in the Wild , 2013, IEEE Transactions on Image Processing.

[3]  Tao Chen,et al.  Scene text extraction based on edges and support vector regression , 2015, International Journal on Document Analysis and Recognition (IJDAR).

[4]  Hyung Il Koo,et al.  Scene Text Detection via Connected Component Clustering and Nontext Filtering , 2013, IEEE Transactions on Image Processing.

[5]  Kaizhu Huang,et al.  Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Peter Xiaoping Liu,et al.  GcsDecolor: Gradient Correlation Similarity for Efficient Contrast Preserving Decolorization , 2015, IEEE Transactions on Image Processing.

[7]  Chucai Yi,et al.  Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[8]  Yi Wu,et al.  A robust hybrid method for text detection in natural scenes by learning-based partial differential equations , 2015, Neurocomputing.

[9]  Jorge Stolfi,et al.  SnooperText: A text detection system for automatic indexing of urban scenes , 2014, Comput. Vis. Image Underst..

[10]  Xiang Bai,et al.  Scene text detection and recognition: recent advances and future trends , 2015, Frontiers of Computer Science.

[11]  Yong Zhang,et al.  Text string detection for loosely constructed characters with arbitrary orientations , 2015, Neurocomputing.

[12]  M. Moradi,et al.  Hybrid approach for Farsi/Arabic text detection and localisation in video frames , 2013, IET Image Process..

[13]  Gernot A. Fink,et al.  Camera-Based Whiteboard Reading for Understanding Mind Maps , 2015, Int. J. Pattern Recognit. Artif. Intell..

[14]  Karim Faez,et al.  Robust Localization of Texts in Real-World Images , 2015, Int. J. Pattern Recognit. Artif. Intell..

[15]  Yingli Tian,et al.  Localizing Text in Scene Images by Boundary Clustering, Stroke Segmentation, and String Fragment Classification , 2012, IEEE Transactions on Image Processing.

[16]  Ming Zhao,et al.  Text detection in images using sparse representation with discriminative dictionaries , 2010, Image Vis. Comput..

[17]  Asif Masood,et al.  A Caption Text Detection Method from Images/Videos for Efficient Indexing and Retrieval of Multimedia Data , 2015, Int. J. Pattern Recognit. Artif. Intell..

[18]  Jorge Stolfi,et al.  T-HOG: An effective gradient-based descriptor for single line text regions , 2013, Pattern Recognit..

[19]  Lei Sun,et al.  A robust approach for text detection from natural scene images , 2015, Pattern Recognit..

[20]  Xinguo Yu,et al.  Reading Digital Video Clocks , 2015, Int. J. Pattern Recognit. Artif. Intell..

[21]  Matthieu Cord,et al.  Text detection in street level images , 2013, Pattern Analysis and Applications.

[22]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[23]  Chunheng Wang,et al.  Scene text detection using graph model built upon maximally stable extremal regions , 2013, Pattern Recognit. Lett..

[24]  Wenyu Liu,et al.  A Unified Framework for Multioriented Text Detection and Recognition , 2014, IEEE Transactions on Image Processing.

[25]  Karim Faez,et al.  Localizing scene texts by fuzzy inference systems and low rank matrix recovery model , 2016, Comput. Vis. Image Underst..

[26]  Cheng-Lin Liu,et al.  A Hybrid Approach to Detect and Localize Texts in Natural Scene Images , 2011, IEEE Transactions on Image Processing.

[27]  Luis Miguel Bergasa,et al.  A text reading algorithm for natural images , 2013, Image Vis. Comput..

[28]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[29]  Mohammad Rahmati,et al.  A Hybrid Approach to Localize Farsi Text in Natural Scene Images , 2012, INNS-WC.

[30]  Changxin Gao,et al.  Text detection approach based on confidence map and context information , 2015, Neurocomputing.