Text detection in natural scene images using morphological component analysis and Laplacian dictionary

Text in natural scene images usually carries abundant semantic information. However, due to variations of text and complexity of background, detecting text in scene images becomes a critical and challenging task. In this paper, we present a novel method to detect text from scene images. Firstly, we decompose scene images into background and text components using morphological component analysis ( MCA ) , which will reduce the adverse effects of complex backgrounds on the detection results. In order to improve the performance of image decomposition, two discriminative dictionaries of background and text are learned from the training samples. Moreover, Laplacian sparse regularization is introduced into our proposed dictionary learning method which improves discrimination of dictionary. Based on the text dictionary and the sparse-representation coefficients of text, we can construct the text component. After that, the text in the query image can be detected by applying certain heuristic rules. The results of experiments show the effectiveness of the proposed method.

[1]  Xu-Cheng Yin,et al.  Robust Text Detection in Natural Scene Images. , 2014, IEEE transactions on pattern analysis and machine intelligence.

[2]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[3]  Jun Zhang,et al.  Multi-Orientation Scene Text Detection with Adaptive Clustering , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Hao Wang,et al.  Character-like region verification for extracting text in scene images , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[5]  Jun Huang,et al.  Text detection and restoration in natural scene images , 2007, J. Vis. Commun. Image Represent..

[6]  Ching Y. Suen,et al.  Text detection from scene images using sparse representation , 2008, 2008 19th International Conference on Pattern Recognition.

[7]  Gang Zhou,et al.  Detecting multilingual text in natural scene , 2011, 2011 1st International Symposium on Access Spaces (ISAS).

[8]  Rongrong Ji,et al.  Directional correlation analysis of local Haar binary pattern for text detection , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[9]  Lorenzo Rosasco,et al.  Iterative Projection Methods for Structured Sparsity Regularization , 2009 .

[10]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[11]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Gang Zhou,et al.  Scene text detection with superpixels and hierarchical model , 2012, 2012 19th IEEE International Conference on Image Processing.

[13]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[14]  Michael Elad,et al.  Sparse Representation for Color Image Restoration , 2008, IEEE Transactions on Image Processing.

[15]  Shijian Lu,et al.  Camera Text Recognition based on Perspective Invariants , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[16]  Ming Zhao,et al.  Text detection in images using sparse representation with discriminative dictionaries , 2010, Image Vis. Comput..

[17]  Huizhong Chen,et al.  Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.

[18]  Cheng-Lin Liu,et al.  A Hybrid Approach to Detect and Localize Texts in Natural Scene Images , 2011, IEEE Transactions on Image Processing.

[19]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Xinbo Gao,et al.  Chinese text location under complex background using Gabor filter and SVM , 2011, Neurocomputing.

[21]  Kongqiao Wang,et al.  Character location in scene images from digital camera , 2003, Pattern Recognit..

[22]  Lei Sun,et al.  A robust approach for text detection from natural scene images , 2015, Pattern Recognit..

[23]  Kongqiao Wang,et al.  An Improved Scene Text Extraction Method Using Conditional Random Field and Optical Character Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[24]  Haifeng Hu,et al.  Local robust sparse representation for face recognition with single sample per person , 2018, IEEE/CAA Journal of Automatica Sinica.

[25]  Yao Li,et al.  Characterness: An Indicator of Text in the Wild , 2013, IEEE Transactions on Image Processing.

[26]  Jin Hyung Kim,et al.  Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Gueesang Lee,et al.  Text localization in natural scene images by mean-shift clustering and parallel edge feature , 2011, ICUIMC '11.

[28]  Palaiahnakote Shivakumara,et al.  A new Histogram Oriented Moments descriptor for multi-oriented moving text detection in video , 2015, Expert Syst. Appl..

[29]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[30]  E.J. Candes Compressive Sampling , 2022 .

[31]  José M. Bioucas-Dias,et al.  A New TwIST: Two-Step Iterative Shrinkage/Thresholding Algorithms for Image Restoration , 2007, IEEE Transactions on Image Processing.

[32]  Jon Almazán,et al.  ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[33]  David Zhang,et al.  Sparse Representation Based Fisher Discrimination Dictionary Learning for Image Classification , 2014, International Journal of Computer Vision.

[34]  Yong Zhang,et al.  Text string detection for loosely constructed characters with arbitrary orientations , 2015, Neurocomputing.

[35]  Sudeep Sarkar,et al.  Robust outdoor text detection using text intensity and shape features , 2008, 2008 19th International Conference on Pattern Recognition.

[36]  Zhuowen Tu,et al.  Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .

[37]  Salvatore Tabbone,et al.  Text/graphic separation using a sparse representation with multi-learned dictionaries , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[38]  S. Angadi,et al.  A Texture Based Methodology for Text Region Extraction from Low Resolution Natural Scene Images , 2009 .

[39]  D. Donoho,et al.  Redundant Multiscale Transforms and Their Application for Morphological Component Separation , 2004 .

[40]  Wu Guo-rong,et al.  Using Connected-Components' Features to Detect and Segment Text , 2006 .

[41]  Changxin Gao,et al.  Text detection approach based on confidence map and context information , 2015, Neurocomputing.

[42]  Zheng-ping Hu,et al.  A modular weighted sparse representation based on Fisher discriminant and sparse residual for face recognition with occlusion , 2015, Inf. Process. Lett..

[43]  Yilong Yin,et al.  Fractional-order sparse representation for image denoising , 2018, IEEE/CAA Journal of Automatica Sinica.