Scene Character Recognition Using Coupled Spatial Learning

Feature representation, as a key component of scene character recognition, has been widely studied and a number of effective methods have been proposed. In this letter, we propose the novel method named coupled spatial learning (CSL) for scene character representation. Different from the existing methods, the proposed CSL method simultaneously discover the spatial context in both the dictionary learning and coding stages. Concretely, we propose to build the spatial dictionary by preserving the corresponding positions of the codewords. Correspondingly, we introduce the spatial coding strategy which utilizes the spatiality regularization to consider the relationship among features in the Euclidean space. Based on the spatial dictionary and spatial coding, the spatial context can be effectively integrated in the visual representations. We verify our method on two widely used databases (ICDAR2003 and Chars74k), and the experimental results demonstrate that our method achieves competitive results compared with the state-of-the-art methods. In addition, we further validate the proposed CSL method on the Caltech-101 database for image classification task, and the experimental results show the good generalization ability of the proposed CSL. key words: coupled spatial learning, feature representation, scene character recognition

[1]  Eli Shechtman,et al.  In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Tao Wang,et al.  End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[3]  Hanzi Wang,et al.  Scene text recognition using sparse coding based features , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[4]  Prateek Jain,et al.  Fast image search for learned metrics , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Hassan Foroosh,et al.  Character recognition in natural scene images using rank-1 tensor decomposition , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[7]  Chunheng Wang,et al.  Scene Text Recognition Using Structure-Guided Character Detection and Linguistic Knowledge , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Chunheng Wang,et al.  Action Recognition Using Context-Constrained Linear Coding , 2012, IEEE Signal Processing Letters.

[9]  Junichi Kanai,et al.  Character recognition , 1997 .

[10]  Florent Perronnin,et al.  Fisher Kernels on Visual Vocabularies for Image Categorization , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[12]  Shijian Lu,et al.  Multilingual scene character recognition with co-occurrence of histogram of oriented gradients , 2016, Pattern Recognit..

[13]  Andrew Zisserman,et al.  Deep Features for Text Spotting , 2014, ECCV.

[14]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.

[15]  Allen R. Hanson,et al.  Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Chunheng Wang,et al.  Stroke Detector and Structure Based Models for Character Recognition: A Comparative Study , 2015, IEEE Transactions on Image Processing.

[17]  Chunheng Wang,et al.  Scene Text Character Recognition Using Spatiality Embedded Dictionary , 2014, IEICE Trans. Inf. Syst..

[18]  Manik Varma,et al.  Character Recognition in Natural Images , 2009, VISAPP.

[19]  Chunheng Wang,et al.  Stroke Bank: A High-Level Representation for Scene Character Recognition , 2014, 2014 22nd International Conference on Pattern Recognition.

[20]  Simon M. Lucas,et al.  ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..