A new method based on bag of filters for character recognition in scene images by learning

Achieving a good recognition rate for scene characters is a big challenge due to non-uniform illumination effects, perspective distortions, multiple colors or contrasts, different fonts and their various sizes, background or orientation variations, etc. Unlike the existing recognition methods that use binary information or the features extracted from different domains, the proposed method explores gray information in the form of a filter bank to extract the discriminative power for all the 62 scene character classes. We propose a sliding window (patch) operation over a character image for learning the global features, which represent the structures of character images of all the classes by reconstructing a filter bank from the original data. We introduce shareable constrains to activate class-specific filters from the filter bank. Further, we propose constraints by studying the nearest neighbor patches and exemplar selection to maximize the gap between inter-classes and minimize the gap between intra-classes. The method is evaluated and compared with several existing recognition methods in terms of character recognition rate. Experimental results show that the proposed method outperforms the existing methods.

[1]  Jorge Stolfi,et al.  SnooperText: A text detection system for automatic indexing of urban scenes , 2014, Comput. Vis. Image Underst..

[2]  Sajid Saleem,et al.  Recognition of degraded ancient characters based on dense SIFT , 2014, DATeCH '14.

[3]  Andrew Y. Ng,et al.  Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning , 2011, 2011 International Conference on Document Analysis and Recognition.

[4]  Yingli Tian,et al.  Scene Text Recognition in Mobile Applications by Character Descriptor and Structure Configuration , 2014, IEEE Transactions on Image Processing.

[5]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[7]  Robert Sablatnig,et al.  End-to-End Text Recognition Using Local Ternary Patterns, MSER and Deep Convolutional Nets , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[8]  William G. Faris,et al.  Reliable evaluation of neural networks , 1991, Neural Networks.

[9]  Jean Ponce,et al.  Learning mid-level features for recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Rui Wang,et al.  Scene Text Segmentation via Inverse Rendering , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[11]  Palaiahnakote Shivakumara,et al.  Recognizing Text with Perspective Distortion in Natural Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[12]  Shijian Lu,et al.  Scene Text Recognition Using Co-occurrence of Histogram of Oriented Gradients , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[13]  Tao Wang,et al.  End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[14]  C. V. Jawahar,et al.  Top-down and bottom-up cues for scene text recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Bo Xu,et al.  Chinese Image Text Recognition on grayscale pixels , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Jiri Matas,et al.  A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.

[17]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[18]  Chunheng Wang,et al.  Scene Text Recognition Using Structure-Guided Character Detection and Linguistic Knowledge , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Rong Huang,et al.  Scene Character Detection and Recognition with Cooperative Multiple-Hypothesis Framework , 2013, IEICE Trans. Inf. Syst..

[20]  Wenyu Liu,et al.  A Unified Framework for Multioriented Text Detection and Recognition , 2014, IEEE Transactions on Image Processing.

[21]  Palaiahnakote Shivakumara,et al.  Scene Character Reconstruction through Medial Axis , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[22]  Gang Wang,et al.  Learning Discriminative and Shareable Features for Scene Classification , 2014, ECCV.

[23]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).