Script Identification from Camera-Captured Multi-script Scene Text Components

Identification of script from multi-script text components of camera-captured images is an emerging research field. Here, challenges are mainly twofold: (1) typical challenges of camera-captured images like blur, uneven illumination, complex background, etc., and (2) challenges related to shape, size, and orientation of the texts written in different scripts. In this work, an effective set consisting of both shape-based and texture-based features is designed for script classification. An in-house scene text data set comprising 300 text boxes written in three scripts, namely Bangla, Devanagri, and Roman is prepared. Performance of this feature set is associated with five popular classifiers and highest accuracy of 90% is achieved with Multi-layer Perceptron (MLP) classifier, which is reasonably satisfactory considering the domain complexity.

[1]  Allen R. Hanson,et al.  Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Yao Li,et al.  Characterness: An Indicator of Text in the Wild , 2013, IEEE Transactions on Image Processing.

[3]  C. V. Jawahar,et al.  A Simple and Effective Solution for Script Identification in the Wild , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).

[4]  Jiřı́ Matas,et al.  Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Xiang Bai,et al.  An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Kai Wang,et al.  End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.

[7]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Feng Su,et al.  Scene Text Detection Based on Robust Stroke Width Transform and Deep Belief Network , 2014, ACCV.

[9]  Jun Zhang,et al.  Multi-Orientation Scene Text Detection with Adaptive Clustering , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Dimosthenis Karatzas,et al.  TextProposals: A text-specific selective search algorithm for word spotting in the wild , 2016, Pattern Recognit..

[11]  Umapada Pal,et al.  Multi-oriented text detection and verification in video frames and scene images , 2017, Neurocomputing.