Multi-script Text Detection and Classification from Natural Scenes

Most of the text detection and script classification approaches from natural scenes only cater for a single script whereas text in natural scenes may come in various scripts. This research proposes a gestalt-based approach for multi-script text detection and classification based on human perception. Human perceptual organization is where humans are able to organize visual input into meaningful information. This approach is based on the figure-ground articulation where we perceive the figure or text as standing in front of the background. Features extracted from wavelet coefficients and MSER is used as input to SVM for text detection and script classification. Experimental results indicate that this approach is competitive with the state of the art text detection and script classification approaches.

[1]  Huadong Ma,et al.  Automatic Detection and Localization of Natural Scene Text in Video , 2010, 2010 20th International Conference on Pattern Recognition.

[2]  Huizhong Chen,et al.  Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.

[3]  Andreas Christmann,et al.  Support vector machine , 2013 .

[4]  S. Lucas,et al.  ICDAR 2003 robust reading competitions: entries, results, and future directions , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[5]  Jiri Matas,et al.  A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.

[6]  Jun Guo,et al.  Text extraction from natural scene image: A survey , 2013, Neurocomputing.

[7]  Patrick Kelly,et al.  Automatic Script Identification From Document Images Using Cluster-Based Templates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Jiri Matas,et al.  Text Localization in Real-World Images Using Efficiently Pruned Exhaustive Search , 2011, 2011 International Conference on Document Analysis and Recognition.

[9]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Yuanping Zhu,et al.  Improving Scene Text Detection by Scale-Adaptive Segmentation and Weighted CRF Verification , 2011, 2011 International Conference on Document Analysis and Recognition.

[11]  Tong Lu,et al.  A Robust Color-Independent Text Detection Method from Complex Videos , 2011, 2011 International Conference on Document Analysis and Recognition.

[12]  Palaiahnakote Shivakumara,et al.  A Robust Wavelet Transform Based Technique for Video Text Detection , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[13]  Dimosthenis Karatzas,et al.  Multi-script Text Extraction from Natural Scenes , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[14]  Debashis Ghosh,et al.  Script Recognition—A Review , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Edward J. Delp,et al.  A Low Complexity Sign Detection and Text Localization Method for Mobile Applications , 2011, IEEE Transactions on Multimedia.

[16]  Zhuowen Tu,et al.  Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .

[17]  Palaiahnakote Shivakumara,et al.  A novel mutual nearest neighbor based symmetry for text frame classification in video , 2011, Pattern Recognit..

[18]  Ching Y. Suen,et al.  Categorizing Document Images into Script and Language Classes , 1999 .

[19]  Chunluan Zhou,et al.  Arbitrary-Shape Object Localization Using Adaptive Image Grids , 2012, ACCV.

[20]  Palaiahnakote Shivakumara,et al.  Multi-oriented scene text detection in video based on wavelet and angle projection boundary growing , 2013, Multimedia Tools and Applications.

[21]  Wang Zhen,et al.  A Comparative Study of Feature Selection for SVM in Video Text Detection , 2009, 2009 Second International Symposium on Computational Intelligence and Design.

[22]  Ioannis Pratikakis,et al.  A two-stage scheme for text detection in video images , 2010, Image Vis. Comput..