The Keywords Spotting with Context for Multi -Oriented Chinese Scene Text

Scene text spotting is currently a popular research topic in the computer vision community. However, it is a challenging task due to the variations of texts and clutter backgrounds. In some situation, especially in the natural scene, it is not necessary to understand the whole text information in an image if we have acquired the key information. In this paper, we propose a bottom-to-up framework for keywords spotting and context extraction for multi-oriented Chinese in scene images. The proposed framework includes three modules which are Chinese character detection, keywords spotting and context extractor. We adopt the object detection methods to detect and recognize the Chinese text simultaneously in character-level. The geometric relationship between keywords and their context is analyzed, which is robust to multi-oriented text. The proposed framework classifies the valuable images, extracts possible keywords context and achieves a good performance in the CTW dataset.

[1]  Junjie Yan,et al.  FOTS: Fast Oriented Text Spotting with a Unified Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[3]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Wenyu Liu,et al.  TextBoxes: A Fast Text Detector with a Single Deep Neural Network , 2016, AAAI.

[6]  Zhe Zhu,et al.  Chinese Text in the Wild , 2018, ArXiv.

[7]  Pan He,et al.  Detecting Text in Natural Image with Connectionist Text Proposal Network , 2016, ECCV.

[8]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Huizhong Chen,et al.  Mobile visual search on printed documents using text and low bit-rate features , 2011, 2011 18th IEEE International Conference on Image Processing.

[10]  Xiang Bai,et al.  An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[12]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[14]  Shuchang Zhou,et al.  EAST: An Efficient and Accurate Scene Text Detector , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Azzedine Boukerche,et al.  Road-Sign Text Recognition Architecture for Intelligent Transportation Systems , 2014, 2014 IEEE 80th Vehicular Technology Conference (VTC2014-Fall).