Text Extraction in Video and Images: A Review

Text extraction from video and images has important application value. The big data era has created an urgent demand for retrieval over huge amounts of information, and many text extraction methods have been proposed in recent years. In this paper, we review text extraction methods for video and images. First, we divide the text extraction process into two steps: text region detection and localization, and text segmentation. Then, we discuss a number of text region detection and localization algorithms and text segmentation algorithms with regard to their application fields, advantages, and disadvantages. Finally, we discuss benchmark data and performance evaluation, and point out promising directions for future research.
