论文信息 - Text Detection and Recognition in Imagery: A Survey

Text Detection and Recognition in Imagery: A Survey

This paper analyzes, compares, and contrasts technical challenges, methods, and the performance of text detection and recognition research in color imagery. It summarizes the fundamental problems and enumerates factors that should be considered when addressing these problems. Existing techniques are categorized as either stepwise or integrated and sub-problems are highlighted including text localization, verification, segmentation and recognition. Special issues associated with the enhancement of degraded text and the processing of video text, multi-oriented, perspectively distorted and multilingual text are also addressed. The categories and sub-categories of text are illustrated, benchmark datasets are enumerated, and the performance of the most representative approaches is compared. This review provides a fundamental comparison and analysis of the remaining problems in the field.

David S. Doermann | Qixiang Ye | D. Doermann | Qixiang Ye

[1] Frédo Durand,et al. Efficient marginal likelihood optimization in blind deconvolution , 2011, CVPR 2011.

[2] Salvatore Tabbone,et al. A skeleton based descriptor for detecting text in real scene images , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[3] Yuanping Zhu,et al. Recognizing Natural Scene Characters by Convolutional Neural Network and Bimodal Image Enhancement , 2011, CBDAR.

[4] Jean-Michel Jolion,et al. Object count/area graphs for the evaluation of object detection and segmentation algorithms , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[5] Chucai Yi,et al. Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[6] Chunheng Wang,et al. Scene Text Recognition Using Part-Based Tree-Structured Character Detection , 2013, CVPR 2013.

[7] David S. Doermann,et al. Superresolution-based enhancement of text in digital video , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[8] Ujjwal Bhattacharya,et al. Devanagari and Bangla Text Extraction from Natural Scene Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[9] Chew Lim Tan,et al. Edge Based Binarization for Video Text Images , 2010, 2010 20th International Conference on Pattern Recognition.

[10] Kwanghoon Sohn,et al. Static text region detection in video sequences using color and orientation consistencies , 2008, 2008 19th International Conference on Pattern Recognition.

[11] Dimosthenis Karatzas,et al. Multi-script Text Extraction from Natural Scenes , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[12] Erik G. Learned-Miller,et al. Enforcing similarity constraints with integer programming for better scene text recognition , 2011, CVPR 2011.

[13] Yunde Jia,et al. Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images , 2008, Pattern Recognit..

[14] Fei Yin,et al. A Fast Stroke-Based Method for Text Detection in Video , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[15] Yongdong Zhang,et al. A Novel Image Text Extraction Method Based on K-Means Clustering , 2008, Seventh IEEE/ACIS International Conference on Computer and Information Science (icis 2008).

[16] Jiri Matas,et al. Text Localization in Real-World Images Using Efficiently Pruned Exhaustive Search , 2011, 2011 International Conference on Document Analysis and Recognition.

[17] Pascale Sébillot,et al. A comprehensive neural-based approach for text recognition in videos using natural language processing , 2011, ICMR '11.

[18] Chew Lim Tan,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .

[19] Yingli Tian,et al. Localizing Text in Scene Images by Boundary Clustering, Stroke Segmentation, and String Fragment Classification , 2012, IEEE Transactions on Image Processing.

[20] Li Linlin,et al. Edge Based Binarization for Video Text Images , 2010, ICPR 2010.

[21] Gérard G. Medioni,et al. Text segmentation in color images using tensor voting , 2007, Image Vis. Comput..

[22] Palaiahnakote Shivakumara,et al. Detection of Curved Text in Video: Quad Tree Based Method , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[23] Apostolos Antonacopoulos,et al. Text extraction from Web images based on a split-and-merge segmentation method using colour perception , 2004, ICPR 2004.

[24] Allen R. Hanson,et al. Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Bing-Fei Wu,et al. A multi-plane approach for text segmentation of complex document images , 2009, Pattern Recognit..

[26] Xu Liu,et al. A camera phone based currency reader for the visually impaired , 2008, Assets '08.

[27] Santosh Kumar Divvala,et al. Exemplar Driven Character Recognition in the Wild , 2012, BMVC.

[28] Chunheng Wang,et al. Text detection in images based on unsupervised classification of edge-based features , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[29] C. V. Jawahar,et al. An MRF Model for Binarization of Natural Scene Text , 2011, 2011 International Conference on Document Analysis and Recognition.

[30] Cheng-Lin Liu,et al. Text Localization in Natural Scene Images Based on Conditional Random Field , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[31] Nizar Bouguila,et al. Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform , 2012, BMVC.

[32] Edward K. Wong,et al. A new robust algorithm for video text extraction , 2003, Pattern Recognit..

[33] Jiri Matas,et al. On Combining Multiple Segmentations in Scene Text Recognition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[34] Shinichiro Omachi,et al. OCR Fonts Revisited for Camera-Based Character Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[35] Weiqiang Wang,et al. Robustly Extracting Captions in Videos Based on Stroke-Like Edges and Spatio-Temporal Analysis , 2012, IEEE Transactions on Multimedia.

[36] Tao Wang,et al. End-to-end text recognition with convolutional neural networks , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[37] Jiřı́ Matas,et al. Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[38] Shih-Fu Chang,et al. A Bayesian framework for fusing multiple word knowledge models in videotext recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[39] Rong Huang,et al. On the Possibility of Structure Learning-Based Scene Character Detector , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[40] Jin Hyung Kim,et al. Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[41] Songfeng Lu,et al. A density-based approach for text extraction in images , 2008, 2008 19th International Conference on Pattern Recognition.

[42] C. V. Jawahar,et al. Whole is Greater than Sum of Parts: Recognizing Scene Text Words , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[43] Steffen Wachenfeld,et al. Recognition of Screen-Rendered Text , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[44] Christophe Garcia,et al. text Detection with Convolutional Neural Networks , 2008, VISAPP.

[45] John R. Kender,et al. A unified text extraction method for instructional videos , 2005, IEEE International Conference on Image Processing 2005.

[46] Erik G. Learned-Miller,et al. Improving Recognition of Novel Input with Similarity , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[47] Lionel Prevost,et al. A cascade detector for text detection in natural scene images , 2008, 2008 19th International Conference on Pattern Recognition.

[48] Pascale Sébillot,et al. Combining Multi-scale Character Recognition and Linguistic Knowledge for Natural Scene Text OCR , 2012, Document Analysis Systems.

[49] Xian-Sheng Hua,et al. An automatic performance evaluation protocol for video text detection algorithms , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[50] Bernard Gosselin,et al. Spatial and Color Spaces Combination for Natural Scene Text Extraction , 2006, 2006 International Conference on Image Processing.

[51] Robinson Piramuthu,et al. Region-Based Discriminative Feature Pooling for Scene Text Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[52] Palaiahnakote Shivakumara,et al. Recognition of Video Text through Temporal Integration , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[53] Hiroshi Sako,et al. Kanji Character Detection from Complex Real Scene Images based on Character Properties , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[54] Anil K. Jain,et al. Locating text in complex color images , 1995, Pattern Recognit..

[55] Raymond Smith,et al. Adapting the Tesseract open source OCR engine for multilingual OCR , 2009, MOCR '09.

[56] Yuxiao Hu,et al. Text From Corners: A Novel Approach to Detect Text and Caption in Videos , 2011, IEEE Transactions on Image Processing.

[57] David S. Doermann,et al. Geometric Rectification of Camera-Captured Document Images , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58] Hang Joon Kim,et al. Automatic text detection and removal in video sequences , 2003, Pattern Recognit. Lett..

[59] Tae-Kyun Kim,et al. Design and Evaluation of Features That Best Define Text in Complex Scene Images , 2009, MVA.

[60] Kongqiao Wang,et al. An Improved Scene Text Extraction Method Using Conditional Random Field and Optical Character Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[61] SongYi-Zhe,et al. Text extraction from natural scene image , 2013 .

[62] Chunheng Wang,et al. An adaptive text detection approach in images and video frames , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[63] Kaizhu Huang,et al. Robust Text Detection in Natural Scene Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64] C. V. Jawahar,et al. Scene Text Recognition using Higher Order Language Priors , 2009, BMVC.

[65] Palaiahnakote Shivakumara,et al. A New Method for Arbitrarily-Oriented Text Detection in Video , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[66] Fumitaka Kimura,et al. Convex hull based approach for multi-oriented character recognition from graphical documents , 2008, 2008 19th International Conference on Pattern Recognition.

[67] Palaiahnakote Shivakumara,et al. A New Method for Handwritten Scene Text Detection in Video , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[68] Jacqueline L. Feild,et al. Improving Text Recognition in Images of Natural Scenes , 2014 .

[69] Jin Hyung Kim,et al. Integrating multiple character proposals for robust scene text extraction , 2013, Image Vis. Comput..

[70] Craig A. Knoblock,et al. An Approach for Recognizing Text Labels in Raster Maps , 2010, 2010 20th International Conference on Pattern Recognition.

[71] Imran Siddiqi,et al. Edge-Based Features for Localization of Artificial Urdu Text in Video Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[72] Jon Almazán,et al. ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[73] Rong Huang,et al. Scene Character Detection by an Edge-Ray Filter , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[74] Wen Wu,et al. Integrating co-training and recognition for text detection , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[75] Alessandro Vinciarelli,et al. A survey on off-line Cursive Word Recognition , 2002, Pattern Recognit..

[76] Nikos A. Nikolaou,et al. Color reduction for complex document images , 2009, Int. J. Imaging Syst. Technol..

[77] Klaus Meyer-Wegener,et al. NEOCR: A Configurable Dataset for Natural Image Text Recognition , 2011, CBDAR.

[78] C. V. Jawahar,et al. Top-down and bottom-up cues for scene text recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[79] Sudeep Sarkar,et al. Robust outdoor text detection using text intensity and shape features , 2008, 2008 19th International Conference on Pattern Recognition.

[80] Wataru Ohyama,et al. Accuracy Improvement of Viewpoint-Free Scene Character Recognition by Rotation Angle Estimation , 2013, CBDAR.

[81] Shijian Lu,et al. A New Fourier-Moments Based Video Word and Character Extraction Method for Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[82] Qingming Huang,et al. A configurable method for multi-style license plate recognition , 2009, Pattern Recognit..

[83] Yann LeCun,et al. Convolutional neural networks applied to house numbers digit classification , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[84] Tatiana Novikova,et al. Large-Lexicon Attribute-Consistent Text Recognition in Natural Images , 2012, ECCV.

[85] Christof Koch,et al. AdaBoost for Text Detection in Natural Scene , 2011, 2011 International Conference on Document Analysis and Recognition.

[86] Palaiahnakote Shivakumara,et al. New Fourier-Statistical Features in RGB Space for Video Text Detection , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[87] Jean-Marc Odobez,et al. Video text recognition using sequential Monte Carlo and error voting methods , 2005, Pattern Recognit. Lett..

[88] Michael R. Lyu,et al. A new approach for video text detection , 2002, Proceedings. International Conference on Image Processing.

[89] Palaiahnakote Shivakumara,et al. 2009 10th International Conference on Document Analysis and Recognition A Gradient Difference based Technique for Video Text Detection , 2022 .

[90] Kai Wang,et al. End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.

[91] Amandeep Kaur,et al. Hough transform based fast skew detection and accurate skew correction methods , 2008, Pattern Recognit..

[92] Alan L. Yuille,et al. Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[93] David S. Doermann,et al. Scene Text Detection via Integrated Discrimination of Component Appearance and Consensus , 2013, CBDAR.

[94] Cheng-Lin Liu,et al. Fast scene text localization by learning-based filtering and verification , 2010, 2010 IEEE International Conference on Image Processing.

[95] Simon M. Lucas,et al. ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[96] Anil K. Jain,et al. Automatic caption localization in compressed video , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[97] Hongbin Zha,et al. Skew detection for complex document images using robust borderlines in both text and non-text regions , 2008, Pattern Recognit. Lett..

[98] Stefano Soatto,et al. Direct Sparse Deblurring , 2010, Journal of Mathematical Imaging and Vision.

[99] Jorge Stolfi,et al. T-HOG: An effective gradient-based descriptor for single line text regions , 2013, Pattern Recognit..

[100] Hyung Jeong Yang,et al. Automatic detection and recognition of Korean text in outdoor signboard images , 2010, Pattern Recognit. Lett..

[101] Palaiahnakote Shivakumara,et al. Recognizing Text with Perspective Distortion in Natural Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[102] Ismail Haritaoglu. Scene text extraction and translation for handheld devices , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[103] Nobuo Ezaki,et al. Improved text-detection methods for a camera-based text reading system for blind persons , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[104] S.M. Lucas,et al. ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[105] Ioannis Pratikakis,et al. Goal-Oriented Rectification of Camera-Based Document Images , 2011, IEEE Transactions on Image Processing.

[106] Xin Zhang,et al. Multiple Geometry Transform Estimation from Single Camera-Captured Text Image , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[107] Zhuowen Tu,et al. Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images , 2013, PloS one.

[108] Palaiahnakote Shivakumara,et al. A New Gradient Based Character Segmentation Method for Video Text Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[109] Saeed Mozaffari,et al. Farsi/Arabic text extraction from video images by corner detection , 2010, 2010 6th Iranian Conference on Machine Vision and Image Processing.

[110] Huadong Ma,et al. Automatic Detection and Localization of Natural Scene Text in Video , 2010, 2010 20th International Conference on Pattern Recognition.

[111] C. Garcia,et al. Text detection and segmentation in complex color images , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[112] David J. Crandall,et al. Extraction of special effects caption text events from digital video , 2003, International Journal on Document Analysis and Recognition.

[113] Hiroshi Kawakami,et al. A novel adaptive morphological approach for degraded character image segmentation , 2005, Pattern Recognit..

[114] Palaiahnakote Shivakumara,et al. A Gradient Vector Flow-Based Method for Video Character Segmentation , 2011, 2011 International Conference on Document Analysis and Recognition.

[115] Jiang Gao,et al. An adaptive algorithm for text detection from natural scenes , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[116] Jin Hyung Kim,et al. Scene Text Recognition with a Hough Forest Implicit Shape Model , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[117] Erik G. Learned-Miller,et al. Improving Open-Vocabulary Scene Text Recognition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[118] Andrew Y. Ng,et al. Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning , 2011, 2011 International Conference on Document Analysis and Recognition.

[119] Manik Varma,et al. Character Recognition in Natural Images , 2009, VISAPP.

[120] Xinbo Gao,et al. A spatial-temporal approach for video caption detection and recognition , 2002, IEEE Trans. Neural Networks.

[121] Yuxin Peng,et al. Using Multiple Frame Integration for the Text Recognition of Video , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[122] Andreas Dengel,et al. ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[123] Shijian Lu,et al. Video Character Recognition through Hierarchical Classification , 2011, 2011 International Conference on Document Analysis and Recognition.

[124] Jin Hyung Kim,et al. Scene Text Extraction with Edge Constraint and Text Collinearity , 2010, 2010 20th International Conference on Pattern Recognition.

[125] Ioannis Pratikakis,et al. A two-stage scheme for text detection in video images , 2010, Image Vis. Comput..

[126] Ujjwal Bhattacharya,et al. A Robust Approach to Extraction of Texts from Camera Captured Images , 2013, CBDAR.

[127] Cheng-Lin Liu,et al. Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[128] Silvio Ferreira,et al. A Text Detection Technique Applied in the Framework of a Mobile Camera-Based Application , .

[129] Allen R. Hanson,et al. A discriminative semi-Markov model for robust scene text recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[130] Jin Hyung Kim,et al. Complementary combination of holistic and component analysis for recognition of low-resolution video character images , 2008, Pattern Recognit. Lett..

[131] David S. Doermann,et al. Text enhancement in digital video using multiple frame integration , 1999, MULTIMEDIA '99.

[132] Palaiahnakote Shivakumara,et al. Efficient video text detection using edge features , 2008, 2008 19th International Conference on Pattern Recognition.

[133] Chunheng Wang,et al. Graph-Based Background Suppression for Scene Text Detection , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[134] Jing Zhang,et al. Extraction of Text Objects in Video Documents: Recent Progress , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[135] Sanghoon Sull,et al. An Efficient Method for Text Detection in Video Based on Stroke Width Similarity , 2007, ACCV.

[136] Lionel Prevost,et al. 2009 10th International Conference on Document Analysis and Recognition Text Detection and Localization in Complex Scene Images using Constrained AdaBoost Algorithm , 2022 .

[137] Jerod J. Weinman,et al. Toward Integrated Scene Text Reading , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[138] Majid Mirmehdi,et al. Recognising text in real scenes , 2002, International Journal on Document Analysis and Recognition.

[139] Rainer Lienhart,et al. Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[140] Zhuowen Tu,et al. Detecting Texts of Arbitrary Orientations in 1 Natural Images , 2012 .

[141] Makoto Tanaka,et al. Text-Tracking Wearable Camera System for the Blind , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[142] Minoru Etoh,et al. Hypothesis Preservation Approach to Scene Text Recognition with Weighted Finite-State Transducer , 2011, 2011 International Conference on Document Analysis and Recognition.

[143] Huizhong Chen,et al. Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.

[144] Kai Wang,et al. Word Spotting in the Wild , 2010, ECCV.

[145] Dai Ruwei,et al. Chinese character recognition: history, status and prospects , 2007 .

[146] Cheng-Lin Liu,et al. A Hybrid Approach to Detect and Localize Texts in Natural Scene Images , 2011, IEEE Transactions on Image Processing.

[147] Craig A. Knoblock,et al. Recognition of Multi-oriented, Multi-sized, and Curved Text , 2011, 2011 International Conference on Document Analysis and Recognition.

[148] Ismail Haritaoglu,et al. Shape-DNA: Effective Character Restoration and Enhancement for Arabic Text Documents , 2010, 2010 20th International Conference on Pattern Recognition.

[149] Wei Liang,et al. A Novel Italic Detection and Rectification Method for Chinese Advertising Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[150] Takuya Kobayashi,et al. Recognition of Multiple Characters in a Scene Image Using Arrangement of Local Features , 2011, 2011 International Conference on Document Analysis and Recognition.

[151] David S. Doermann,et al. Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[152] Bernard Gosselin,et al. Color text extraction with selective metric-based clustering , 2007, Comput. Vis. Image Underst..

[153] Shijian Lu,et al. Camera Text Recognition based on Perspective Invariants , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[154] Tatiana Novikova,et al. Image Binarization for End-to-End Text Understanding in Natural Images , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[155] Takeo Kanade,et al. Limits on super-resolution and how to break them , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[156] Wen Gao,et al. Fast and robust text detection in images and video frames , 2005, Image Vis. Comput..

[157] Yi Li,et al. Orientation Robust Text Line Detection in Natural Images , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[158] Wakahara Toru,et al. Binarization of Color Character Strings in Scene Images Using K-means Clustering and Support Vector Machines , 2010 .

[159] Ujjwal Bhattacharya,et al. Scene text detection using sparse stroke information and MLP , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[160] Premkumar Natarajan,et al. Character-Stroke Detection for Text-Localization and Extraction , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[161] Jerod J. Weinman. Typographical Features for Scene Text Recognition , 2010, 2010 20th International Conference on Pattern Recognition.

[162] Xilin Chen,et al. Automatic detection and recognition of signs from natural scenes , 2004, IEEE Transactions on Image Processing.

[163] Allen R. Hanson,et al. Fast Lexicon-Based Scene Text Recognition with Sparse Belief Propagation , 2007 .

[164] Yingli Tian,et al. Text Detection in Natural Scene Images by Stroke Gabor Words , 2011, 2011 International Conference on Document Analysis and Recognition.

[165] David Nistér,et al. Linear Time Maximally Stable Extremal Regions , 2008, ECCV.

[166] Chew Lim Tan,et al. Text extraction from name cards using neural network , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[167] Shijian Lu,et al. Multioriented Video Scene Text Detection Through Bayesian Classification and Boundary Growing , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[168] Hartmut Neven,et al. PhotoOCR: Reading Text in Uncontrolled Conditions , 2013, 2013 IEEE International Conference on Computer Vision.

[169] Toru Wakahara,et al. Binarization of Color Characters in Scene Images Using k-means Clustering and Support Vector Machines , 2010, 2010 20th International Conference on Pattern Recognition.

[170] Palaiahnakote Shivakumara,et al. Text detection in natural scenes using Gradient Vector Flow-Guided symmetry , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[171] Bernd Freisleben,et al. Text detection in images based on unsupervised classification of high-frequency wavelet coefficients , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[172] Jiri Matas,et al. Scene Text Localization and Recognition with Oriented Stroke Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[173] Shinichiro Omachi,et al. Affine Invariant Information Embedment for Accurate Camera-Based Character Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[174] Seth J. Teller,et al. Spatially Prioritized and Persistent Text Detection and Decoding , 2013, CBDAR.

[175] Kongqiao Wang,et al. Character location in scene images from digital camera , 2003, Pattern Recognit..

[176] Tong Lu,et al. A Robust Color-Independent Text Detection Method from Complex Videos , 2011, 2011 International Conference on Document Analysis and Recognition.

[177] Jean-Marc Odobez,et al. Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[178] Jun Huang,et al. Text detection and restoration in natural scene images , 2007, J. Vis. Commun. Image Represent..

[179] Ana Cristina Murillo,et al. Towards robust and efficient text sign reading from a mobile phone , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[180] Ching Y. Suen,et al. A robust method of recognizing multi-font rotated characters , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[181] Palaiahnakote Shivakumara,et al. A novel ring radius transform for video character reconstruction , 2013, Pattern Recognit..

[182] Yingli Tian,et al. Text extraction from scene images by character appearance and structure modeling , 2013, Comput. Vis. Image Underst..

[183] Lei Huang,et al. A Novel Method for Embedded Text Segmentation Based on Stroke and Color , 2011, 2011 International Conference on Document Analysis and Recognition.

[184] Wenyu Liu,et al. Strokelets: A Learned Multi-scale Representation for Scene Text Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[185] Weiqiang Wang,et al. Extracting Captions in Complex Background from Videos , 2010, 2010 20th International Conference on Pattern Recognition.

[186] Ma Hongqing,et al. A new automatic extraction method of container identity codes , 2003, Proceedings of the 2003 IEEE International Conference on Intelligent Transportation Systems.

[187] Gang Zhou,et al. Detecting multilingual text in natural scene , 2011, 2011 1st International Symposium on Access Spaces (ISAS).

[188] Michael R. Lyu,et al. A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[189] Robert C. Bolles,et al. Rectification and recognition of text in 3-D scenes , 2004, International Journal of Document Analysis and Recognition (IJDAR).

[190] David S. Doermann,et al. Camera-based analysis of text and documents: a survey , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[191] Seungyong Lee,et al. Text Image Deblurring Using Text-Specific Properties , 2012, ECCV.

[192] S. Lucas,et al. ICDAR 2003 robust reading competitions: entries, results, and future directions , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[193] Edward M. Riseman,et al. TextFinder: An Automatic System to Detect and Recognize Text In Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[194] Jiri Matas,et al. A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.

[195] Wen Gao,et al. Automatic text segmentation from complex background , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[196] Michael J Cortese,et al. Handbook of Psycholinguistics , 2011 .

[197] Umapada Pal,et al. Multi-Oriented and Multi-Sized Touching Character Segmentation Using Dynamic Programming , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[198] Wonjun Kim,et al. A New Approach for Overlay Text Detection and Extraction From Complex Video Scene , 2009, IEEE Transactions on Image Processing.

[199] Partha Pratim Roy,et al. ICDAR 2011 Robust Reading Competition - Challenge 1: Reading Text in Born-Digital Images (Web and Email) , 2011, 2011 International Conference on Document Analysis and Recognition.

[200] Hyung Il Koo,et al. Scene Text Detection via Connected Component Clustering and Nontext Filtering , 2013, IEEE Transactions on Image Processing.

[201] Manuel Blum,et al. reCAPTCHA: Human-Based Character Recognition via Web Security Measures , 2008, Science.

[202] Anil K. Jain,et al. Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[203] Chew Lim Tan,et al. Character Recognition under Severe Perspective Distortion , 2008, 2009 10th International Conference on Document Analysis and Recognition.

[204] Anil K. Jain,et al. Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[205] Li Xu,et al. Two-Phase Kernel Estimation for Robust Motion Deblurring , 2010, ECCV.

[206] Wen Gao,et al. A robust text detection algorithm in images and video frames , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[207] Yonatan Wexler,et al. Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[208] Palaiahnakote Shivakumara,et al. Accurate video text detection through classification of low and high contrast images , 2010, Pattern Recognit..