Video Text extraction and recognition: A survey

A means to naturally recognizing and fetching out the content of video description would possibly make them indexed in considerable and appropriate way for later reference, and would facilitate actions viz. automatic notification and dissemination, to be triggered in real time by the contents of streaming video. Video text recognition, or video OCR, is a constructive tool to characterize the contents of video containing overlay text (text captions superimposed over the video imagery, such as in broadcast news programs) and scene text (text that appears in the real scene of the video, such as text on street signs, nameplates, and billboards). In this paper exhaustive survey is done for text detecting, extraction and recognizing in complex images and video frames. Digital Videos are widely used both professionally and domestically because of the easy availability of camcorders to mobile phones. People are increasingly making videos may be for commercial use or personal use, this is leading to growing content of Video. While we can capture, compress, store, transmit and display video with great facility, editing videos and manipulating them based on their content is still a non-trivial activity. This paper concentrates on extracting text out of the video frames, taken out of the video.

[1]  Ioannis Pratikakis,et al.  Multiresolution text detection in video frames , 2007, VISAPP.

[2]  Anil K. Jain,et al.  Locating text in complex color images , 1995, Pattern Recognit..

[3]  Qifeng Liu,et al.  Stroke Filter for Text Localization in Video Images , 2006, 2006 International Conference on Image Processing.

[4]  E. S. Samundeeswari,et al.  Image Text Extraction and Recognition using Hybrid Approach of Region Based and Connected Component Methods , 2014 .

[5]  Shraddha M. Naik,et al.  Text Detection and Character Extraction in Natural Scene Images , 2015 .

[6]  Korris Fu-Lai Chung,et al.  Hybrid Chinese/English text detection in images and video frames , 2002, Object recognition supported by user interaction for service robots.

[7]  Palaiahnakote Shivakumara,et al.  A New Method for Arbitrarily-Oriented Text Detection in Video , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[8]  Shilpa Arora,et al.  Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera , 2014 .

[9]  Wonjun Kim,et al.  A New Approach for Overlay Text Detection and Extraction From Complex Video Scene , 2009, IEEE Transactions on Image Processing.

[10]  Xian-Sheng Hua,et al.  Automatic location of text in video frames , 2001, MULTIMEDIA '01.

[11]  Hang Joon Kim,et al.  Support vector machine-based text detection in digital video , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).

[12]  Ioannis Pratikakis,et al.  A Hybrid System for Text Detection in Video Frames , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[13]  B. H. Shekar,et al.  Text localization in video using multiscale weber's local descriptor , 2015, 2015 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES).

[14]  Xin Zhang,et al.  A combined algorithm for video text extraction , 2010, 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery.

[15]  P. S. Hiremath,et al.  Multilingual Text Localization in Natural Scene Images using Wavelet based Edge Features and Fuzzy Classification , 2015 .

[16]  C. S. Shin,et al.  Support vector machine-based text detection in digital video , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).

[17]  Michael R. Lyu,et al.  A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Edward K. Wong,et al.  A robust algorithm for text extraction in color video , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[19]  Shigeru Akamatsu,et al.  Recognizing Characters in Scene Images , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Chunheng Wang,et al.  Text detection in images based on unsupervised classification of edge-based features , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[21]  Qing Chen,et al.  Evaluation of OCR algorithms for images with different spatial resolutions and noises , 2004 .

[22]  M. Sundaresan,et al.  Extraction and Recognition of Text From Digital English Comic Image Using Median , 2013 .

[23]  David J. Crandall,et al.  Evaluation of Methods for Detection and Localization of Text in Video , 2015 .

[24]  Christof Koch,et al.  AdaBoost for Text Detection in Natural Scene , 2011, 2011 International Conference on Document Analysis and Recognition.

[25]  Michael R. Lyu,et al.  A new approach for video text detection , 2002, Proceedings. International Conference on Image Processing.

[26]  Frank Lebourgeois Robust multifont OCR system from gray level images , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[27]  Basilios Gatos,et al.  A Pixel-Based Evaluation Method for Text Detection in Color Images , 2010, 2010 20th International Conference on Pattern Recognition.

[28]  Xinbo Gao,et al.  Automatic News Video Caption Extraction and Recognition , 2000, IDEAL.

[29]  Zhiming Wang,et al.  An Approach for Video-Text Extraction Based on Text Traversing Line and Stroke Connectivity , 2010, 2010 International Conference on Biomedical Engineering and Computer Science.

[30]  Jin Hyung Kim,et al.  Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Palaiahnakote Shivakumara,et al.  A Robust Wavelet Transform Based Technique for Video Text Detection , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[32]  C. Garcia,et al.  Text detection and segmentation in complex color images , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[33]  Juergen Luettin,et al.  A Survey of Text Detection and Recognition in Images and Videos , 2000 .

[34]  Jianping Fan,et al.  Automatic image segmentation by integrating color-edge extraction and seeded region growing , 2001, IEEE Trans. Image Process..

[35]  Georges Quénot,et al.  From Text Detection in Videos to Person Identification , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[36]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[37]  J. K. Mantri,et al.  Text Extraction and Recognition from Image using Neural Network , 2012 .

[38]  Xueming Qian,et al.  Text Detection, Localization and Segmentation in Compressed Videos , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[39]  Wang Zhi-ming,et al.  An Approach for Video-text Extraction based on Text Traversing Line and Stroke Connectivity , 2009 .

[40]  Chein-I Chang,et al.  Thresholding Video Images for Text Detection , 2002, ICPR.

[41]  Cheng-Lin Liu,et al.  A Hybrid Approach to Detect and Localize Texts in Natural Scene Images , 2011, IEEE Transactions on Image Processing.

[42]  Wen Gao,et al.  Fast and robust text detection in images and video frames , 2005, Image Vis. Comput..

[43]  Alireza Khotanzad,et al.  Invariant Image Recognition by Zernike Moments , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  M. M. Kodabagi,et al.  Text region extraction from low resolution natural scene images using texture features , 2010, 2010 IEEE 2nd International Advance Computing Conference (IACC).

[45]  Edward M. Riseman,et al.  Finding text in images , 1997, DL '97.

[46]  R. Dhir Comparative Analysis of Classifiers Inaccuracies for Bilingual Characters ( Gurmukhi and Roman ) , 2008 .

[47]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[48]  Wen Gao,et al.  A robust text detection algorithm in images and video frames , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[49]  Jayshree Ghorpade,et al.  Extracting Text from Video , 2011 .

[50]  Chew Lim Tan,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence, Manuscript Id a Laplacian Approach to Multi-oriented Text Detection in Video , 2022 .

[51]  Datong Chen,et al.  Text enhancement with asymmetric filter for video OCR , 2001, Proceedings 11th International Conference on Image Analysis and Processing.

[52]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[53]  David Doermann,et al.  Text enhancement in digital video , 1999, Electronic Imaging.

[54]  Ioannis Pratikakis,et al.  Detecting text in video frames , 2007 .

[55]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[56]  Jean-Marc Odobez,et al.  Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[57]  David J. Crandall,et al.  Extraction of special effects caption text events from digital video , 2003, International Journal on Document Analysis and Recognition.