Video OCR for Indexing and Retrieval

We present an implementation of a search engine that searches videos based on its textual content. The system consists of four parts video processing, spell correction, indexing and searching. The video processing is done by dividing the video into frames and extracting text out of it. Lecture videos, news having some textual content in it show good results. General Terms Optical Character Recognition (OCR), Connected Components (CCs), Video Search Engine.

[1]  Dimosthenis Karatzas,et al.  MSER-Based Real-Time Text Detection and Tracking , 2014, 2014 22nd International Conference on Pattern Recognition.

[2]  Marc Davis,et al.  Media streams: representing video for retrieval and repurposing , 1994, MULTIMEDIA '94.

[3]  Carlos Merino A Framework Towards Realtime Detection and Tracking of Text , 2007 .

[4]  Horst Bischof,et al.  Efficient Maximally Stable Extremal Region (MSER) Tracking , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5]  Harald Sack,et al.  Lecture Video Indexing and Analysis Using Video OCR Technology , 2011, 2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems.

[6]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[7]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[8]  Tiecheng Liu,et al.  Content Extraction and Summarization of Instructional Videos , 2006, 2006 International Conference on Image Processing.

[9]  P. Geetha,et al.  An effective video search re-ranking for content based video retrieval , 2011, 3rd International Conference on Trendz in Information Sciences & Computing (TISC2011).

[10]  Julien Law-To,et al.  VOXALEADNEWS: A scalable content based video search engine , 2012, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI).

[11]  Zi Huang,et al.  Content-Based Video Search: Is there a Need, and Is it Possible? , 2008, 2008 International Workshop on Information-Explosion and Next Generation Search.

[12]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[13]  Jorge Stolfi,et al.  Snoopertrack: Text detection and tracking for outdoor videos , 2011, 2011 18th IEEE International Conference on Image Processing.

[14]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Huizhong Chen,et al.  Robust text detection in natural images with edge-enhanced Maximally Stable Extremal Regions , 2011, 2011 18th IEEE International Conference on Image Processing.