Detecting text in video frames

In this paper we propose an edge-based algorithm for artificial text detection in video frames. First, an edge map is created using the Canny edge detector. Then, morphological filtering is used, based on geometrical constraints, in order to connect the vertical edges and discard false alarms. A connected component analysis is performed to the filtered edge map in order to determine a bounding box for every candidate text area. Finally, horizontal and vertical projections are calculated on the edge map of every box and a threshold is applied, refining the result and splitting text areas in text lines. The whole algorithm is applied in multiresolution fashion to ensure text detection with size variability. Experimental results prove that the method is highly effective and efficient for artificial text detection.

[1]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[2]  Zhang Yi,et al.  Automatic Text Detection In Video Frames Based on Bootstrap Artificial Neural Network And CED , 2003 .

[3]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[4]  Hao Yan,et al.  Automatic Text Detection In Video Frames Based on Bootstrap Artificial Neural Network and CED , 2003, WSCG.

[5]  Dmitry B. Goldgof,et al.  Performance Evaluation of Text Detection and Tracking in Video , 2006, Document Analysis Systems.

[6]  Wen Gao,et al.  Fast and robust text detection in images and video frames , 2005, Image Vis. Comput..

[7]  Majid Mirmehdi,et al.  Finding Text Regions Using Localised Measures , 2000 .

[8]  Noel E. O'Connor,et al.  Automatic detection and extraction of artificial text in video , 2004 .

[9]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Horst Bunke,et al.  Identification of text on colored book and journal covers , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[11]  Chein-I Chang,et al.  Automated system for text detection in individual video images , 2003, J. Electronic Imaging.

[12]  Wen Wu,et al.  Integrating co-training and recognition for text detection , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[13]  Xian-Sheng Hua,et al.  A video text detection and recognition system , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[14]  Christopher Wolf,et al.  Model based text detection in images and videos: a learning approach , 2004 .

[15]  Xian-Sheng Hua,et al.  An automatic performance evaluation protocol for video text detection algorithms , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Ellen K. Hughes,et al.  Video OCR for Digital News Archives , 1998 .

[17]  Anil K. Jain,et al.  Automatic caption localization in compressed video , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[18]  Jean-Philippe Thiran,et al.  Text identification in complex background using SVM , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[19]  David J. Crandall,et al.  Extraction of special effects caption text events from digital video , 2003, International Journal on Document Analysis and Recognition.

[20]  Rainer Lienhart,et al.  Automatic text recognition in digital videos , 1995, Electronic Imaging.