Robust Text Stroke Extraction from Video

In this paper, we present a novel approach which employs the temporal redundancy of video texts to improve the performance of text segmentation and stroke extraction from complex background. We first demonstrate how to obtain the required aligned text image sequences from video frames via a robust corners-set matching scheme. Then the changing background pixels can be identified and removed by exploiting the statistics of the temporal redundancy of video. The text stroke pixels can be clearly separated from the complex background. Experiments on TV programs and movies videos show the proposed approach can generate clean text strokes image which can efficiently improve the OCR's performance compare to some traditional approaches.

[1]  Xueming Qian,et al.  Text Detection, Localization and Segmentation in Compressed Videos , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[3]  Xinbo Gao,et al.  Video text extraction using temporal feature vectors , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[4]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[5]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[6]  Michael R. Lyu,et al.  A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..