A comprehensive video text localization system based on stroke filter

Texts in images and videos are a significant source of high-level semantics for indexing and retrieval. But the localization and extraction of text are always interfered by complex backgrounds with strong edge and texture clutter. In this paper, we propose a comprehensive text localization system which combines several popular methods. First, we extract the stroke features of text using improved stroke filter. Then we employ global thresholding and projection profile to localize the candidate text regions on the stroke map. In the end, based on the stroke statistical features, we use SVM classifier to refine the text regions. The experimental results show that the system is robust to the text size and complex background.

[1]  Michael R. Lyu,et al.  A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Michael R. Lyu,et al.  A new approach for video text detection , 2002, Proceedings. International Conference on Image Processing.

[3]  Rainer Lienhart,et al.  Automatic text recognition in digital videos , 1995, Electronic Imaging.

[4]  Wolfgang Effelsberg,et al.  Automatic text segmentation and text recognition for video indexing , 2000, Multimedia Systems.

[5]  Qifeng Liu,et al.  Stroke Filter for Text Localization in Video Images , 2006, 2006 International Conference on Image Processing.

[6]  Wen Gao,et al.  Fast and effective text detection , 2008, 2008 15th IEEE International Conference on Image Processing.

[7]  Jin Hyung Kim,et al.  Texture-Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Qifeng Liu,et al.  A stroke filter and its application to text localization , 2009, Pattern Recognit. Lett..

[9]  EffelsbergWolfgang,et al.  Automatic text segmentation and text recognition for video indexing , 2000 .

[10]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..