Text detection in video frames using hybrid features

Text in video frames carries concise, important content information that helps with video scene understanding, annotation and search. This paper proposes a new method for detecting text in video frames. First, a small overlapping sliding window is scanned over the frame, and hybrid features are extracted from each window. An SVM classifier is then used to distinguish text from background. Finally, a voting mechanism and a morphological filter are applied to locate the text region precisely. The new method is expected to outperform existing strategies because of two improvements. The first is the selection of robust features that distinguish both scene text and overlay text from complex backgrounds. The second is support for multilingual text throughout the whole processing chain. The proposed algorithm has been evaluated on four different kinds of video, and the experiments show high performance.
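The three stages described above (overlapping windows, per-window classification, then voting and morphological filtering) can be illustrated with a minimal pure-Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the single `edge_density` feature stands in for the hybrid features, the `is_text` callable stands in for the trained SVM, and the window size, step, vote threshold and 3x3 opening are arbitrary choices.

```python
def sliding_windows(height, width, size, step):
    """Yield (top, left) corners of overlapping size x size windows."""
    for top in range(0, height - size + 1, step):
        for left in range(0, width - size + 1, step):
            yield top, left

def edge_density(frame, top, left, size):
    """Toy stand-in feature: mean absolute horizontal gradient in the window."""
    total = 0
    for r in range(top, top + size):
        row = frame[r]
        for c in range(left, left + size - 1):
            total += abs(row[c + 1] - row[c])
    return total / (size * (size - 1))

def vote_map(frame, size, step, is_text):
    """Each window classified as text adds one vote to every pixel it covers."""
    h, w = len(frame), len(frame[0])
    votes = [[0] * w for _ in range(h)]
    for top, left in sliding_windows(h, w, size, step):
        # is_text is a hypothetical placeholder for the trained SVM decision.
        if is_text(edge_density(frame, top, left, size)):
            for r in range(top, top + size):
                for c in range(left, left + size):
                    votes[r][c] += 1
    return votes

def threshold(votes, min_votes):
    """Keep pixels that received at least min_votes votes."""
    return [[1 if v >= min_votes else 0 for v in row] for row in votes]

def _morph(mask, k, op):
    """Apply op (min = erosion, max = dilation) over a k x k neighbourhood.
    Pixels outside the frame are treated as background (0)."""
    h, w = len(mask), len(mask[0])
    r = k // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            vals = [mask[yy][xx] if 0 <= yy < h and 0 <= xx < w else 0
                    for yy in range(y - r, y + r + 1)
                    for xx in range(x - r, x + r + 1)]
            out[y][x] = op(vals)
    return out

def opening(mask, k=3):
    """Morphological opening: erosion followed by dilation."""
    return _morph(_morph(mask, k, min), k, max)

# Synthetic 12 x 16 frame: a high-contrast striped block on a flat background.
frame = [[255 if 4 <= r < 8 and 4 <= c < 12 and c % 2 == 0 else 0
          for c in range(16)] for r in range(12)]
votes = vote_map(frame, size=4, step=2, is_text=lambda f: f > 50)
mask = opening(threshold(votes, min_votes=3))
for row in mask:
    print("".join("#" if v else "." for v in row))
```

On this synthetic frame the printed mask marks the striped block and leaves the flat background empty. Because neighbouring windows overlap, a pixel inside a text region collects several votes while a pixel covered by only one or two positive windows is dropped, which is why the vote threshold tightens the region boundary before the opening removes small isolated responses.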
