A combined algorithm for video text extraction

Video text provides high-level semantic information. However, due to the complex background in video, it is of great difficulty to extract text efficiently. Although many methods hold assumptions on single feature, such as texture, connected areas etc., there are still some problems in dealing with multilingual text extraction because of its quite different appearance. In this paper, the color and edge features are used to extract the text from the video frame. In this paper, two methods are combined, called color-edge combined algorithm, to remove text background. One of the combined methods is based on the exponential changes of text color, called Transition Map model, the other one uses the text edges of different gray level image. After removing complex background, text location is determined using the vertical and horizontal projection method. This algorithm is robust to the image with multilingual text. Through extensive comparison with other approach, experimental results on a large number of video images successfully demonstrate the efficiency of this algorithm.

[1]  Pheng-Ann Heng,et al.  A double-threshold image binarization method based on edge detector , 2008, Pattern Recognit..

[2]  Wonjun Kim,et al.  A New Approach for Overlay Text Detection and Extraction From Complex Video Scene , 2009, IEEE Transactions on Image Processing.

[3]  Zhong Wang,et al.  Research of Double-Threshold Segmentation of Brazing-Area Defect of Saw Based on Otsu and HSV Color Space , 2009, 2009 2nd International Congress on Image and Signal Processing.

[4]  Wen Gao,et al.  Fast and robust text detection in images and video frames , 2005, Image Vis. Comput..

[5]  Adnan Amin,et al.  Page segmentation and classification utilising a bottom-up approach , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[6]  Palaiahnakote Shivakumara,et al.  Efficient video text detection using edge features , 2008, 2008 19th International Conference on Pattern Recognition.

[7]  Tao Cheng,et al.  A Smart Approach for Text Detection, Localization and Extraction in Video Frames , 2009, 2009 International Conference on Information Technology and Computer Science.

[8]  Bernard Gosselin,et al.  Spatial and Color Spaces Combination for Natural Scene Text Extraction , 2006, 2006 International Conference on Image Processing.

[9]  Wen Gao,et al.  Multi-polarity text segmentation using graph theory , 2008, 2008 15th IEEE International Conference on Image Processing.

[10]  Michael R. Lyu,et al.  A comprehensive method for multilingual video text detection, localization, and extraction , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Chunheng Wang,et al.  Text detection in images based on unsupervised classification of edge-based features , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[12]  Jing Zhang,et al.  Extraction of Text Objects in Video Documents: Recent Progress , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[13]  Jing Zhang,et al.  A new edge-based text verification approach for video , 2008, 2008 19th International Conference on Pattern Recognition.

[14]  Jian Yu,et al.  Otsu Method and K-means , 2009, 2009 Ninth International Conference on Hybrid Intelligent Systems.