Extraction of Text Regions and Recognition of Characters from Video Inputs

This paper proposes a new algorithm for extracting and recognizing characters from video without a priori knowledge of the characters' font, color, or size. Given low-resolution input video with complex backgrounds, consecutive frames that contain the same text region are detected automatically and combined into an averaged frame. Using the boundary pixels of the text region as seeds, region filling is applied to separate the background from the characters, and color clustering then removes any background that remains. For recognition, simple features such as white runs and zero-one transitions measured from the center are extracted and compared with a predefined character feature set. Experiments on various news videos show that the proposed method is effective in terms of both caption extraction rate and character recognition rate.
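The abstract gives no implementation details, so the following is only a minimal sketch of three of the steps it names: frame averaging, boundary-seeded region filling, and counting zero-one transitions. It assumes grayscale frames stored as lists of rows, 4-connectivity, and an intensity tolerance `tol`. All function names and the tolerance value are hypothetical and are not taken from the paper. The color clustering and white-run steps are omitted.

```python
from collections import deque


def average_frames(frames):
    """Pixel-wise mean of equally sized grayscale frames (lists of rows)."""
    n = len(frames)
    h, w = len(frames[0]), len(frames[0][0])
    return [[sum(f[y][x] for f in frames) / n for x in range(w)]
            for y in range(h)]


def remove_background(region, tol=30):
    """Flood-fill from every boundary pixel of the text region.

    Returns a mask where True marks pixels the fill did not reach
    (candidate character pixels). Assumption: a neighbor joins the fill
    when its intensity differs from the current pixel by at most `tol`.
    """
    h, w = len(region), len(region[0])
    is_text = [[True] * w for _ in range(h)]
    queue = deque()
    # Seed the fill with all pixels on the region boundary.
    for y in range(h):
        for x in range(w):
            if y in (0, h - 1) or x in (0, w - 1):
                is_text[y][x] = False
                queue.append((y, x))
    # Breadth-first growth over 4-connected neighbors.
    while queue:
        y, x = queue.popleft()
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            if (0 <= ny < h and 0 <= nx < w and is_text[ny][nx]
                    and abs(region[ny][nx] - region[y][x]) <= tol):
                is_text[ny][nx] = False
                queue.append((ny, nx))
    return is_text


def zero_one_transitions(scan_line):
    """Count 0 -> 1 changes along a binary scan line."""
    return sum(1 for a, b in zip(scan_line, scan_line[1:])
               if a == 0 and b == 1)
```

Averaging frames that share the same caption tends to smooth a changing background while leaving the static text intact, which is why the fill is run on the averaged frame rather than on a single frame. The transition count here is taken along an arbitrary scan line; the paper measures its features from the character center, which this sketch does not reproduce.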
