论文信息 - Robust extraction of text in video

Robust extraction of text in video

Despite advances in the archiving of digital video, we are still unable to efficiently search and retrieve the portions that interest us. Video indexing by shot segmentation has been a proposed solution and several research efforts are seen in the literature. Shot segmentation alone cannot solve the problem of content based access to video. Recognition of text in video has been proposed as an additional feature. Several research efforts are found in the literature for text extraction from complex images and video with applications for video indexing. We present an update of our system for detection and extraction of an unconstrained variety of text from general purpose video. The text detection results from a variety of methods are fused and each single text instance is segmented to enable it for OCR. Problems in segmenting text from video are similar to those faced in detection and localization phases. Video has low resolution and the text often has poor contrast with a changing background. The proposed system applies a variety of methods and takes advantage of the temporal redundancy in video resulting in good text segmentation.

David J. Crandall | Rangachar Kasturi | Sameer K. Antani | R. Kasturi | Sameer Kiran Antani

[1] Stefano Messelodi,et al. Automatic identification and skew estimation of text lines in real scene images , 1999, Pattern Recognition.

[2] Shigeru Akamatsu,et al. Recognizing Characters in Scene Images , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[3] Ramesh C. Jain,et al. Pattern Recognition Methods in Image and Video Databases: Past, Present and Future , 1998, SSPR/SPR.

[4] Lowell L. Winger,et al. Character segmentation and thresholding in low-contrast scene images , 1996, Electronic Imaging.

[5] Ki-Sang Hong,et al. Binarization of noisy gray-scale character images by thin line modeling , 1999, Pattern Recognit..

[6] Seong-Whan Lee,et al. Direct Extraction of Topographic Features for , 1995 .

[7] Mihaela van der Schaar-Mitrea. Compression of mixed video and graphics images for TV systems , 1998 .

[8] A. Gupta,et al. Text segmentation in mixed-mode images , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[9] Edward M. Riseman,et al. Finding text in images , 1997, DL '97.

[10] Frank Lebourgeois. Robust multifont OCR system from gray level images , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[11] Atreyi Kankanhalli,et al. Automatic Extraction of Characters in Complex Scene Images , 1995, Int. J. Pattern Recognit. Artif. Intell..

[12] Chitra Dorai,et al. Automatic text extraction from video for content-based annotation and retrieval , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[13] Pengfei Zhu,et al. On Critical Point Detection of Digital Shapes , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[14] Mohamed S. Kamel,et al. Extraction of Binary Character/Graphics Images from Grayscale Document Images , 1993, CVGIP Graph. Model. Image Process..

[15] Nevenka Dimitrova,et al. Text detection for video analysis , 1999, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL'99).