Text extraction in real scene images on planar planes

This paper proposes a hybrid approach of a texture-based method and a connected component-based one for extracting texts in real scene images. For detecting texts having a lot of variations in size, shape, etc, we use a multiple-continuously adaptive mean shift algorithm on the text probability image produced by a multi-layer perceptron. It is assumed that the scene text lies on planar rectangular surfaces with homogeneous background colors. We correct perspective distortion using warping parameters calculated after segmentation of an input image. We can detect and reconstruct text images accurately and efficiently.

[1]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[2]  Tarak Gandhi,et al.  Application of planar motion segmentation for scene text extraction , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[3]  Rainer Lienhart,et al.  On the segmentation of text in videos , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[4]  V. Pisarevsky,et al.  Intel's Computer Vision Library: applications in calibration, stereo segmentation, tracking, gesture, face and object recognition , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[5]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[6]  Anil K. Jain,et al.  Learning Texture Discrimination Masks , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Hang Joon Kim,et al.  Neural network-based text location for news video indexing , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[8]  JungHyun Han,et al.  Text scanner with text detection technology on image sequences , 2002, Object recognition supported by user interaction for service robots.

[9]  Qian Huang,et al.  Character extraction of license plates from video , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  P. Gamba,et al.  Character recognition in external scenes by means of vanishing point grouping , 1997, Proceedings of 13th International Conference on Digital Signal Processing.

[11]  Dorin Comaniciu,et al.  Robust detection and tracking of human faces with an active camera , 2000, Proceedings Third IEEE International Workshop on Visual Surveillance.