Rectification and recognition of text in 3-D scenes

Abstract.Real-world text on street signs, nameplates, etc. often lies in an oblique plane and hence cannot be recognized by traditional OCR systems due to perspective distortion. Furthermore, such text often comprises only one or two lines, preventing the use of existing perspective rectification methods that were primarily designed for images of document pages. We propose an approach that reliably rectifies and subsequently recognizes individual lines of text. Our system, which includes novel algorithms for extraction of text from real-world scenery, perspective rectification, and binarization, has been rigorously tested on still imagery as well as on MPEG-2 video clips in real time.

[1]  Tarak Gandhi,et al.  Application of planar motion segmentation for scene text extraction , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[2]  Rainer Lienhart,et al.  Indexing and retrieval of digital video sequences based on automatic text recognition , 1997, MULTIMEDIA '96.

[3]  William M. Newman,et al.  Documents through cameras , 1999, Image Vis. Comput..

[4]  David S. Doermann,et al.  Automatic text tracking in digital videos , 1998, 1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175).

[5]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[6]  David S. Doermann,et al.  Automatic identification of text in digital video key frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[7]  David S. Doermann,et al.  Text Extraction, Enhancement and OCR in Digital Video , 1998, Document Analysis Systems.

[8]  Majid Mirmehdi,et al.  Location and recovery of text on oriented surfaces , 1999, Electronic Imaging.

[9]  Victor K. Y. Wu Automatic Text Detection and Recognition , 1997 .

[10]  M. Smith,et al.  Video Skimming for Quick Browsing based on Audio and Image Characterization , 1995 .

[11]  Majid Mirmehdi,et al.  Recognising text in real scenes , 2002, International Journal on Document Analysis and Recognition.

[12]  Maurizio Pilu,et al.  Extraction of illusory linear clues in perspectively skewed documents , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[13]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[14]  Christopher R. Dance,et al.  Perspective estimation for document images , 2001, IS&T/SPIE Electronic Imaging.

[15]  Shigeru Akamatsu,et al.  Recognizing Characters in Scene Images , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Azriel Rosenfeld,et al.  A method of detecting the orientation of aligned components , 1986, Pattern Recognit. Lett..

[17]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Boon-Lock Yeo,et al.  Visual content highlighting via automatic extraction of embedded captions on MPEG compressed video , 1996, Electronic Imaging.

[19]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[20]  Majid Mirmehdi,et al.  Extracting Low Resolution Text with an Active Camera for OCR , 2001 .

[21]  Anil K. Jain,et al.  Locating text in complex color images , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[22]  Yasuaki Nakano,et al.  An algorithm for the skew normalization of document image , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[23]  Majid Mirmehdi,et al.  On the Recovery of Oriented Documents from Single Images , 2002 .