Towards robust and efficient text sign reading from a mobile phone

Embedded applications on mobile phones are reaching impressive goals thanks to the current powerful smartphones. This work is focused on text recognition applications from mobile phone pictures. Optical Character Recognition (OCR) methods have been developed for a longtime, but they still have poor robustness to process text in general scene images. Our general goal is to study and improve their results, in particular when running locally on a phone. We present a realistic prototype running on iOS, with a light geometry based pre-processing step that helps detecting regions of interest in the image, i.e., likely to contain text-signs. Then, we show how to process and filter these hypothesis to facilitate text recognition by standard OCR methods. This initial version is aimed to rectangular shaped signs to easily take advantage of geometric cues. We demonstrate the performance improvements of including our proposal together with several available OCR libraries. All steps are run locally on the phone in the designed application, which can read or translate the text using additional standard services in the phone.

[1]  Jun Huang,et al.  Text detection and restoration in natural scene images , 2007, J. Vis. Commun. Image Represent..

[2]  Jean-Michel Jolion,et al.  Extraction and recognition of artificial text in multimedia documents , 2003, Formal Pattern Analysis & Applications.

[3]  Yingli Tian,et al.  Improving Computer Vision-Based Indoor Wayfinding for Blind Persons with Context Information , 2010, ICCHP.

[4]  Makoto Tanaka,et al.  Text-tracking wearable camera system for visually-impaired people , 2008, 2008 19th International Conference on Pattern Recognition.

[5]  Silvio Ferreira,et al.  A Text Detection Technique Applied in the Framework of a Mobile Camera-Based Application , .

[6]  Ching Y. Suen,et al.  Historical review of OCR research and development , 1992, Proc. IEEE.

[7]  Nobuo Ezaki,et al.  Text detection from natural scene images: towards a system for visually impaired persons , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[8]  Wei Zhang,et al.  Video Compass , 2002, ECCV.

[9]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[10]  Kai Wang,et al.  Word Spotting in the Wild , 2010, ECCV.

[11]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[12]  Jiri Matas,et al.  A Method for Text Localization and Recognition in Real-World Images , 2010, ACCV.

[13]  Matthieu Cord,et al.  TEXT EXTRACTION FROM STREET LEVEL IMAGES , 2009 .

[14]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[15]  Chao Yang,et al.  Attacks and design of image recognition CAPTCHAs , 2010, CCS '10.

[16]  Edward M. Riseman,et al.  Finding text in images , 1997, DL '97.

[17]  Bernard Gosselin,et al.  A Multifunctional Reading Assistant for the Visually Impaired , 2007, EURASIP J. Image Video Process..

[18]  Majid Mirmehdi,et al.  Combining statistical measures to find image text regions , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.