OCRdroid: A Framework to Digitize Text Using Mobile Phones

As demand for mobile phone applications grows, research in optical character recognition (OCR), a technology well developed for scanned documents, is shifting its focus to recognizing text embedded in digital photographs. In this paper, we present OCRdroid, a generic framework for developing OCR-based applications on mobile phones. OCRdroid combines a lightweight image preprocessing suite installed on the mobile phone with an OCR engine connected to a backend server. We demonstrate the functionality of this framework by implementing two OCRdroid-based applications, PocketPal and PocketReader, on the HTC Android G1 mobile phone. Initial evaluations of these pilot experiments demonstrate the potential of the OCRdroid framework for real-world OCR-based mobile applications.
