OCRdroid: A Framework to Digitize Text on Smart Phones

As demand grows for mobile phone applications, research in optical character recognition (OCR), a technology well developed for scanned documents, is shifting focus to the recognition of text embedded in digital photographs. In this paper, we present OCRdroid, a generic framework for developing OCR-based applications on mobile phones. OCRdroid combines a lightweight image preprocessing suite installed on the mobile phone with an OCR engine connected to a backend server. We demonstrate the functionality of this framework by implementing two OCRdroid-based applications, PocketPal and PocketReader, on the HTC Android G1 mobile phone. Initial evaluations of these pilot experiments demonstrate the potential of the OCRdroid framework for real-world OCR-based mobile applications.
