Camera-based Kanji OCR for mobile-phones: practical issues

A camera based optical character reader (OCR) for Japanese Kanji characters was implemented on a mobile phone. This OCR has three key features. The first is discriminative feature extraction (DFE) which enables a character classifier needing only small memory size. The second is a word segmentation method specially designed for looking up Japanese words in a dictionary. The third feature is a GUI suitable for a mobile phone. A prototype mobile phone Kanji OCR was constructed and experimentally tested. Recognition accuracy of over 95% was obtained under the best conditions, which shows the potential of our prototype as a new type of electronic dictionary.

[1]  Hiroshi Sako,et al.  Information capturing camera and developmental issues , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[2]  David S. Doermann,et al.  Progress in camera-based document image analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[3]  Hiroshi Sako,et al.  Handwritten digit recognition: investigation of normalization and feature extraction techniques , 2004, Pattern Recognit..

[4]  Keiji Yamada,et al.  Camera-typing interface for ubiquitous information services , 2004, Second IEEE Annual Conference on Pervasive Computing and Communications, 2004. Proceedings of the.

[5]  Hiroshi Sako,et al.  A robust method for recognition of monetary amount printed by checkwriter , 2004 .

[6]  Hiroshi Sako,et al.  A robust method for recognition of monetary amount printed by checkwriter , 2004, Systems and Computers in Japan.

[7]  Hiroshi Sako,et al.  Lexical Search Approach for Character-String Recognition , 1998, Document Analysis Systems.

[8]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[9]  Cheng-Lin Liu,et al.  Building compact classifier for large character set recognition using discriminative feature extraction , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[10]  Rudolf Albrecht Systems: Theory and Practice , 1998, Advances in Computing Science.