Challenges in OCR of Devanagari documents

OCR of Devanagari script presents a wide range of challenges that do not arise with Latin-based scripts. This paper outlines the implementation of a neural-network-based Devanagari OCR system. Experimental results on a standard data set are reported and analyzed.
