Implementing Kannada Optical Character Recognition on the Android Operating System for Kannada Sign Boards

2 ABSTRACT: This paper focuses on an application that performs Kannada OCR (Optical Character Recognition) in hand held devices. The objective is to make use of the visual capabilities of the built in camera of Android devices to extract text from Kannada sign boards, newspapers etc., and use the same platform to implement the OCR technology with the help of kohonen's algorithm. Another aim of this application is to find the meaning of the word detected on the Internet. With the help of an initial, sample data, the Kohonen network is trained. The image taken is further processed by initially thinning the characters in the image using the Hilditch algorithm. The image is processed, eliminating small distortions present. The image is then converted to into a greyscale image, which is segmented and the result is displayed along with a vocal output. These characters may be edited by the user and its meaning is then displayed in English.

[1]  Rose Holley,et al.  How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs , 2009, D Lib Mag..

[2]  Mark L. Murphy The Busy Coder's Guide to Android Development , 2009 .

[3]  Junaid Tariq,et al.  α-Soft: An English language OCR , 2010, 2010 2nd International Conference on Computer Engineering and Technology.

[4]  Heaton T. Jeff,et al.  Introduction to Neural Networks with Java , 2005 .

[5]  R. Manmatha,et al.  A Fast Alignment Scheme for Automatic OCR Evaluation of Books , 2011, 2011 International Conference on Document Analysis and Recognition.

[6]  Kevin N. Gurney,et al.  An introduction to neural networks , 2018 .

[7]  Debashis Ghosh,et al.  A Comparative Study of Different Approaches of Noise Removal for Document Images , 2011, SocProS.

[8]  Himanshu Aggarwal,et al.  A Comprehensive Review of Image Enhancement Techniques , 2010, ArXiv.

[9]  Jürgen Schmidhuber,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[10]  Ergina Kavallieratou Text line detection and segmentation: uneven skew angles and hill-and-dale writing , 2010, SAC '10.

[11]  David S. Doermann,et al.  Logo Matching for Document Image Retrieval , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[12]  Ioannis Anagnostopoulos,et al.  A License Plate-Recognition Algorithm for Intelligent Transportation System Applications , 2006, IEEE Transactions on Intelligent Transportation Systems.

[13]  Christopher N. K. Mooers,et al.  Performance evaluation of the self‐organizing map for feature extraction , 2006 .