Broken kannada character recognition — A neural network based approach

Character degradation is a major problem in character recognition. Most of the historical documents are degraded due to extrinsic factors like blurring, skew, background noise etc and intrinsic factors like distortion, broken characters, touching characters etc. In this paper we propose a novel approach of rebuilding the broken characters and then using neural network for recognition. Kannada characters are difficult to recognize due to their complicated shapes and braking of characters makes it much more difficult. The broken characters are therefore rebuilt using end point algorithm to remove brokenness and a single layer neural network is used for classification. A recognition accuracy of 98.9% was achieved for broken characters on synthetically generated data sets.

[1]  N. Sandhya,et al.  A novel local enhancement technique for rebuilding Broken characters in a degraded Kannada script , 2015, 2015 IEEE International Advance Computing Conference (IACC).

[2]  D.R. Ramesh Babu,et al.  Recognition of machine printed broken characters based on gradient patterns and its spatial relationship , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[3]  Chaivatna Sumetphong,et al.  An Optimal Approach Towards Recognizing Broken Thai Characters in OCR Systems , 2012, 2012 International Conference on Digital Image Computing Techniques and Applications (DICTA).

[4]  S. Tangwongsan,et al.  Effectively recognizing broken characters in Historical documents , 2012, 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE).

[5]  Chaivatna Sumetphong,et al.  Modeling broken characters recognition as a set-partitioning problem , 2012, Pattern Recognit. Lett..

[6]  S. Karsoliya,et al.  Approximating Number of Hidden layer neurons in Multiple Hidden Layer BPNN Architecture , 2012 .

[7]  R. Indra Gandhi,et al.  RECOGNITION OF DISTORTED CHARACTER USING EDGE DETECTION ALGORITHM , 2013 .

[8]  Subhagata Chattopadhyay,et al.  Recognition and Classification of Broken Characters using Feed Forward Neural Network to Enhance an OCR Solution , 2012 .

[9]  Subhagata Chattopadhyay,et al.  Automatic Recognition of Handwritten Bengali Broken Characters (BBC): Simulating Human Pattern Matching , 2012 .

[10]  S. Tangwongsan,et al.  Recognizing broken characters in Thai Historical documents , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).