Segmentation of touching modifiers and consonants in middle region of handwritten Hindi text

Character segmentation is a major problem in any text recognition system. A common problem in the recognition of handwritten Hindi text is a left modifier touching the consonant in the middle region of the word. In an Optical Character Recognition (OCR) system, the recognition rate decreases when touching characters are present, and detecting them is a tedious task. Based on the structural properties of the text, a new algorithm is proposed to segment the left modifier from the consonant in the middle region of the word. The results obtained with the proposed algorithm are encouraging. The problem of over-segmentation that occurs when segmenting a touching left modifier from the consonant in the middle region is also explained.
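The abstract does not give the steps of the proposed algorithm, so the following is only an illustrative sketch of one generic structural cue, a vertical projection profile over the middle region, and not the authors' method. It assumes a binarized middle-region image (header line already removed) stored as a list of rows with 1 for ink; the names `column_profile`, `candidate_cut_columns`, and the `max_ink` threshold are hypothetical.

```python
def column_profile(region):
    """Count ink pixels (value 1) in each column of a binary image."""
    if not region:
        return []
    width = len(region[0])
    return [sum(row[x] for row in region) for x in range(width)]


def candidate_cut_columns(region, max_ink=1):
    """Return interior columns whose ink count is between 1 and max_ink.

    Such thin columns are plausible places where a left modifier touches
    a consonant. This is an assumed heuristic, not the paper's algorithm.
    """
    profile = column_profile(region)
    inked = [x for x, count in enumerate(profile) if count > 0]
    if not inked:
        return []
    left, right = inked[0], inked[-1]
    # Skip the outermost inked columns: cutting there separates nothing.
    return [x for x in range(left + 1, right) if 1 <= profile[x] <= max_ink]


# Toy middle region: a vertical bar (column 0) joined by a thin bridge
# (columns 1-2) to a consonant-like shape (columns 3-5).
SAMPLE_REGION = [
    [1, 0, 0, 1, 0, 1],
    [1, 0, 0, 1, 0, 1],
    [1, 1, 1, 1, 1, 1],
    [1, 0, 0, 1, 0, 1],
    [1, 0, 0, 1, 0, 0],
]

if __name__ == "__main__":
    print(column_profile(SAMPLE_REGION))
    print(candidate_cut_columns(SAMPLE_REGION))
```

In this toy region the heuristic flags the bridge columns and also a thin column inside the consonant-like shape. A cut at that last column would split the consonant itself, which is the kind of over-segmentation the abstract refers to.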
