Segmentation of Printed Text in Devanagari Script and Gurmukhi Script

In this paper, we describe line, word, character, and top-character segmentation of printed Hindi text in Devanagari script, and line and word segmentation of printed text in Gurmukhi script. For Devanagari script, segmentation accuracy is 100% at the line level, approximately 100% at the word level, 99% at the character level, and 97% at the top-character level. For Gurmukhi script, accuracy is 100% at the line level and 99% at the word level. These results were measured on five documents in Devanagari script and five documents in Gurmukhi script.
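
The abstract does not say how lines and words are located. One technique widely used for printed text is the projection profile: ink pixels are summed along rows to find text lines, then along columns within each line to find words. Because the headline joins the characters of a word in both Devanagari and Gurmukhi, blank columns inside a line usually mark word boundaries. The sketch below illustrates that idea only and is not claimed to be the authors' method. It assumes a binarized page given as a list of rows with 1 for ink and 0 for background, and the names `find_runs`, `segment_lines`, `segment_words`, and `min_gap` are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]   # assumed input: 1 = ink pixel, 0 = background
Span = Tuple[int, int]    # (start, end) with end exclusive


def find_runs(profile: List[int]) -> List[Span]:
    """Return the spans of consecutive non-zero entries in a profile."""
    runs: List[Span] = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i                      # a run of ink begins
        elif value == 0 and start is not None:
            runs.append((start, i))        # the run ended at a blank entry
            start = None
    if start is not None:                  # run reaches the end of the profile
        runs.append((start, len(profile)))
    return runs


def segment_lines(image: Image) -> List[Span]:
    """Find text lines as row spans whose horizontal profile is non-zero."""
    row_profile = [sum(row) for row in image]
    return find_runs(row_profile)


def segment_words(image: Image, line: Span, min_gap: int = 2) -> List[Span]:
    """Find words in one line as column spans separated by blank gaps.

    Gaps narrower than min_gap columns (an assumed tuning parameter) are
    treated as breaks inside a word and the neighbouring spans are merged.
    """
    top, bottom = line
    width = len(image[0]) if image else 0
    col_profile = [
        sum(image[r][c] for r in range(top, bottom)) for c in range(width)
    ]
    merged: List[Span] = []
    for start, end in find_runs(col_profile):
        if merged and start - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], end)   # close a narrow gap
        else:
            merged.append((start, end))
    return merged


# Toy 5x7 page: two text lines, the first containing two words.
page = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 0, 1, 1],
    [1, 0, 1, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 0, 0],
]
lines = segment_lines(page)
words_per_line = [segment_words(page, line) for line in lines]
```

On real scans the zero threshold in `find_runs` would likely need to tolerate noise, and `min_gap` would have to be chosen relative to the font size. Character and top-character segmentation need script-specific handling of the headline and are not covered by this sketch.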
