Handwritten Hindi Text Segmentation Techniques for Lines and Characters

This paper presents new methods for line segmentation and for character segmentation of overlapping characters in handwritten Hindi text. The text is first segmented into lines and the lines into words; the header lines of the lines and words are then detected and converted to straight lines. Each word is divided into three parts (upper modifier, consonant, and lower modifier) so that character segmentation becomes easier. The algorithm locates the header lines and base lines using an estimate of the average line height. It works efficiently on overlapped characters across different text sizes and image resolutions.
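The pipeline described above can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a binarized image stored as a NumPy array (1 = ink, 0 = background), uses a plain horizontal projection profile to find lines, and takes the densest row in the upper half of each line as the header line. The function names `segment_lines`, `average_line_height`, and `find_header_row` are hypothetical, and overlapping lines, which the paper targets, would need handling beyond this sketch.

```python
import numpy as np


def segment_lines(binary):
    """Split a binary page (1 = ink) into text lines.

    Returns a list of (top, bottom) row indices, bottom exclusive,
    one pair per run of consecutive rows that contain ink.
    Assumption: lines are separated by at least one blank row.
    """
    ink_rows = (binary.sum(axis=1) > 0).astype(np.int8)
    # Pad with zeros so runs touching the image border are closed.
    padded = np.concatenate(([0], ink_rows, [0]))
    change = np.diff(padded)
    starts = np.flatnonzero(change == 1)
    ends = np.flatnonzero(change == -1)
    return list(zip(starts.tolist(), ends.tolist()))


def average_line_height(lines):
    """Mean height in rows of the detected lines."""
    return float(np.mean([bottom - top for top, bottom in lines]))


def find_header_row(binary, top, bottom):
    """Estimate the header line of one text line.

    Assumed heuristic: the header line is the row with the most ink
    within the upper half of the line.
    """
    upper_end = top + max(1, (bottom - top) // 2)
    profile = binary[top:upper_end].sum(axis=1)
    return top + int(np.argmax(profile))
```

In this sketch the average line height is only computed, not applied; an estimate of this kind could, for example, flag detected lines that are much taller than average as candidates for containing overlapped text.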
