Character Segmentation in Gurumukhi Handwritten Textusing Hybrid Approach

the researchers to think about the optical character recognition (OCR). OCR is the process of recognizing a segmented part of the scanned image as a character. OCR process consists of three major sub processes-pre processing, segmentation and then recognition. Out of these three, the segmentation process is the most important phase of the overall OCR process. It is the most significant process because if the output of segmentation phase is incorrect then we can not expect the correct results; it is just like garbage in and garbage out. But on the same time, segmentation is complex too. If the document is handwritten then the situation becomes more cumbersome, because in that case only few points are there which can be used to make segmentation. In this paper, we formulate an algorithm to segment the scanned document image as a character. As per our earlier published work, the information about the lines and words within each line is written in a data file. According to proposed algorithm, one part is extracted from the word present in the line. This extracted part is checked whether it has some meaningful symbol (as per Gurumukhi script). If it has then the extracted part is marked and written in the file, otherwise the extracted part is readjusted to find the symbol. For classification, we have used hybrid approach which consists of water reservoir and feature extraction approach. This concept was implemented and got good reasonable results.

[1]  K. Sharma Rajiv,et al.  Challenges in Segmentation of Text in Handwritten Gurmukhi Script , 2010, BAIP.

[2]  Amardeep Singh,et al.  Detection and segmentation of Handwritten Text in Gurmukhi Script using Flexible Windowing , 2010 .

[3]  Veena Bansal,et al.  Segmentation of touching and fused Devanagari characters , 2002, Pattern Recognit..

[4]  U. Pal,et al.  Segmentation of Bangla unconstrained handwritten text , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[5]  Amardeep Singh,et al.  A Hybrid Approach to Classify Gurmukhi Script Characters , 2010 .

[6]  Eric Lecolinet,et al.  A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Yasuaki Nakano,et al.  Segmentation methods for character recognition: from segmentation to document structure analysis , 1992, Proc. IEEE.

[8]  Majid Ahmadi,et al.  Segmentation of touching characters in printed document recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[9]  Giovanni Seni,et al.  External word segmentation of off-line handwritten text lines , 1994, Pattern Recognit..

[10]  Rajendra Kumar Sharma,et al.  Segmentation Problems and Solutions in Printed Degraded Gurmukhi Script , 2006 .

[11]  Hardeep Singh,et al.  A hybrid approach to character segmentation of Gurmukhi script characters , 2003, 32nd Applied Imagery Pattern Recognition Workshop, 2003. Proceedings..

[12]  Chandan Singh,et al.  Text segmentation of machine-printed Gurmukhi script , 2000, IS&T/SPIE Electronic Imaging.