Segmentation of Bengali Handwritten Conjunct Characters Through Structural Disintegration

The large set of convoluted conjunct characters in the Bengali script makes the recognition process burdensome. In this paper, we propose a segmentation technique based on structural disintegration that fragments conjunct characters into discernible shapes to improve recognition accuracy. We use a set of structure-based segmentation rules that bifurcate each character into discernible shape components. The bifurcation is performed by locating the touching region where two basic shapes meet to form a conjunct character. The proposed method has been tested on a data set of Bengali handwritten conjunct characters and segments them efficiently. In future work, we plan to incorporate it as a preprocessing step in a Bengali optical character recognition system.
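To make the idea of locating a touching region concrete, the following is a minimal sketch only. The abstract does not specify the structural rules, so this example substitutes a simple vertical-projection heuristic: it treats the thinnest inked column inside the glyph as the candidate join between two shapes. The function names (`column_profile`, `find_touching_column`, `split_at`), the `margin` parameter, and the toy glyph are all hypothetical and are not taken from the paper.

```python
def column_profile(image):
    """Count foreground (1) pixels in each column of a binary image."""
    return [sum(col) for col in zip(*image)]


def find_touching_column(image, margin=1):
    """Return the index of the thinnest column inside the glyph.

    Assumption: the touching region of two basic shapes appears as a
    narrow vertical neck. This is a stand-in heuristic, not the
    structure-based rule set described in the paper.
    """
    profile = column_profile(image)
    inked = [x for x, count in enumerate(profile) if count > 0]
    if not inked:
        return None  # blank image, nothing to split
    # Skip `margin` columns at each end so the outer strokes are ignored.
    left, right = inked[0] + margin, inked[-1] - margin
    if left > right:
        return None  # glyph too narrow to bifurcate
    # Ties are resolved in favour of the leftmost column.
    return min(range(left, right + 1), key=lambda x: profile[x])


def split_at(image, column):
    """Bifurcate the image into left and right parts at `column`."""
    left = [row[:column] for row in image]
    right = [row[column:] for row in image]
    return left, right


# Toy glyph: two 3x3 blobs joined by a one-pixel-high bridge.
glyph = [
    [1, 1, 1, 0, 0, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 0, 1, 1, 1],
]
cut = find_touching_column(glyph)
left_part, right_part = split_at(glyph, cut)
```

A projection profile only handles joins that are roughly side by side; conjuncts whose components are stacked or overlapped would need the kind of structural rules the paper proposes.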
