Modeling segmentation cuts using support vector machines

In this paper we propose a method to evaluate segmentation cuts for handwritten touching digits. The idea of this method is to work as a filter in segmentation-based recognition systems. These types of systems usually rely on over-segmentation methods, where several segmentation hypotheses are created for each touching group of digits and then assessed by a general-purpose classifier. Through the use of the proposed method, unnecessary segmentation cuts can be identified without any attempt of classification by a general-purpose classifier, reducing the number of paths in a segmentation graph, what can consequently lead to a reduction in computational cost. Concavity analysis is performed in each digit before and after segmentation. The difference of those concavities is used to model the segmentation cuts. SVM is used to classify those segmentation cuts. The preliminary results obtained are very promising as for the segmentation algorithm tested, 67.9% of the unnecessary segmentation cuts were eliminated. Moreover, it was possible to achieve a significant increase in the recognition rate for the generalpurpose classifier.

[1]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[2]  Umapada Pal,et al.  Touching numeral segmentation using water reservoir concept , 2003, Pattern Recognit. Lett..

[3]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[4]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[5]  Ching Y. Suen,et al.  Automatic segmentation of unconstrained handwritten numeral strings , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[6]  Jhing-Fa Wang,et al.  Segmentation of Single- or Multiple-Touching Handwritten Numeral String Using Background and Foreground Analysis , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  Venu Govindaraju,et al.  Holistic recognition of touching digits , 1998 .

[9]  Luiz Eduardo Soares de Oliveira,et al.  Automatic Recognition of Handwritten Numerical Strings: A Recognition and Verification Strategy , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Yun Lei,et al.  A recognition based system for segmentation of touching handwritten numeral strings , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[11]  Eric Lecolinet,et al.  A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Yasuaki Nakano,et al.  Segmentation methods for character recognition: from segmentation to document structure analysis , 1992, Proc. IEEE.

[13]  Luiz Eduardo Soares de Oliveira,et al.  A synthetic database to assess segmentation algorithms , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).