Handwritten digit segmentation: a comparative study

In this work, algorithms for segmenting handwritten digits that are based on different concepts are compared by evaluating them under the same implementation conditions. A robust experimental protocol based on a large synthetic database is used to assess each algorithm in terms of correct segmentation rate and computational time. Results on a real database are also presented. In addition to the overall performance of each algorithm, we report the performance for different types of connections, which provides a finer categorization of each algorithm. Another contribution of this work concerns the complementarity of the algorithms. We observed that each method is able to segment samples that cannot be segmented by any other method, regardless of its overall performance. Based on this observation, we conclude that combining different segmentation algorithms may be an appropriate strategy for improving the correct segmentation rate.
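The complementarity observation can be made concrete with a small sketch. It assumes that, for each algorithm, we have the set of sample identifiers it segmented correctly. The function name `complementarity`, the algorithm labels, and the toy data are hypothetical and are not taken from the study. The sketch reports each algorithm's correct segmentation rate, the samples that only that algorithm segments correctly, and the rate an ideal combination would reach if it always picked a correct result when one exists.

```python
from typing import Dict, Set, Tuple


def complementarity(
    correct: Dict[str, Set[int]], n_samples: int
) -> Tuple[Dict[str, float], Dict[str, Set[int]], float]:
    """Summarize how a set of segmentation algorithms complement each other.

    correct maps an algorithm name to the ids of the samples it segmented
    correctly; n_samples is the total number of test samples.
    """
    # Individual correct segmentation rate of each algorithm.
    rates = {name: len(ids) / n_samples for name, ids in correct.items()}

    # Samples that one algorithm solves and no other algorithm solves.
    unique = {}
    for name, ids in correct.items():
        others = set().union(
            *(s for other, s in correct.items() if other != name)
        )
        unique[name] = ids - others

    # Upper bound for a combination: a sample counts as solved if at
    # least one algorithm segments it correctly.
    oracle_rate = len(set().union(*correct.values())) / n_samples
    return rates, unique, oracle_rate


# Hypothetical toy outcome for three algorithms on 10 samples.
correct = {
    "A": {0, 1, 2, 3, 4, 5},
    "B": {0, 1, 2, 6},
    "C": {1, 7},
}
rates, unique, oracle_rate = complementarity(correct, n_samples=10)
print(rates)
print(unique)
print(oracle_rate)
```

In this toy data the weakest algorithm still solves one sample that the others miss, so the combined upper bound exceeds the best individual rate. This is the pattern that motivates combining algorithms; the numbers themselves are illustrative only.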
