Word separation in handwritten legal amounts on bank cheques based on spatial gap distances

This paper presents an efficient method of separating words in handwritten legal amounts on bank cheques based on the spatial gaps between connected components. Currently all typical existing gap measures suffer from poor performance due to the inherent problem of underestimation and overestimation. In order to decrease such burden, a modified version for each of those existing measures is explored. Also, a new method of combining three different types of distance measures based on 4-class clustering is proposed to reduce the errors generated by each measure. In experiments on real bank cheque database, the modified distance measures show about 3% of better separation rate than their original counterparts. In addition, by applying the combining method, further improvement in word separation was achieved.

[1]  Ching Y. Suen,et al.  Recognition of legal amounts on bank cheques , 1998, Pattern Analysis and Applications.

[2]  Ching Y. Suen,et al.  Legal amount recognition based on the segmentation hypotheses for bank check processing , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[3]  Eberhard Mandler,et al.  Document analysis-from pixels to contents , 1992 .

[4]  Giovanni Seni,et al.  External word segmentation of off-line handwritten text lines , 1994, Pattern Recognit..

[5]  Horst Bunke,et al.  Automated Reading of Cheque Amounts , 2000, Pattern Analysis & Applications.

[6]  Uma Mahadevan,et al.  Gap metrics for word separation in handwritten lines , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[7]  Jun Zhou,et al.  A feedback-based approach for segmenting handwritten legal amounts on bank cheques , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[8]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..