Handwritten Mail Classification Experiments with the Rimes Database

In this paper, we consider the task of automatic handwritten mail classification and investigate the relation between the transcription rate and the classification rate. We test several configurations of a multi-word handwriting recognizer using different language models and report their word recognition rates on the documents to be classified. For the document classification task, we investigate three classifiers: k-nearest neighbors (KNN), support vector machines (SVM), and AdaBoost. All experiments were conducted on the public Rimes database.
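To make the setting concrete, the following is a minimal sketch of classifying a noisy transcription with a KNN classifier. It is an illustration under stated assumptions, not the system evaluated here: the term-frequency features, the cosine similarity, the value of k, the toy English letters, and the two category names are all hypothetical choices made for the example.

```python
import math
from collections import Counter

# Hypothetical toy training set: (transcribed text, category label).
# Neither the texts nor the labels come from the Rimes database.
TRAINING = [
    ("please cancel my contract", "cancellation"),
    ("i wish to cancel the subscription", "cancellation"),
    ("cancel the contract at the end of the month", "cancellation"),
    ("my new address is below", "address_change"),
    ("please note my change of address", "address_change"),
    ("i moved please update my address", "address_change"),
]


def tf_vector(text):
    """Term-frequency vector of a (possibly erroneous) transcription."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(query, training, k=3):
    """Return the majority label among the k training texts most similar to the query."""
    q = tf_vector(query)
    ranked = sorted(training, key=lambda item: cosine(q, tf_vector(item[0])), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# "cuntract" stands in for a word misrecognized by the handwriting recognizer.
predicted = knn_classify("please cancel the cuntract", TRAINING)
```

In this toy case the misrecognized word contributes nothing to the similarity, and the remaining correctly recognized words still decide the category. This is the kind of dependence between transcription quality and classification outcome that the experiments examine.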
