Off-line Cursive Handwriting Recognition-On the Influence of Training Set and Vocabulary Size in Multiple Classifier Systems

Handwritten text recognition is one of the most difficult problems in the field of pattern recognition. Recently, a number of classifier creation and combination methods, known as ensemble methods, have been proposed in the field of machine learning. In this paper we examine the influence of the vocabulary size and the number of training samples on the performance of three ensemble methods in the context of cursive handwriting recognition. All experiments were conducted using an off-line HMM-based handwritten word recognizer.

[1]  Sargur N. Srihari Handwritten Address Interpretation: A Task of Many Pattern Recognition Problems , 2000, Int. J. Pattern Recognit. Artif. Intell..

[2]  Ching Y. Suen,et al.  Multiple Classifier Combination Methodologies for Different Output Levels , 2000, Multiple Classifier Systems.

[3]  H.M.R. Abou-zeid,et al.  Computer recognition of unconstrained handwritten numerals , 2003, 2003 46th Midwest Symposium on Circuits and Systems.

[4]  Horst Bunke,et al.  Automatic bankcheck processing , 1997 .

[5]  Robert P. W. Duin,et al.  Bagging and the Random Subspace Method for Redundant Feature Spaces , 2001, Multiple Classifier Systems.

[6]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[7]  Gilles F. Houle,et al.  A Multi-Layered Corroboration-Based Check Reader , 1996, DAS.

[8]  Makoto Kobayashi,et al.  Off-line character recognition using HMM by multiple directional feature extraction and voting with bagging algorithm , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[9]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[10]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[11]  Robert P. W. Duin,et al.  Bagging and Boosting for the Nearest Mean Classifier: Effects of Sample Size on Diversity and Accuracy , 2002, Multiple Classifier Systems.

[12]  William B. Yates,et al.  Engineering Multiversion Neural-Net Systems , 1996, Neural Computation.

[13]  Horst Bunke,et al.  Using a Statistical Language Model to Improve the Performance of an HMM-Based Cursive Handwriting Recognition System , 2001, Int. J. Pattern Recognit. Artif. Intell..