Evaluation of Classical and Novel Ensemble Methods for Handwritten Word Recognition

Handwritten text recognition is one of the most difficult problems in the field of pattern recognition. Recently, a number of classifier creation and combination methods, known as ensemble methods, have been proposed in the field of machine learning. They have shown improved recognition performance over single classifiers. In this paper a number of ensemble methods for handwritten word recognition are described, experimentally evaluated, and compared to each other. Those methods include classical, general purpose ensemble methods as well as novel ensemble methods specifically developed by the authors for handwritten word recognition. The aim of the paper is to investigate the potential of ensemble methods for improving the performance of handwriting recognition systems. The base recognition systems used in this paper are hidden Markov model classifiers.

[1]  Horst Bunke,et al.  Using a Statistical Language Model to Improve the Performance of an HMM-Based Cursive Handwriting Recognition System , 2001, Int. J. Pattern Recognit. Artif. Intell..

[2]  Horst Bunke,et al.  New Boosting Algorithms for Classification Problems with Large Number of Classes Applied to a Handwritten Word Recognition Task , 2003, Multiple Classifier Systems.

[3]  Gyeonghwan Kim,et al.  An architecture for handwritten text recognition systems , 1999, International Journal on Document Analysis and Recognition.

[4]  Ching Y. Suen,et al.  Computer recognition of unconstrained handwritten numerals , 1992, Proc. IEEE.

[5]  Horst Bunke,et al.  Hidden Markov model length optimization for handwriting recognition systems , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[6]  Horst Bunke,et al.  Creation of classifier ensembles for handwritten word recognition using feature selection algorithms , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[7]  Horst Bunke,et al.  Generating Classifier Ensembles from Multiple Prototypes and Its Application to Handwriting Recognition , 2002, Multiple Classifier Systems.

[8]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[10]  J.-C. Simon,et al.  Off-line cursive word recognition , 1992, Proc. IEEE.

[11]  Horst Bunke,et al.  Automatic bankcheck processing , 1997 .

[12]  Horst Bunke,et al.  Ensembles of classifiers for handwritten word recognition , 2003, Document Analysis and Recognition.

[13]  Torsten Caesar,et al.  Sophisticated topology of hidden Markov models for cursive script recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[14]  Horst Bunke,et al.  A new combination scheme for HMM-based classifiers and its application to handwriting recognition , 2002, Object recognition supported by user interaction for service robots.

[15]  Ching Y. Suen,et al.  Combination of multiple classifier decisions for optical character recognition , 1997 .

[16]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[17]  Seong-Whan Lee,et al.  Advances in Handwriting Recognition , 1999, Series in Machine Perception and Artificial Intelligence.

[18]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[19]  William B. Yates,et al.  Engineering Multiversion Neural-Net Systems , 1996, Neural Computation.

[20]  Horst Bunke,et al.  Optimizing the number of states, training iterations and Gaussians in an HMM-based handwritten word recognizer , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[21]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[22]  Horst Bunke,et al.  Handbook of Character Recognition and Document Image Analysis , 1997 .

[23]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[24]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[25]  Horst Bunke,et al.  Automatic segmentation of the IAM off-line database for handwritten English text , 2002, Object recognition supported by user interaction for service robots.