Confidence-Based Discriminative Training for Model Adaptation in Offline Arabic Handwriting Recognition

We present a novel confidence-based discriminative training for model adaptation approach for an HMM based Arabic handwriting recognition system to handle different handwriting styles and their variations.Most current approaches are maximum-likelihood trained HMM systems and try to adapt their models to different writing styles using writer adaptive training, unsupervised clustering, or additional writer specific data.Discriminative training based on the Maximum Mutual Information criterion is used to train writer independent handwriting models. For model adaptation during decoding, an unsupervised confidence-based discriminative training on a word and frame level within a two-pass decoding process is proposed. Additionally, the training criterion is extended to incorporate a margin term.The proposed methods are evaluated on the IFN/ENIT Arabic handwriting database, where the proposed novel adaptation approach can decrease the word-error-rate by 33% relative.

[1]  Thomas Schaaf,et al.  Estimating confidence using word lattices , 1997, EUROSPEECH.

[2]  Georg Heigold,et al.  On the equivalence of Gaussian HMM and Gaussian HMM-like hidden conditional random fields , 2007, INTERSPEECH.

[3]  Alex Pentland,et al.  Discriminative, generative and imitative learning , 2002 .

[4]  Daniel Povey,et al.  Minimum Phone Error and I-smoothing for improved discriminative training , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Geoffrey Zweig,et al.  LATTICE-BASED UNSUPERVISED MLLR FOR SPEAKER ADAPTATION , 2000 .

[6]  Hermann Ney,et al.  Improved MLLR speaker adaptation using confidence measures for conversational speech recognition , 2000, INTERSPEECH.

[7]  Tasos Anastasakos,et al.  The use of confidence measures in unsupervised adaptation of speech recognizers , 1998, ICSLP.

[8]  Hermann Ney,et al.  Writer Adaptive Training and Writing Variant Model Refinement for Offline Arabic Handwriting Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[9]  Gernot A. Fink,et al.  Unsupervised Estimation of Writing Style Models for Improved Unconstrained Off-line Handwriting Recognition , 2006 .

[10]  Georg Heigold,et al.  Modified MMI/MPE: a direct evaluation of the margin in speech recognition , 2008, ICML '08.

[11]  Byoung-Tak Zhang,et al.  Self-Organizing Latent Lattice Models for Temporal Gene Expression Profiling , 2003, Machine Learning.

[12]  M Volker,et al.  ICDAR 2007 - Arabic Handwriting Recognition Competition , 2007 .

[13]  Yiming Yang,et al.  Modified Logistic Regression: An Approximation to SVM and Its Applications in Large-Scale Text Categorization , 2003, ICML.

[14]  GovindarajuVenu,et al.  Offline Arabic Handwriting Recognition , 2006 .

[15]  Volker Märgner,et al.  Arabic Handwriting Recognition Competition , 2005, ICDAR.

[16]  Michiel Bacchiani,et al.  Confidence scores for acoustic model adaptation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Daniel Povey,et al.  Discriminative training for HMM-based offline handwritten character recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  M. Pechwitz,et al.  IFN/ENIT: database of handwritten arabic words , 2002 .

[19]  Hermann Ney,et al.  Improved Modeling in Handwriting Recognition , 2009 .

[20]  Hermann Ney,et al.  White-space models for offline Arabic handwriting recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[21]  Horst Bunke,et al.  Hidden Markov model-based ensemble methods for offline handwritten text line recognition , 2008, Pattern Recognit..