Writer Adaptive Training and Writing Variant Model Refinement for Offline Arabic Handwriting Recognition

We present a writer adaptive training and writer clustering approach for an HMM based Arabic handwriting recognition system to handle different handwriting styles and their variations. Additionally, a writing variant model refinement for specific writing variants is proposed.Current approaches try to compensate the impact of different writing styles during preprocessing and normalization steps.Writer adaptive training with a CMLLR based feature adaptation is used to train writer dependent models. An unsupervised writer clustering with Bayesian information criterion based stopping condition for a CMLLR based feature adaptation during a two-pass decoding process is used to cluster different handwriting styles of unknown test writers.The proposed methods are evaluated on the IFN/ENIT Arabic handwriting database.

[1]  Vuokko Vuori,et al.  Clustering writing styles with a self-organizing map , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[2]  Stefan Jäger,et al.  Arabic and Chinese Handwriting Recognition - SACH 2006 Summit College Park, MD, USA, September 27-28, 2006 Selected Papers , 2008, SACH.

[3]  M. Pechwitz,et al.  IFN/ENIT: database of handwritten arabic words , 2002 .

[4]  Gernot A. Fink,et al.  Unsupervised Estimation of Writing Style Models for Improved Unconstrained Off-line Handwriting Recognition , 2006 .

[5]  Alexander H. Waibel,et al.  Dictionary learning for spontaneous speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[6]  Hermann Ney,et al.  Improved Modeling in Handwriting Recognition , 2009 .

[7]  Thierry Paquet,et al.  Handwritten Document Analysis for Automatic Writer Recognition , 2005 .

[8]  Hermann Ney,et al.  White-space models for offline Arabic handwriting recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[9]  Marcus Liwicki,et al.  Recognition of handwritten historical documents: HMM-adaptation vs. writer specific training , 2008 .

[10]  S. Chen,et al.  Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion , 1998 .

[11]  Horst Bunke,et al.  Hidden Markov model-based ensemble methods for offline handwritten text line recognition , 2008, Pattern Recognit..

[12]  M Volker,et al.  ICDAR 2007 - Arabic Handwriting Recognition Competition , 2007 .

[13]  Volker Märgner,et al.  Arabic Handwriting Recognition Competition , 2005, ICDAR.

[14]  Samy Bengio,et al.  Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition , 2002, Pattern Recognit. Lett..

[15]  Mark J. F. Gales,et al.  Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..