Multi Domain Language Model Adaptation using Explicit Semantic Analysis

This paper presents an adaptive multi domain language model built from large sources of pre existing human created structured data. The sources’ structure is exploited to create a large array of ngram language models which are dynamically interpolated at decoding time to produce a context dependent language model that continuously adapts itself to the current domain. Because the use of human annotators is expensive and impractical we explore existing sources of human created structured data and how to extract our desired data from them. The language model is evaluated on its performance with a speech recognition system used to decode the Quaero 2009 evaluation data set. Compared to the baseline language model of our Quaero 2009 evaluation system our proposed adaptive language model reduces the WER of the speech recognition system by 0.5% absolute with some shows showing reductions of up to 14.4%. Index Terms: Speech Recognition, Language Model Adaptation, Explicit Semantic Analysis

[1]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[2]  A. Waibel,et al.  A one-pass decoder based on polymorphic linguistic context assignment , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[3]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[4]  Ronald Rosenfeld,et al.  Trigger-based language models: a maximum entropy approach , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  M. Wolfel,et al.  Minimum variance distortionless response spectral estimation , 2005, IEEE Signal Processing Magazine.

[6]  Jeff A. Bilmes,et al.  Factored Language Models and Generalized Parallel Backoff , 2003, NAACL.

[7]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[8]  Reinhard Kneser,et al.  On the dynamic adaptation of stochastic language models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[10]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[11]  Tanja Schultz,et al.  Unsupervised language model adaptation using latent semantic marginals , 2006, INTERSPEECH.

[12]  Alistair Moffat,et al.  Exploring the similarity space , 1998, SIGF.

[13]  Anthony J. Robinson,et al.  Language model adaptation using mixtures and an exponentially decaying cache , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  Renato De Mori,et al.  A Cache-Based Natural Language Model for Speech Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Lori Lamel,et al.  The translanguage English database (TED) , 1994, ICSLP.

[16]  Mark J. F. Gales,et al.  Semi-tied covariance matrices for hidden Markov models , 1999, IEEE Trans. Speech Audio Process..

[17]  Hermann Ney,et al.  Cross domain automatic transcription on the TC-STAR EPPS corpus , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..