论文信息 - Improving Language Models by Learning from Speech Recognition Errors in a Reading Tutor that Listens

Improving Language Models by Learning from Speech Recognition Errors in a Reading Tutor that Listens

Lowering the perplexity of a language model does not always translate into higher speech recognition accuracy. Our goal is to improve language models by learning from speech recognition errors. In this paper we present an algorithm that first learns to predict which n–grams are likely to increase recognition errors, and then uses that prediction to improve language models so that the errors are reduced. We show that our algorithm reduces a measure of tracking error by more than 24% on unseen test data from a Reading Tutor that listens to children read aloud.

J. Beck | Jack Mostow | S. Banerjee | Wilson Tam

[1] Jack Mostow,et al. A Prototype Reading Coach that Listens , 1994, AAAI.

[2] Ronald Rosenfeld,et al. A maximum entropy approach to adaptive statistical language modelling , 1996, Comput. Speech Lang..

[3] Mingjing Li,et al. Discriminative training on language model , 2000, INTERSPEECH.

[4] Rong Zhang,et al. Word level confidence annotation using combinations of features , 2001, INTERSPEECH.

[5] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[6] Jack Mostow,et al. Predicting oral reading miscues , 2002, INTERSPEECH.

[7] Chin-Hui Lee,et al. Discriminative training of language models for speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8] Satanjeev Banerjee,et al. Training a confidence measure for a reading tutor that listens , 2003, INTERSPEECH.

[9] Satanjeev Banerjee,et al. Evaluating the effect of predicting oral reading miscues , 2003, INTERSPEECH.