Trigger-Based Language Modeling using a Loss-Sensitive Perceptron Algorithm

Discriminative language models using n-gram features have been shown to be effective in reducing speech recognition word error rates. In this paper we describe a method for incorporating discourse-level triggers into a discriminative language model. Triggers are features identifying the recurrence of words within a conversation. We introduce triggers that are specific to particular unigrams and bigrams, as well as "back-off" trigger features that allow generalizations to be made across different unigrams. We train our model using a new loss-sensitive variant of the perceptron algorithm that makes effective use of information from multiple hypotheses in an n-best list. We train and test on the Switchboard data set and show a 0.5% absolute reduction in WER over a baseline discriminative model that uses n-gram features alone, and a 1.5% absolute reduction in WER over the baseline recognizer.
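To make the two ideas in the abstract concrete, the following is a minimal sketch, not the paper's exact formulation: it shows (a) word-specific and "back-off" unigram trigger features that fire when a hypothesized word re-occurs from earlier in the conversation, and (b) a loss-sensitive perceptron step over an n-best list in which the update is scaled by the WER gap between the model's current pick and the oracle (lowest-WER) hypothesis. All names (`trigger_features`, `loss_sensitive_update`, the feature keys) are illustrative assumptions, not identifiers from the paper.

```python
# Illustrative sketch only; the paper's actual features, update rule,
# and weight averaging may differ.
from collections import defaultdict

def trigger_features(hyp_words, history_words):
    """Unigram trigger features: count words in the hypothesis that
    also occurred earlier in the conversation."""
    feats = defaultdict(float)
    history = set(history_words)
    for w in hyp_words:
        if w in history:
            feats[("trigger", w)] += 1.0   # word-specific trigger
            feats[("trigger_BO",)] += 1.0  # back-off trigger shared across all words
    return feats

def loss_sensitive_update(weights, nbest, eta=1.0):
    """One perceptron step on an n-best list. `nbest` is a list of
    (features, wer) pairs; `weights` is a defaultdict(float). The update
    magnitude grows with the extra error the model's choice incurs."""
    score = lambda feats: sum(weights[k] * v for k, v in feats.items())
    best = max(nbest, key=lambda h: score(h[0]))  # model's current pick
    oracle = min(nbest, key=lambda h: h[1])       # lowest-WER hypothesis
    loss = best[1] - oracle[1]                    # extra errors incurred
    if loss > 0:
        for k, v in oracle[0].items():
            weights[k] += eta * loss * v          # promote oracle features
        for k, v in best[0].items():
            weights[k] -= eta * loss * v          # demote mistaken features
    return weights
```

In this sketch the loss scaling is what distinguishes the update from a standard n-best perceptron: hypotheses that are nearly as good as the oracle trigger small corrections, while badly erroneous picks trigger large ones.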