An integrated grammar/bigram language model using path scores

This paper describes a language model in which context-free grammar rules are integrated into an n-gram framework, complementing it rather than attempting to replace it. This frees the grammar from the aim of parsing whole sentences (which is often undesirable as well as unrealistic), allowing it to be employed selectively in modelling phrases that are identifiable within a flow of speech. Algorithms for model training and for sentence scoring and interpretation are described. All are based on the principle of summing over paths that span the sentence, but the implementation is node-based for efficiency. Perplexity results for this system (using a hierarchy of grammars from empty to full-coverage) are compared with those for n-gram models, and the system is used for re-scoring N-best sentence lists from a speaker-independent recogniser.
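To illustrate the path-summing principle mentioned above, the sketch below shows how a sum over all paths spanning a sentence can be accumulated node by node (at word boundaries) rather than by enumerating paths explicitly. This is a minimal illustration under assumptions, not the paper's actual algorithm: `segment_score` is a hypothetical stand-in for whatever combination of bigram and grammar-derived phrase scores the model assigns to a span.

```python
# Illustrative sketch only: segment_score() is a hypothetical stand-in for
# the model's combined bigram/grammar score of a word span.
from typing import Callable, Sequence


def sentence_score(words: Sequence[str],
                   segment_score: Callable[[Sequence[str]], float]) -> float:
    """Sum over all paths (segmentations) spanning the sentence.

    Enumerating segmentations directly is exponential in sentence length;
    instead a forward score is kept at each node (word boundary), so each
    node accumulates the scores of all paths reaching it -- the node-based
    implementation of the path sum.
    """
    n = len(words)
    forward = [0.0] * (n + 1)
    forward[0] = 1.0  # the empty prefix has score 1
    for end in range(1, n + 1):
        total = 0.0
        for start in range(end):
            # Every path reaching 'start' extends by the segment
            # words[start:end]; summing keeps all paths, not just the best.
            total += forward[start] * segment_score(words[start:end])
        forward[end] = total
    return forward[n]


if __name__ == "__main__":
    # Toy segment model (assumption): shorter segments are more probable.
    toy = lambda seg: 0.5 ** len(seg)
    print(sentence_score("show me flights to boston".split(), toy))
```

The same recursion applies when path scores mix n-gram transitions with grammar-modelled phrases; only the definition of the span score changes.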