Statistical language modeling combining N-gram and context-free grammars

Linguistic structure in the form of a partial-coverage phrase structure grammar is combined with statistical N-gram techniques. The result is a robust statistical grammar which explicitly incorporates linguistic and semantic structure. This approach makes it possible to model carefully those parts of the input that are important for an application and to use robust techniques that provide a full-coverage statistical language model. This approach is being applied to the recognition of air-traffic-control transmissions, and it has already been shown that a simpler hybrid approach is useful.<<ETX>>

[1]  David D. McDonald An Efficient Chart-based Algorithm for Partial-Parsing of Unrestricted Texts , 1992, ANLP.

[2]  Ulf Grenander,et al.  Parameter Estimation for Constrained Context-Free Language Models , 1992, HLT.

[3]  Damaris M. Ayuso,et al.  Gisting conversational speech , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Stephanie Seneff A relaxation method for understanding spontaneous speech utterances , 1992 .

[5]  Pascale Fung,et al.  The estimation of powerful language models from small and large corpora , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.