GEMINI: A Natural Language System for Spoken-Language Understanding

The demands on a natural language understanding system used for spoken language differ somewhat from the demands of text processing. For processing spoken language, there is a tension between the system being as robust as necessary and as constrained as possible. A robust system will attempt to find as sensible an interpretation as possible, even in the presence of performance errors by the speaker or recognition errors by the speech recognizer. In contrast, in order to provide language constraints to a speech recognizer, a system should be able to detect that a recognized string is not a sentence of English and to disprefer that recognition hypothesis. If the coupling is to be tight, with parsing and recognition interleaved, then the parser should be able to enforce as many constraints as possible for partial utterances. The approach taken in Gemini is to tightly constrain language recognition to limit overgeneration, but to extend the language analysis to recognize certain patterns that are characteristic of spoken utterances (though not generally thought of as part of grammar) and to recognize specific types of performance errors by the speaker.
