This paper outlines the general strategies followed in developing the CMU (Carnegie Mellon University) speech understanding system. Our system is oriented toward the extraction of information relevant to a task. It uses a exible frame-based parser. Our system handles phenomena that are natural in spontaneous speech, for example, restarts, repeats and grammatically ill-formed utterances. It maintains a history of the key features of the dialogue. It can resolve elliptical, anaphoric and other indirect references. In this paper, we pay particular attention to how the context is modeled in our system. We will describe how the system handles corrections and queries that execeed its capabilities. We also address the issue of loose vs. tight coupling of speech recognition and natural language processing. The system has been used to model an Air Travel Information Service (ATIS) Task. In the November 92 DARPA Spoken Language Systems benchmark evaluation, the CMU ATIS system correctly answered 93.5% transcript inputs and 88.9% speech inputs. These were the best numbers reported for the evaluation.
[1]
David Stallard,et al.
The Semantic Linker- A New Fragment Combining Method
,
1993,
HLT.
[2]
Douglas E. Appelt,et al.
GEMINI: A Natural Language System for Spoken-Language Understanding
,
1993,
ACL.
[3]
Wayne H. Ward.
Understanding spontaneous speech: the Phoenix system
,
1991,
[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[4]
Victor Zue,et al.
Interactive Problem Solving and Dialogue in the ATIS Domain
,
1991,
HLT.
[5]
Mei-Yuh Hwang,et al.
Speech Understanding in Open Tasks
,
1992,
HLT.
[6]
Stephanie Seneff.
A relaxation method for understanding spontaneous speech utterances
,
1992
.
[7]
Wayne H. Ward,et al.
The CMU Air Travel Information Service: Understanding Spontaneous Speech
,
1990,
HLT.
[8]
Lewis M. Norton,et al.
A Portable Approach To Last Resort Parsing And Interpretation
,
1993,
HLT.