We have taken an off-the-shelf, commercial continuous speech recogniser and conducted evaluations for the domain of Air Traffic Control. The language of this domain proved to be quite unrestricted, contrary to our initial intuitions. Our experiments show that constraints typically used by speech recognisers do not provide accurate enough results and need to be augmented with other knowledge sources and higher levels of linguistics in order to prove useful. We used three syntaxes based on a corpus of transmissions between the ATC and pilots in order to reflect differing levels of "linguistic" knowledge. Initial experiments demonstrate the benefit of a fully constrained context-free semantic grammar. Further experiments empirically show the benefit to recognition accuracy of using some form of dialogue management system to control the flow of discourse. A corpus-based statistical clustering approach to the segmentation of a dialogue into discourse segments is briefly discussed.
[1]
Eric Atwell,et al.
The Automated Evaluation of Inferred Word Classifications
,
1994,
ECAI.
[2]
J. H. Ward.
Hierarchical Grouping to Optimize an Objective Function
,
1963
.
[3]
Candace L. Sidner,et al.
Attention, Intentions, and the Structure of Discourse
,
1986,
CL.
[4]
Russell James Collingham,et al.
Towards an automatic speech recognition system for use by deaf students in lectures
,
1994
.
[5]
Clive Souter,et al.
Developing a Corpus -Based Grammar Model Within a Continuous Commercial Speech Recognition Package
,
1995
.
[6]
John Hughes,et al.
Automatically Acquiring a Classification of Words
,
1994
.
[7]
Eric Atwell,et al.
Pattern Recognition Applied to the Acquisition of a Grammatical Classification System From Unrestricted English Text
,
1987,
EACL.