This paper concerns the handling of out-of-vocabulary (OOV) words in the JUPITER weather information system. Specifically our objective is to deal with weather queries regarding unknown cities. We have implemented a system which can detect the presence of an unknown city name, and immediately propose a plausible spelling for that city. Potentially, the city can be dynamically incorporated into the recognizer lexicon. The three-stage system described in [1] was implemented in the JUPITER domain, and this paper will detail the development of a system that uses an ANGIE-based framework to model both spelling and pronunciation simultaneously, and uses automatically derived novel lexical units in the first stage. We report results on an independent test set containing unknown cities. Compared with a single-stage baseline, word error was reduced by 29.3% (from 24.6% to 17.4%) and understanding error was reduced by 67.5% (from 67.0% to 21.8%) on the three-stage configuration.
[1]
Stephanie Seneff,et al.
Towards multi-domain speech understanding using a two-stage recognizer
,
1999,
EUROSPEECH.
[2]
Victor Zue,et al.
JUPlTER: a telephone-based conversational interface for weather information
,
2000,
IEEE Trans. Speech Audio Process..
[3]
Stephanie Seneff,et al.
ANGIE: a new framework for speech analysis based on morpho-phonological modelling
,
1996,
Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[4]
Stephanie Seneff,et al.
TINA: A Natural Language System for Spoken Language Applications
,
1992,
Comput. Linguistics.
[5]
Grace Chung.
A three-stage solution for flexible vocabulary speech understanding
,
2000,
INTERSPEECH.