Recognition Confidence Scoring for Use in Speech Understanding Systems

In this paper we present an approach to recognition confidence scoring and a method for integrating confidence scores into the understanding and dialogue components of a speech understanding system. The system uses a multi-tiered approach in which confidence scores are computed at the phonetic, word, and utterance levels. The scores are produced by extracting confidence features from the computation of the recognition hypotheses and processing these features with an accept/reject classifier for word and utterance hypotheses. The output of the confidence classifiers can then be incorporated into the parsing mechanism of the language understanding component. To evaluate the system, experiments were conducted using the JUPITER weather information system. Evaluation was performed at the understanding level, using key-value pair concept error rate as the evaluation metric. When confidence scores were integrated into the understanding component of the system, the concept error rate was reduced by over 35%.
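To make the evaluation metric concrete, the following is a minimal sketch of one way a key-value pair concept error rate could be computed. It is an illustration under stated assumptions, not the paper's scoring code: it assumes each utterance is reduced to a flat mapping with at most one value per key, and that the rate is the sum of substituted, inserted, and deleted pairs divided by the number of reference pairs. The function name `concept_error_rate` and the toy key-value pairs are hypothetical.

```python
def concept_error_rate(reference, hypothesis):
    """Return (substitutions + insertions + deletions) / number of reference pairs.

    Assumption: both arguments are dicts mapping a concept key to one value.
    """
    if not reference:
        raise ValueError("reference must contain at least one key-value pair")
    # Substitution: key present in both, but the values disagree.
    subs = sum(1 for k, v in hypothesis.items()
               if k in reference and reference[k] != v)
    # Insertion: key hypothesized but absent from the reference.
    ins = sum(1 for k in hypothesis if k not in reference)
    # Deletion: reference key missing from the hypothesis.
    dels = sum(1 for k in reference if k not in hypothesis)
    return (subs + ins + dels) / len(reference)


# Toy, made-up data for illustration only (not taken from the paper).
reference = {"topic": "weather", "city": "boston", "date": "tomorrow"}
hypothesis = {"topic": "weather", "city": "austin"}

cer = concept_error_rate(reference, hypothesis)
print(f"concept error rate: {cer:.3f}")
```

In this toy case the hypothesis has one substituted pair (`city`) and one deleted pair (`date`) against three reference pairs. A metric of this kind scores what the system understood rather than the raw word string, which is why it suits evaluation at the understanding level.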
