Error Detection and Recovery in Spoken Dialogue Systems

This paper describes our research on both the detection and subsequent resolution of recognition errors in spoken dialogue systems. The paper consists of two major components. The first half concerns the design of the error detection mechanism for resolving city names in our MERCURY flight reservation system, and an investigation of the behavioral patterns of users in subsequent subdialogues involving keypad entry for disambiguation. An important observation is that, upon a request for keypad entry, users are frequently unresponsive to the extent of waiting for a time-out or hanging up the phone. The second half concerns a pilot experiment investigating the feasibility of replacing the solicitation of a keypad entry with that of a “speak-and-spell” entry. A novelty of our work is the introduction of a speech synthesizer to simulate the user, which facilitates development and evaluation of our proposed strategy. We have found that the speak-and-spell strategy is quite effective in simulation mode, but it remains to be tested in real user dialogues.

[1]  H. Quast,et al.  RoBoDiMa: a dialog object based natural language speech dialog system , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[2]  Joseph Polifroni,et al.  PROMOTING PORTABILITY IN DIALOGUE MANAGEMENT , 2002 .

[3]  Roberto Pieraccini,et al.  AMICA: the AT&t mixed initiative conversational architecture , 1997, EUROSPEECH.

[4]  James R. Glass,et al.  Flexible and Personalizable Mixed-Initiative Dialogue Systems , 2003, HLT-NAACL 2003.

[5]  Roberto Pieraccini,et al.  User modeling for spoken dialogue system evaluation , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[6]  Victor Zue,et al.  Conversational interfaces: advances and challenges , 1997, Proceedings of the IEEE.

[7]  Josef G. Bauer,et al.  Accurate recognition of city names with spelling as a fall back strategy , 1999, EUROSPEECH.

[8]  Leigh Burstein,et al.  Data Collection , 1985 .

[9]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[10]  I. Scott MacKenzie,et al.  LetterWise: prefix-based disambiguation for mobile text input , 2001, UIST '01.

[11]  Stephanie Seneff,et al.  Empowering end users to personalize dialogue systems through spoken interaction , 2003, INTERSPEECH.

[12]  Joakim Gustafson,et al.  The August Spoken Dialogue System , 1999 .

[13]  James R. Glass,et al.  A probabilistic framework for feature-based speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[14]  Chris Schmandt,et al.  Putting people first: specifying proper names in speech interfaces , 1994, UIST '94.

[15]  James R. Glass,et al.  A multi-class approach for modelling out-of-vocabulary words , 2002, INTERSPEECH.

[16]  Stephanie Seneff,et al.  Automatic Acquisition of Names Using Speak and Spell Mode in Spoken Dialogue Systems , 2003, NAACL.

[17]  Arne Jönsson,et al.  AN ARCHITECTURE FOR MULTI-MODAL NATURAL DIALOGUE SYSTEMS , 2000 .

[18]  Victor Zue,et al.  JUPlTER: a telephone-based conversational interface for weather information , 2000, IEEE Trans. Speech Audio Process..

[19]  Hauke Schramm,et al.  Strategies for name recognition in automatic directory assistance systems , 2000, Speech Commun..

[20]  Stephanie Seneff,et al.  ANGIE: a new framework for speech analysis based on morpho-phonological modelling , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[21]  Stephanie Seneff,et al.  Dialogue Management in the Mercury Flight Reservation System , 2000 .

[22]  Stephanie Seneff,et al.  Response planning and generation in the MERCURY flight reservation system , 2002, Comput. Speech Lang..

[23]  James Raymond Davis Let Your Fingers do the Spelling: Implicit disambiguation of words spelled with the telephone keypad , 1991 .

[24]  Matthias Denecke Rapid Prototyping for Spoken Dialogue Systems , 2002, COLING.

[25]  Gregory A. Sanders,et al.  DARPA communicator dialog travel planning systems: the june 2000 data collection , 2001, INTERSPEECH.