Developing City Name Acquisition Strategies in Spoken Dialogue Systems Via User Simulation

This paper describes our recent work on mechanisms for error recovery in spoken dialogue systems. We focus on the acquisition of city names and dates in the flight reservation domain. We are specifically interested in addressing the issue of acquiring out-of-vocabulary city names through a speak-and-spell mode subdialogue. In order to explore various dialogue strategies, we developed a user simulation system, which includes a configurable simulated user and a novel method of utterance generation. The latter utilizes a concatenative speech synthesizer, along with an existing corpus of dialogues, to produce a large variety of simulated inputs. The results from various simulated user configurations are presented, along with a discussion of how the simulated user facilitates the debugging of dialogue strategies and the discovery of situations unanticipated by the system developer.

[1]  Stephanie Seneff,et al.  Response planning and generation in the MERCURY flight reservation system , 2002, Comput. Speech Lang..

[2]  James R. Glass,et al.  A multi-class approach for modelling out-of-vocabulary words , 2002, INTERSPEECH.

[3]  Arne Jönsson,et al.  AN ARCHITECTURE FOR MULTI-MODAL NATURAL DIALOGUE SYSTEMS , 2000 .

[4]  Victor Zue,et al.  JUPlTER: a telephone-based conversational interface for weather information , 2000, IEEE Trans. Speech Audio Process..

[5]  Stephanie Seneff,et al.  A dynamic vocabulary spoken dialogue interface , 2004, INTERSPEECH.

[6]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[7]  Joakim Gustafson,et al.  The august spoken dialogue system , 1999, EUROSPEECH.

[8]  James R. Glass,et al.  Information-theoretic criteria for unit selection synthesis , 2002, INTERSPEECH.

[9]  Min Tang,et al.  Combining linguistic knowledge and acoustic information in automatic pronunciation lexicon generation , 2004, INTERSPEECH.

[10]  James R. Glass,et al.  Flexible and Personalizable Mixed-Initiative Dialogue Systems , 2003, HLT-NAACL 2003.

[11]  Konrad Scheffler,et al.  Probabilistic simulation of human-machine dialogues , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[12]  Joseph Polifroni,et al.  PROMOTING PORTABILITY IN DIALOGUE MANAGEMENT , 2002 .

[13]  H. Quast,et al.  RoBoDiMa: a dialog object based natural language speech dialog system , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[14]  Stephanie Seneff,et al.  ORION: from on-line interaction to off-line delegation , 2000, INTERSPEECH.

[15]  Roberto Pieraccini,et al.  AMICA: the AT&t mixed initiative conversational architecture , 1997, EUROSPEECH.

[16]  William I. Hallahan DECtalk Software: Text-to-Speech Technology and Implementation , 1995, Digit. Tech. J..

[17]  Gregory A. Sanders,et al.  DARPA communicator dialog travel planning systems: the june 2000 data collection , 2001, INTERSPEECH.

[18]  Hermann Ney,et al.  Confidence measures for large vocabulary continuous speech recognition , 2001, IEEE Trans. Speech Audio Process..

[19]  Matthias Denecke Rapid Prototyping for Spoken Dialogue Systems , 2002, COLING.

[20]  Grace Chung,et al.  Developing a Flexible Spoken Dialog System Using Simulation , 2004, ACL.

[21]  Roberto Pieraccini,et al.  User modeling for spoken dialogue system evaluation , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[22]  Josef G. Bauer,et al.  Accurate recognition of city names with spelling as a fall back strategy , 1999, EUROSPEECH.

[23]  Alexander H. Waibel,et al.  Exploiting repair context in interactive error recovery , 1997, EUROSPEECH.

[24]  Stephanie Seneff,et al.  Error Detection and Recovery in Spoken Dialogue Systems , 2004, HLT-NAACL 2004.

[25]  Joseph Polifroni,et al.  Recognition confidence scoring and its use in speech understanding systems , 2002, Comput. Speech Lang..

[26]  Victor Zue,et al.  GALAXY-II: a reference architecture for conversational system development , 1998, ICSLP.

[27]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.