Spontaneous speech dialogue system TOSBURG II and its evaluation

Abstract We have developed a spontaneous speech dialogue system TOSBURG II, employing keyword-based spontaneous speech understanding and multimodal response generation, with adaptive speech response cancellation. Since in multimodal interaction, the user understands the system's response by a visual output before its speech response is completed, the user often interrupts the system's speech response. Therefore, our adaptive speech response cancellation serves to facilitate natural human-computer interaction by allowing the user's interruption. We have also developed an evaluation environment for dialogue data collection and the performance of TOSBURG II. Unlike conventional data collection systems, TOSBURG II collects in this environment not only speech data and the final results of speech understanding but also its intermediate results as dialogue data, to use them for the evaluation and improvement of the system. The results of our dialogue experiments using TOSBURG II prove the effectiveness of adaptive speech response cancellation for natural interaction, confirming that the dialogue data and the evaluation environment will contribute to a further development of spontaneous speech dialogue systems.

[1]  Jakob Nielsen,et al.  Iterative user-interface design , 1993, Computer.

[2]  Victor Zue,et al.  Experiments in Evaluating Interactive Spoken Language Systems , 1992, HLT.

[3]  Kiyohiro Shikano,et al.  Very-large-vocabulary continuous speech recognition algorithm for telephone directory assistance , 1993, EUROSPEECH.

[4]  Lynette Hirschman,et al.  Multi-Site Data Collection for a Spoken Language Corpus , 1992, HLT.

[5]  J. Mariani Spoken Language Processing in the Framework of Human-Machine Communication at LIMSI , 1992, HLT.

[6]  Jean-Pierre Tubach,et al.  A system for natural spoken language queries design, implementation and assessment , 1991, EUROSPEECH.

[7]  Jeremy Peckham,et al.  Speech understanding and dialogue over the telephone: an overview of progress in the sundial project , 1991, EUROSPEECH.

[8]  Victor Zue,et al.  PEGASUS: A spoken dialogue interface for on-line air travel planning , 1994, Speech Communication.

[9]  Yoichi Takebayashi,et al.  Noisy spontaneous speech understanding using noise immunity keyword spotting with adaptive speech response cancellation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Elisabetta Gerbino,et al.  Test and evaluation of a spoken dialogue system , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Elizabeth Shriberg,et al.  Human-Machine Problem Solving Using Spoken Language Systems (SLS): Factors Affecting Performance and User Satisfaction , 1992, HLT.

[12]  Tetsunori Kobayashi,et al.  ASJ continuous speech corpus for research , 1992 .

[13]  Michael Bader,et al.  The HCRC Map Task Corpus: A Natural Spoken Dialogue Corpus , 1993 .

[14]  Pascale Fung,et al.  The BBN/HARC spoken language understanding system , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Hideki Hashimoto,et al.  A real-time speech dialogue system using spontaneous speech understanding , 1992, ICSLP.

[16]  Roger Moore,et al.  Experiences Collecting Genuine Spoken Enquiries using WOZ Techniques , 1992, HLT.