Predicting Barge-in Utterance Errors by using Implicitly-Supervised ASR Accuracy and Barge-in Rate per User

Modeling of individual users is a promising way of improving the performance of spoken dialogue systems deployed for the general public and utilized repeatedly. We define "implicitly-supervised" ASR accuracy per user on the basis of responses following the system's explicit confirmations. We combine the estimated ASR accuracy with the user's barge-in rate, which represents how well the user is accustomed to using the system, to predict interpretation errors in barge-in utterances. Experimental results showed that the estimated ASR accuracy improved prediction performance. Since this ASR accuracy and the barge-in rate are obtainable at runtime, they improve prediction performance without the need for manual labeling.

[1]  Katsuhito Sudoh,et al.  Post-dialogue confidence scoring for unsupervised statistical language model training , 2005, Speech Commun..

[2]  Tatsuya Kawahara,et al.  User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance , 2004, User Modeling and User-Adapted Interaction.

[3]  Kiyohiro Shikano,et al.  Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs , 2004, INTERSPEECH.

[4]  Hong Kook Kim,et al.  A hybrid barge-in procedure for more reliable turn-taking in human-machine dialog systems , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[5]  Marilyn A. Walker,et al.  Automatic Detection of Poor Speech Recognition at the Dialogue Level , 1999, ACL.

[6]  Alexander I. Rudnicky,et al.  Implicitly-supervised Learning in Spoken Language Interfaces: an Application to the Confidence Annotation Problem , 2007, SIGDIAL.

[7]  Tatsuya Kawahara,et al.  Analyzing temporal transition of real user's behaviors in a spoken dialogue system , 2007, INTERSPEECH.

[8]  Tatsuya Kawahara,et al.  Predicting ASR errors by exploiting barge-in rate of individual users for spoken dialogue systems , 2008, INTERSPEECH.

[9]  Maxine Eskénazi,et al.  Doing research on a deployed spoken dialogue system: one year of let's go! experience , 2006, INTERSPEECH.

[10]  KawaharaTatsuya,et al.  User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance , 2005 .

[11]  Marilyn A. Walker,et al.  Learning to Predict Problematic Situations in a Spoken Dialogue System: Experiments with How May I Help You? , 2000, ANLP.