Speaker turn characterization for spoken dialog system monitoring and adaptation

This paper describes an utterance classification method based on a multiple decoding scheme. We use the Spoken Language Understanding (SLU) strategy proposed within the European project LUNA. The goal of this classification process is to characterize each speaker's turn, in a dialog context, according to different categories relevant from an SLU point of view: out-of-domain messages, requests not covered by the interpretation module, frequent requests,.... These categories are used for two purposes in an off-line mode: system monitoring for detecting changes in users' behaviour and system adaptation by selecting dialogs likely to contain some phenomenon poorly covered by the models for an active learning scheme. All the models and the evaluations are performed on the France Telecom FT3000 corpus.

[1]  Deepak Agarwal,et al.  Mining customer care dialogs for "Daily News" , 2005, IEEE Transactions on Speech and Audio Processing.

[2]  Gökhan Tür,et al.  Automated wizard-of-oz for spoken dialogue systems , 2005, INTERSPEECH.

[3]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[4]  Frédéric Béchet,et al.  Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[5]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[6]  Allen L. Gorin,et al.  Dialog trajectory analysis , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Dilek Z. Hakkani-Tür,et al.  Active learning for automatic speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.