Training Bayesian networks for realistic man-machine spoken dialogue simulation

Data collection and annotation are generally required to design or assess spoken dialogue systems. Yet, this is a very time consuming and expensive process. For these reasons, user simulation has become an important trend of research in the field of spoken dialogue systems. The general problem of user simulation is thus to produce as many as necessary natural, various and consistent interactions from as few data as possible. In this paper, we propose a user simulation method based on Bayesian networks (BN) that is able to produce consistent interactions in terms of user goal and dialogue history. The model as been introduced in previous work but parameters were hand-tuned and it was assessed in the framework of automatic learning of optimal dialogue strategies. In this paper, the BN is trained on a database of 1234 human-machine dialogues in the TownInfo domain (a tourist information application). Experiments with a state-of-the-art dialogue system (REALL-DUDE/DIPPER/OAA) have been realized and results in terms of dialog statistics are presented.

[1]  Olivier Pietquin,et al.  ASR system modeling for automatic evaluation and optimization of dialogue systems , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Steve J. Young,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[3]  Thierry Dutoit,et al.  A probabilistic framework for dialog simulation and optimal strategy learning , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Oliver Lemon,et al.  DIPPER: Description and Formalisation of an Information-State Update Dialogue System Architecture , 2003, SIGDIAL Workshop.

[5]  Roberto Pieraccini,et al.  A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[6]  Steve J. Young,et al.  A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies , 2006, The Knowledge Engineering Review.

[7]  Roberto Pieraccini,et al.  User modeling for spoken dialogue system evaluation , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[8]  Thierry Dutoit,et al.  Dynamic Bayesian Networks for NLU Simulation with Applications to Dialog Optimal Strategy Learning , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[9]  Ramón López-Cózar,et al.  Testing the performance of spoken dialogue systems by means of an artificially simulated user , 2006, Artificial Intelligence Review.