Learning dialogue strategies within the Markov decision process framework

We introduce a stochastic model for dialogue systems based on the Markov decision process. Within this framework we show that the problem of dialogue strategy design can be stated as an optimization problem, and solved by a variety of methods, including the reinforcement learning approach. The advantages of this new paradigm include objective evaluation of dialogue systems and their automatic design and adaptation. We show some preliminary results on learning a dialogue strategy for an air travel information system.

[1]  Roberto Pieraccini,et al.  User Modeling For Spoken Dialogue , 1997 .

[2]  Richard R. Rosinski,et al.  Prompt constrained natural language-evolving the next generation of telephony services , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[3]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[4]  Pierre Dupont,et al.  A Cooperative spoken dialogue system based on a rational agent model: a first implementation on the AGS application , 1995 .

[5]  Roberto Pieraccini,et al.  User modeling for spoken dialogue system evaluation , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[6]  Marilyn A. Walker,et al.  PARADISE: A Framework for Evaluating Spoken Dialogue Agents , 1997, ACL.