Integration of Statistical Dialog Management Techniques to Implement Commercial Dialog Systems

In this paper we present a proposal to develop commercial dialog systems that avoids the effort of manually defining the dialog strategy for the dialog manager and also takes into account the benefits of using standards like VoiceXML. In our proposal the dialog manager is trained using a labeled dialog corpus, and selects the next system response considering a classification process based on neural networks that takes into account the complete dialog history. Thus, system developers only need to define a set of VoiceXML files, each including a system prompt and the associated grammar to recognize user responses. The statistical dialog model automatically selects the next system prompt.We have applied this technique to develop a dialog system in VoiceXML that provides railway information in Spanish.

[1]  H. Cuayahuitl,et al.  Human-computer dialogue simulation using hidden Markov models , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..

[2]  Ralph Arnote,et al.  Hong Kong (China) , 1996, OECD/G20 Base Erosion and Profit Shifting Project.

[3]  David Griol,et al.  A statistical approach to spoken dialog systems design and evaluation , 2008, Speech Commun..

[4]  Thierry Dutoit,et al.  Aided design of finite-state dialogue management systems , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[5]  David Griol,et al.  A stochastic finite-state transducer approach to spoken dialog management , 2010, INTERSPEECH.

[6]  Michael F. McTear,et al.  Book Review , 2005, Computational Linguistics.

[7]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[8]  David Suendermann-Oeft,et al.  Are We There Yet? Research in Commercial Spoken Dialog Systems , 2009, TSD.

[9]  Richard Fikes,et al.  The role of frame-based representation in reasoning , 1985, CACM.

[10]  R. Ravindra Kumar,et al.  Automatic Speech Segmentation and Multi Level Labeling Tool , 2011, ICIS 2011.

[11]  Steve J. Young,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[12]  Roberto Pieraccini,et al.  A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[13]  David Griol,et al.  A statistical dialog manager for the LUNA project , 2009, INTERSPEECH.

[14]  Roberto Pieraccini,et al.  Automating spoken dialogue management design using machine learning: An industry perspective , 2008, Speech Commun..

[15]  Joelle Pineau,et al.  Spoken Dialogue Management Using Probabilistic Reasoning , 2000, ACL.

[16]  Eric Horvitz,et al.  Conversation as Action Under Uncertainty , 2000, UAI.

[17]  M. Drenth San Juan, Puerto Rico , 2001 .

[18]  Hui Ye,et al.  The Hidden Information State Approach to Dialog Management , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[19]  Salvador España Boquera,et al.  Efficient BP Algorithms for General Feedforward Neural Networks , 2007, IWINAC.

[20]  Ramón López-Cózar,et al.  A Comparison between Dialog Corpora Acquired with Real and Simulated Users , 2009, SIGDIAL Conference.

[21]  Kallirroi Georgila,et al.  User simulation for spoken dialogue systems: learning and evaluation , 2006, INTERSPEECH.

[22]  Steve J. Young,et al.  A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies , 2006, The Knowledge Engineering Review.