POMDP-based Let's Go system for spoken dialog challenge

This paper describes a POMDP-based Let's Go system which incorporates belief tracking and dialog policy optimization into the dialog manager of the reference system for the Spoken Dialog Challenge (SDC). Since all components except for the dialog manager were kept the same, component-wise comparison can be performed to investigate the effect of belief tracking and dialog policy optimization on the overall system performance. In addition, since unsupervised methods have been adopted to learn all required models to reduce human labor and development time, the effectiveness of the unsupervised approaches compared to conventional supervised approaches can be investigated. The result system participated in the 2011 SDC and showed comparable performance with the base system which has been enhanced from the reference system for the 2010 SDC. This shows the capability of the proposed method to rapidly produce an effective system with minimal human labor and experts' knowledge.

[1]  Milica Gasic,et al.  Effective handling of dialogue state in the hidden information state POMDP-based dialogue manager , 2011, TSLP.

[2]  Maxine Eskénazi,et al.  Let's go public! taking a spoken dialog system to the real world , 2005, INTERSPEECH.

[3]  Maxine Eskénazi,et al.  An Unsupervised Approach to User Simulation: Toward Self-Improving Dialog Systems , 2012, SIGDIAL Conference.

[4]  Milica Gasic,et al.  The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management , 2010, Comput. Speech Lang..

[5]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevan e Ve tor Ma hine , 2001 .

[6]  Jason D. Williams Incremental partition recombination for efficient tracking of multiple dialog states , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Shie Mannor,et al.  Reinforcement learning with Gaussian processes , 2005, ICML.

[8]  Joelle Pineau,et al.  Spoken Dialogue Management Using Probabilistic Reasoning , 2000, ACL.

[9]  Maxine Eskénazi,et al.  Exploiting Machine-Transcribed Dialog Corpus to Improve Multiple Dialog States Tracking Methods , 2012, SIGDIAL Conference.

[10]  Jason D. Williams Exploiting the ASR n-best by tracking multiple dialog state hypotheses , 2008, INTERSPEECH.

[11]  Steve J. Young,et al.  Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems , 2010, Comput. Speech Lang..

[12]  Milica Gasic,et al.  Modelling user behaviour in the HIS-POMDP dialogue manager , 2008, 2008 IEEE Spoken Language Technology Workshop.

[13]  Michael E. Tipping,et al.  Fast Marginal Likelihood Maximisation for Sparse Bayesian Models , 2003 .

[14]  Maxine Eskénazi,et al.  Spoken Dialog Challenge 2010 , 2010, 2010 IEEE Spoken Language Technology Workshop.

[15]  Konstantinos Blekas,et al.  Value Function Approximation through Sparse Bayesian Modeling , 2011, EWRL.

[16]  Maxine Eskénazi,et al.  Spoken Dialog Challenge 2010: Comparison of Live and Control Test Results , 2011, SIGDIAL Conference.