Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy

This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives. Thus the system has to bridge this knowledge gap and also provide the users with an appropriate alternative together with the reason for this recommendation through dialogue. We present a dialogue state model for such decision making dialogue. To evaluate this model, we implement a trial sightseeing guidance system and collect dialogue data. Then, we optimize the dialogue strategy based on the state model through reinforcement learning with a natural policy gradient approach using a user simulator trained on the collected dialogue corpus.

[1]  Stephanie Seneff,et al.  Dialogue Management in the Mercury Flight Reservation System , 2000 .

[2]  Oliver Lemon,et al.  Adaptive natural language generation in dialogue using reinforcement learning , 2008 .

[3]  Satoshi Nakamura,et al.  Construction and Experiment of a Spoken Consulting Dialogue System , 2010, IWSDS.

[4]  Maxine Eskénazi,et al.  ONLINE SUPERVISED LEARNING OF NON-UNDERSTANDING RECOVERY POLICIES , 2006, 2006 IEEE Spoken Language Technology Workshop.

[5]  Marilyn A. Walker,et al.  PARADISE: A Framework for Evaluating Spoken Dialogue Agents , 1997, ACL.

[6]  Satoshi Nakamura,et al.  Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems , 2009, INTERSPEECH.

[7]  Steve J. Young,et al.  Error simulation for training statistical dialogue systems , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[8]  Alexander I. Rudnicky,et al.  Task and domain specific modelling in the Carnegie Mellon communicator system , 2000, INTERSPEECH.

[9]  Thomas L. Saaty,et al.  Multicriteria Decision Making: The Analytic Hierarchy Process: Planning, Priority Setting, Resource Allocation , 1990 .

[10]  Kallirroi Georgila,et al.  Evaluating the Effectiveness of Information Presentation in a Full End-To-End Dialogue System , 2009, SIGDIAL Conference.

[11]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[12]  Thierry Dutoit,et al.  A probabilistic framework for dialog simulation and optimal strategy learning , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Roberto Pieraccini,et al.  A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[14]  Hui Ye,et al.  Agenda-Based User Simulation for Bootstrapping a POMDP Dialogue System , 2007, NAACL.

[15]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[16]  Steve J. Young,et al.  Bayesian update of dialogue state for robust dialogue systems , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Hong-Kwang Jeff Kuo,et al.  Dialogue management in the Bell Labs communicator system , 2000, INTERSPEECH.

[18]  KawaharaTatsuya,et al.  User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance , 2005 .

[19]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[20]  Joseph Polifroni,et al.  Intensional Summaries as Cooperative Responses in Dialogue: Automation and Evaluation , 2008, ACL.

[21]  Olivier Pietquin,et al.  Training Bayesian networks for realistic man-machine spoken dialogue simulation , 2009 .

[22]  Renato De Mori,et al.  Feature-based summary space for stochastic dialogue modeling with hierarchical semantic frames , 2009, INTERSPEECH.

[23]  Tatsuya Kawahara,et al.  Detection of feeling through back-channels in spoken dialogue , 2008, INTERSPEECH.

[24]  Jean-Luc Gauvain,et al.  User evaluation of the MASK kiosk , 1998, Speech Commun..

[25]  Tatsuya Kawahara,et al.  User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance , 2004, User Modeling and User-Adapted Interaction.

[26]  Tatsuya Kawahara,et al.  A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts , 2006, INTERSPEECH.

[27]  Ryuichiro Higashinaka,et al.  Dialogue Control Algorithm for Ambient Intelligence based on Partially Observable Markov Decision Processes , 2010 .

[28]  Stefan Schaal,et al.  Natural Actor-Critic , 2003, Neurocomputing.

[29]  Satoshi Nakamura,et al.  Annotating Dialogue Acts to Construct Dialogue Systems for Consulting , 2009, ALR7@IJCNLP.

[30]  Oliver Lemon,et al.  Natural Language Generation as Planning Under Uncertainty for Spoken Dialogue Systems , 2009, EACL.

[31]  Norihito Yasuda,et al.  Efficient spoken dialogue control depending on the speech recognition rate and system's database , 2003, INTERSPEECH.

[32]  Lou Boves,et al.  Issues in Spoken Dialogue Systems: Experiences with the Dutch Arise System , 2000 .

[33]  Jean-Luc Gauvain,et al.  The LIMSI ARISE system for train travel information , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[34]  S. Young,et al.  Scaling POMDPs for Spoken Dialog Management , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[35]  Sebastian Thrun,et al.  Monte Carlo POMDPs , 1999, NIPS.

[36]  Maxine Eskénazi,et al.  Let's go public! taking a spoken dialog system to the real world , 2005, INTERSPEECH.