Evaluation of a Fully Automatic Cooperative Persuasive Dialogue System

In this paper, we construct and evaluate a fully automated text-based cooperative persuasive dialogue system, which is able to persuade the user to take a specific action while maintaining user satisfaction. In our previous works, we created a dialogue management module for cooperative persuasive dialogue (Hiraoka et al., Reinforcement learning of cooperative persuasive dialogue policies using framing, Proceedings of international conference on computational linguistics (COLING), 2014), but only evaluated it in a wizard-of-Oz setting, as we did not have the capacity for natural language generation (NLG) and natural language understanding (NLU). In this work, the main technical contribution is the design of the NLU and the NLG modules which allows us to remove this bottleneck and create the first fully automatic cooperative persuasive dialogue system. Based on this system, we performed an evaluation with real users. Experimental results indicate that the learned policy is able to effectively persuade the users: the reward of the proposed model is much higher than baselines and almost the same as a dialogue manager controlled by a human. This tendency is almost the same as our previous evaluation using a wizard-of-Oz framework (Hiraoka et al., Reinforcement learning of cooperative persuasive dialogue policies using framing, Proceedings of international conference on computational linguistics (COLING), 2014), demonstrates that the proposed NLU and NLG modules are effective for cooperative persuasive dialogue.

[1]  Steve J. Young,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[2]  Peter A. Heeman,et al.  Representing the Reinforcement Learning state in a negotiation dialogue , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[3]  Tomoki Toda,et al.  Dialogue management for leading the conversation in persuasive dialogue systems , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[4]  Kallirroi Georgila,et al.  Reinforcement Learning of Argumentation Dialogue Policies in Negotiation , 2011, INTERSPEECH.

[5]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[6]  Junichi Yamagishi,et al.  Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System Using Deep Recurrent Neural Networks , 2016, INTERSPEECH.

[7]  Schneider,et al.  All Frames Are Not Created Equal: A Typology and Critical Analysis of Framing Effects. , 1998, Organizational behavior and human decision processes.

[8]  Kallirroi Georgila Reinforcement Learning of Two-Issue Negotiation Dialogue Policies , 2013, SIGDIAL Conference.

[9]  YoungSteve,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007 .

[10]  Kallirroi Georgila,et al.  Reinforcement Learning of Multi-Issue Negotiation Dialogue Policies , 2015, SIGDIAL Conference.

[11]  Gary Geunbae Lee,et al.  Example-based dialog modeling for practical multi-domain dialog system , 2009, Speech Commun..

[12]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[13]  Mirella Lapata,et al.  Proceedings of EMNLP 2004 , 2004 .

[14]  M. I. Jordan Leo Breiman , 2011, 1101.0929.

[15]  Yuji Matsumoto,et al.  Applying Conditional Random Fields to Japanese Morphological Analysis , 2004, EMNLP.

[16]  Stefan Schaal,et al.  Natural Actor-Critic , 2003, Neurocomputing.

[17]  Fiorella de Rosis,et al.  Artifices for Persuading to Improve Eating Habits , 2006, AAAI Spring Symposium: Argumentation for Consumers of Healthcare.

[18]  Tomoki Toda,et al.  Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing , 2014, COLING.

[19]  Martin A. Riedmiller Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.

[20]  Roie Zivan,et al.  POMDP based Negotiation Modeling , 2009 .

[21]  Tomoki Toda,et al.  Construction and Analysis of a Persuasive Dialogue Corpus , 2014, IWSDS.