Reinforcement Learning of Argumentation Dialogue Policies in Negotiation

We build dialogue system policies for negotiation, and in particular for argumentation. These dialogue policies are designed for negotiation against users of different cultural norms (individualists, collectivists, and altruists). In order to learn these policies we build simulated users (SUs), i.e. models that simulate the behavior of real users, and use Reinforcement Learning (RL). The SUs are trained on a spoken dialogue corpus in a negotiation domain, and then tweaked towards a particular cultural norm using hand-crafted rules. We evaluate the learned policies in a simulation setting. Our results are consistent with our SUs, in other words, the policies learn what they are designed to learn, which shows that RL is a promising technique for learning policies in domains, such as argumentation, that are more complex than standard slot-filling applications. Index Terms: spoken dialogue systems, reinforcement learning, simulated users, argumentation, negotiation, culture.

[1]  Kallirroi Georgila,et al.  An Annotation Scheme for Cross-Cultural Argumentation and Persuasion Dialogues , 2011, SIGDIAL Conference.

[2]  Jeanne M. Brett,et al.  A Cultural Analysis of the Underlying Assumptions of Negotiation Theory , 2005 .

[3]  R. Bhagat Culture's Consequences: Comparing Values, Behaviors, Institutions, and Organizations Across Nations , 2002 .

[4]  Michael English,et al.  Learning Mixed Initiative Dialog Strategies By Using Reinforcement Learning On Both Conversants , 2005, HLT.

[5]  Roie Zivan,et al.  POMDP based Negotiation Modeling , 2009 .

[6]  S. Young,et al.  Scaling POMDPs for Spoken Dialog Management , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Peter A. Heeman,et al.  Representing the Reinforcement Learning state in a negotiation dialogue , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[8]  Paul D. Allison,et al.  How Culture Induces Altruistic Behavior , 1992 .

[9]  Gary Geunbae Lee,et al.  Hybrid Approach to User Intention Modeling for Dialog Simulation , 2009, ACL/IJCNLP.

[10]  Kallirroi Georgila,et al.  Learning Culture-Specific Dialogue Models from Non Culture-Specific Data , 2011, HCI.

[11]  Kallirroi Georgila,et al.  User simulation for spoken dialogue systems: learning and evaluation , 2006, INTERSPEECH.

[12]  Oliver Lemon,et al.  Natural Language Generation as Planning Under Uncertainty for Spoken Dialogue Systems , 2009, EACL.

[13]  Kallirroi Georgila,et al.  Learning Dialogue Strategies from Older and Younger Simulated Users , 2010, SIGDIAL Conference.