论文信息 - Decoupling Strategy and Generation in Negotiation Dialogues - 字舞流文

Decoupling Strategy and Generation in Negotiation Dialogues

We consider negotiation settings in which two agents use natural language to bargain on goods. Agents need to decide on both high-level strategy (e.g., proposing \$50) and the execution of that strategy (e.g., generating "The bike is brand new. Selling for just \$50."). Recent work on negotiation trains neural models, but their end-to-end nature makes it hard to control their strategy, and reinforcement learning tends to lead to degenerate solutions. In this paper, we propose a modular approach based on coarse di- alogue acts (e.g., propose(price=50)) that decouples strategy and generation. We show that we can flexibly set the strategy using supervised learning, reinforcement learning, or domain-specific knowledge without degeneracy, while our retrieval-based generation can maintain context-awareness and produce diverse utterances. We test our approach on the recently proposed DEALORNODEAL game, and we also collect a richer dataset based on real items on Craigslist. Human evaluation shows that our systems achieve higher task success rate and more human-like negotiation behavior than previous approaches.

Derek Chen | Percy Liang | Anusha Balakrishnan | He He | Percy Liang | He He | Anusha Balakrishnan | Derek Chen

[1] Oliver Lemon,et al. Evaluating Persuasion Strategies and Deep Reinforcement Learning methods for Negotiation Dialogue agents , 2017, EACL.

[2] S. Brams. Negotiation games : applying game theory to bargaining and arbitration , 1992 .

[3] Kallirroi Georgila,et al. A Cultural Decision-Making Model for Negotiation based on Inverse Reinforcement Learning , 2012, CogSci.

[4] Stephen Clark,et al. Latent Variable Dialogue Models and their Diversity , 2017, EACL.

[5] Tomoki Toda,et al. Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing , 2014, COLING.

[6] David R. Traum,et al. Multi-party, Multi-issue, Multi-strategy Negotiation for Multi-modal Virtual Agents , 2008, IVA.

[7] Christopher Potts,et al. Goal-Driven Answers in the CardsDialogue Corpus , 2012 .

[8] Stephen Clark,et al. Emergent Communication through Negotiation , 2018, ICLR.

[9] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[10] Jianfeng Gao,et al. Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.

[11] Jianfeng Gao,et al. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses , 2015, NAACL.

[12] Maxine Eskénazi,et al. Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders , 2017, ACL.

[13] Alan Ritter,et al. Adversarial Learning for Neural Dialogue Generation , 2017, EMNLP.

[14] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[15] David DeVault,et al. Toward Natural Turn-Taking in a Virtual Human Negotiation Agent , 2015, AAAI Spring Symposia.

[16] David Vandyke,et al. A Network-based End-to-End Trainable Task-oriented Dialogue System , 2016, EACL.

[17] Geoffrey Zweig,et al. Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning , 2017, ACL.

[18] Jianfeng Gao,et al. Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access , 2016, ACL.

[19] Oliver Lemon,et al. Strategic Dialogue Management via Deep Reinforcement Learning , 2015, NIPS 2015.

[20] Percy Liang,et al. Learning Symmetric Collaborative Dialogue Agents with Dynamic Knowledge Graph Embeddings , 2017, ACL.

[21] N. Arnett. Goal-driven Answers in the Cards Dialogue Corpus , 2012 .

[22] Milica Gasic,et al. POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.

[23] Jason Weston,et al. Learning End-to-End Goal-Oriented Dialog , 2016, ICLR.

[24] Tsung-Hsien Wen,et al. Latent Intention Dialogue Models , 2017, ICML.

[25] Nicholas Asher,et al. Discourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus , 2016, LREC.

[26] Antoine Raux,et al. The Dialog State Tracking Challenge Series: A Review , 2016, Dialogue Discourse.

[27] Michael English,et al. Learning Mixed Initiative Dialog Strategies By Using Reinforcement Learning On Both Conversants , 2005, HLT.

[28] Oliver Lemon,et al. Modelling Strategic Conversation: model, annotation design and corpus , 2012 .

[29] Satoshi Nakamura,et al. Reinforcement Learning in Multi-Party Trading Dialog , 2015, SIGDIAL Conference.

[30] Yann Dauphin,et al. Deal or No Deal? End-to-End Learning of Negotiation Dialogues , 2017, EMNLP.

[31] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[32] Joelle Pineau,et al. Training End-to-End Dialogue Systems with the Ubuntu Dialogue Corpus , 2017, Dialogue Discourse.