Paper Information - MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations

MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations

We study conversational dialog in which there are many possible responses to a given history. We present the MultiTalk Dataset, a corpus of over 320,000 sentences of written conversational dialog that balances a high branching factor (10) with several conversation turns (6) through selective branch continuation. We make multiple contributions to study dialog generation in the highly branching setting. In order to evaluate a diverse set of generations, we propose a simple scoring algorithm, based on bipartite graph matching, to optimally incorporate a set of diverse references. We study multiple language generation tasks at different levels of predictive conversation depth, using textual attributes induced automatically from pretrained classifiers. Our culminating task is a challenging theory of mind problem, a controllable generation task which requires reasoning about the expected reaction of the listener.
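The matching-based scoring idea mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the similarity function (`unigram_f1`), the function name `matched_score`, and the brute-force search over assignments are all assumptions chosen to keep the example dependency-free. A practical version would likely use a polynomial-time assignment solver such as the Hungarian method and whatever sentence-level metric the evaluation calls for.

```python
from collections import Counter
from itertools import permutations


def unigram_f1(hyp: str, ref: str) -> float:
    """Hypothetical stand-in similarity: unigram F1 on whitespace tokens."""
    h, r = hyp.lower().split(), ref.lower().split()
    if not h or not r:
        return 0.0
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(h) & Counter(r)).values())
    # F1 = 2PR / (P + R) simplifies to 2 * overlap / (|h| + |r|).
    return 2 * overlap / (len(h) + len(r))


def matched_score(hyps, refs, sim=unigram_f1):
    """Mean similarity under the best one-to-one hypothesis-to-reference matching.

    Each hypothesis is paired with a distinct reference, so several copies of
    the same generation cannot all take credit for the same reference.
    Brute force over assignments: only suitable for small sets.
    """
    if not hyps:
        raise ValueError("need at least one hypothesis")
    if len(hyps) > len(refs):
        raise ValueError("this sketch assumes len(hyps) <= len(refs)")
    scores = [[sim(h, r) for r in refs] for h in hyps]
    best_total, best_assignment = -1.0, None
    # assignment[i] is the index of the reference paired with hypothesis i.
    for assignment in permutations(range(len(refs)), len(hyps)):
        total = sum(scores[i][j] for i, j in enumerate(assignment))
        if total > best_total:
            best_total, best_assignment = total, assignment
    return best_total / len(hyps), best_assignment


refs = ["no thanks", "i love it so much", "maybe later"]
diverse = ["i love it", "no thanks"]
repetitive = ["no thanks", "no thanks"]

print(matched_score(diverse, refs))        # (0.875, (1, 0))
print(matched_score(repetitive, refs)[0])  # 0.5
```

In this toy case the repetitive set scores lower than the diverse one, because only one copy of "no thanks" can be matched to the identical reference. Scoring each generation against its single best reference would instead give the repetitive set a perfect score.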
