论文信息 - Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders - 字舞流文

Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders

While recent neural encoder-decoder models have shown great promise in modeling open-domain conversations, they often generate dull and generic responses. Unlike past work that has focused on diversifying the output of the decoder at word-level to alleviate this problem, we present a novel framework based on conditional variational autoencoders that captures the discourse-level diversity in the encoder. Our model uses latent variables to learn a distribution over potential conversational intents and generates diverse responses using only greedy decoders. We have further developed a novel variant that is integrated with linguistic prior knowledge for better performance. Finally, the training procedure is improved by introducing a bag-of-word loss. Our proposed models have been validated to generate significantly more diverse responses than baseline approaches and exhibit competence in discourse-level decision-making.

Maxine Eskénazi | Ran Zhao | Tiancheng Zhao | M. Eskénazi | Tiancheng Zhao | Ran Zhao

[1] Massimo Poesio,et al. Towards an Axiomatization of Dialogue Acts , 1998 .

[2] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[3] Joelle Pineau,et al. Bootstrapping Dialog Systems with Word Embeddings , 2014 .

[4] Gerard Salton,et al. Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[5] Joelle Pineau,et al. A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues , 2016, AAAI.

[6] Maxine Eskénazi,et al. Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning , 2016, SIGDIAL Conference.

[7] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[8] Alexander I. Rudnicky,et al. Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda , 2003, INTERSPEECH.

[9] Ewan Klein,et al. Natural Language Processing with Python , 2009 .

[10] Jianfeng Gao,et al. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses , 2015, NAACL.

[11] Jianfeng Gao,et al. A Persona-Based Neural Conversation Model , 2016, ACL.

[12] Alexander M. Rush,et al. Sequence-to-Sequence Learning as Beam-Search Optimization , 2016, EMNLP.

[13] SaltonGerard,et al. Term-weighting approaches in automatic text retrieval , 1988 .

[14] Quoc V. Le,et al. A Neural Conversational Model , 2015, ArXiv.

[15] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[16] Jianfeng Gao,et al. A Diversity-Promoting Objective Function for Neural Conversation Models , 2015, NAACL.

[17] Zhou Yu,et al. Strategy and Policy Learning for Non-Task-Oriented Conversational Systems , 2016, SIGDIAL Conference.

[18] Wei-Ying Ma,et al. Topic Aware Neural Response Generation , 2016, AAAI.

[19] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[20] Maxine Eskénazi,et al. Let's go public! taking a spoken dialog system to the real world , 2005, INTERSPEECH.

[21] Honglak Lee,et al. Learning Structured Output Representation using Deep Conditional Generative Models , 2015, NIPS.

[22] Joelle Pineau,et al. How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation , 2016, EMNLP.

[23] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[24] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[25] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[26] Jianfeng Gao,et al. Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.

[27] Johan A. K. Suykens,et al. Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[28] James F. Allen,et al. A Plan Recognition Model for Subdialogues in Conversations , 1987, Cogn. Sci..

[29] Colin Cherry,et al. A Systematic Comparison of Smoothing Techniques for Sentence-Level BLEU , 2014, WMT@ACL.

[30] Wei-Ying Ma,et al. Topic Augmented Neural Response Generation with a Joint Attention Mechanism , 2016, ArXiv.

[31] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[32] Ricardo Ribeiro,et al. The Influence of Context on Dialogue Act Recognition , 2015, ArXiv.

[33] Honglak Lee,et al. Attribute2Image: Conditional Image Generation from Visual Attributes , 2015, ECCV.

[34] Andreas Stolcke,et al. Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[35] Joelle Pineau,et al. Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models , 2015, AAAI.

[36] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[37] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[38] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[39] Samy Bengio,et al. Generating Sentences from a Continuous Space , 2015, CoNLL.

[40] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.

[41] Yonatan Belinkov,et al. Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks , 2016, ICLR.