Zero-Resource Knowledge-Grounded Dialogue Generation

While neural conversation models have shown great potential to generate informative and engaging responses by introducing external knowledge, learning such a model typically requires knowledge-grounded dialogues, which are difficult to obtain. To overcome this data challenge and reduce the cost of building a knowledge-grounded dialogue system, we explore the problem under a zero-resource setting, assuming that no context-knowledge-response triples are available for training. To this end, we propose representing both the knowledge that bridges a context and a response and the way that knowledge is expressed as latent variables, and we devise a variational approach that can effectively estimate a generation model from a dialogue corpus and a knowledge corpus that are independent of each other. Evaluation results on three benchmarks of knowledge-grounded dialogue generation indicate that our model achieves performance comparable to state-of-the-art methods that rely on knowledge-grounded dialogues for training, and exhibits good generalization across different topics and datasets.
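To make the variational idea concrete, a standard conditional evidence lower bound (ELBO) of the kind the abstract alludes to can be sketched as follows. This is only an illustrative formalization under the stated latent-variable assumption, not necessarily the exact objective used in the paper; here $C$ denotes the dialogue context, $R$ the response, and $Z$ the latent knowledge (together with the latent way it is expressed):

```latex
% Conditional ELBO: a recognition network q_phi infers the latent knowledge Z
% from (C, R), while the prior p_theta(Z | C) conditions only on the context.
% Maximizing the right-hand side jointly trains the generator and the
% inference model without observed context-knowledge-response triples.
\log p_\theta(R \mid C)
  \;\ge\; \mathbb{E}_{q_\phi(Z \mid C, R)}\!\left[ \log p_\theta(R \mid C, Z) \right]
  \;-\; \mathrm{KL}\!\left( q_\phi(Z \mid C, R) \,\big\|\, p_\theta(Z \mid C) \right)
```

The gap between the two sides is the KL divergence between $q_\phi(Z \mid C, R)$ and the true posterior, so the bound is tight when the recognition network recovers the posterior exactly.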
