Incremental Transformer with Deliberation Decoder for Document Grounded Conversations

Document Grounded Conversations is the task of generating dialogue responses while chatting about the content of a given document. Document knowledge clearly plays a critical role in this task, yet existing dialogue models do not exploit it effectively. In this paper, we propose a novel Transformer-based architecture for multi-turn document grounded conversations. In particular, we devise an Incremental Transformer to encode multi-turn utterances together with the knowledge in related documents. Motivated by the human cognitive process, we design a two-pass decoder (Deliberation Decoder) to improve context coherence and knowledge correctness. Our empirical study on a real-world document grounded dataset shows that responses generated by our model significantly outperform those of competitive baselines in both context coherence and knowledge relevance.
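To make the two components concrete, below is a minimal sketch of the incremental encoding loop and the two-pass decoding it describes, written with PyTorch. This is an illustration of the general idea, not the authors' exact architecture: the module granularity, the attention wiring (utterance self-attention, then knowledge attention, then context attention), and all hyperparameters here are assumptions.

```python
# Hedged sketch of an Incremental Transformer encoder and a two-pass
# Deliberation Decoder. Layer choices and wiring are illustrative only.
import torch
import torch.nn as nn

class IncrementalTransformer(nn.Module):
    """Encodes each utterance with its document knowledge, attending over
    the accumulated context from previous turns (assumed wiring)."""
    def __init__(self, d_model=512, nhead=8):
        super().__init__()
        self.utter_enc = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.know_attn = nn.MultiheadAttention(d_model, nhead, batch_first=True)
        self.ctx_attn = nn.MultiheadAttention(d_model, nhead, batch_first=True)

    def forward(self, utterances, knowledge):
        # utterances: list of (batch, len_i, d_model) tensors, one per turn
        # knowledge:  (batch, len_k, d_model) encoded document for the dialogue
        context = None
        for u in utterances:
            h = self.utter_enc(u)                           # self-attend within the turn
            h, _ = self.know_attn(h, knowledge, knowledge)  # fuse document knowledge
            if context is not None:
                h, _ = self.ctx_attn(h, context, context)   # fuse earlier turns
            context = h if context is None else torch.cat([context, h], dim=1)
        return context

class DeliberationDecoder(nn.Module):
    """Two-pass decoding: a first pass drafts a context-coherent response,
    a second pass refines it while re-attending to the document."""
    def __init__(self, d_model=512, nhead=8):
        super().__init__()
        self.first_pass = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)
        self.second_pass = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)

    def forward(self, tgt, context, knowledge):
        draft = self.first_pass(tgt, context)        # pass 1: attend to dialogue context
        mem = torch.cat([draft, knowledge], dim=1)   # pass 2 sees the draft and the document
        return self.second_pass(tgt, mem)
```

The key design point the sketch tries to capture is the division of labor: the first decoding pass optimizes for fluency and coherence with the dialogue context, while the second pass revisits the draft against the grounding document to correct knowledge errors.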
