Augmenting Knowledge-grounded Conversations with Sequential Knowledge Transition

Knowledge data are massive and widespread in the real world, and they can serve as rich external sources to enrich conversations. However, in knowledge-grounded conversations, current models still lack fine-grained control over knowledge selection and its integration into dialogues, which ultimately leads to knowledge-irrelevant response generation: 1) knowledge selection relies solely on the dialogue context, ignoring the inherent knowledge transitions that follow the conversation flow; 2) models often overfit during training, producing incoherent responses at test time by copying unrelated tokens from specific knowledge content; 3) although responses are generated from both the dialogue history and the knowledge, models often overlook the selected knowledge and hence generate knowledge-irrelevant responses. To address these problems, we propose to explicitly model knowledge transitions across sequential multi-turn conversations by abstracting knowledge into topic tags. Moreover, to fully exploit the selected knowledge during generation, we propose pre-training a knowledge-aware response generator that attends more closely to the selected knowledge. Concretely, a sequential knowledge transition model equipped with a pre-trained knowledge-aware response generator (SKT-KG) captures high-level knowledge transitions and makes full use of the limited knowledge data. Experimental results on both structured and unstructured knowledge-grounded dialogue benchmarks show that our model outperforms baseline models.
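
To make the idea of abstracting knowledge into topic tags concrete, the sketch below shows one plausible way to model the tag sequence of a conversation with a small recurrent classifier that predicts the next turn's topic tag; knowledge selection can then be restricted to entries carrying the predicted tag. The class name, shapes, and hyperparameters are hypothetical illustrations, not the authors' implementation.

```python
# Minimal sketch of sequential knowledge (topic-tag) transition modeling,
# assuming a fixed tag vocabulary. Names and sizes are hypothetical.
import torch
import torch.nn as nn

class TopicTransitionModel(nn.Module):
    """Predicts the next topic tag from the tag history of previous turns."""

    def __init__(self, num_tags: int, embed_dim: int = 128, hidden_dim: int = 256):
        super().__init__()
        self.tag_embedding = nn.Embedding(num_tags, embed_dim)
        self.rnn = nn.GRU(embed_dim, hidden_dim, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, num_tags)

    def forward(self, tag_history: torch.Tensor) -> torch.Tensor:
        # tag_history: (batch, num_turns) tag ids of earlier turns
        embedded = self.tag_embedding(tag_history)      # (batch, turns, embed_dim)
        _, last_hidden = self.rnn(embedded)             # (1, batch, hidden_dim)
        return self.classifier(last_hidden.squeeze(0))  # (batch, num_tags) logits

# Usage: predict the next turn's topic tag, then narrow knowledge selection
# to candidate entries labeled with that tag.
model = TopicTransitionModel(num_tags=50)
history = torch.tensor([[3, 7, 7, 12]])  # one dialogue's tag sequence so far
next_tag = model(history).argmax(dim=-1)  # predicted next topic tag
```

Modeling transitions at the tag level rather than over raw knowledge text keeps the transition space small, which is one way the approach could cope with limited knowledge data.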
