Initiative-Aware Self-Supervised Learning for Knowledge-Grounded Conversations

In the knowledge-grounded conversation (KGC) task systems aim to produce more informative responses by leveraging external knowledge. KGC includes a vital part, knowledge selection, where conversational agents select the appropriate knowledge to be incorporated in the next response. Mixed initiative is an intrinsic feature of conversations where the user and the system can both take the initiative in suggesting new conversational directions. Knowledge selection can be driven by the user's initiative or by the system's initiative. For the former, the system usually selects knowledge according to the current user utterance that contains new topics or questions posed by the user; for the latter, the system usually selects knowledge according to the previously selected knowledge. No previous study has considered the mixed-initiative characteristics of knowledge selection to improve its performance. In this paper, we propose a mixed-initiative knowledge selection method (MIKe) for KGC, which explicitly distinguishes between user-initiative and system-initiative knowledge selection. Specifically, we introduce two knowledge selectors to model both of them separately, and design a novel initiative discriminator to discriminate the initiative type of knowledge selection at each conversational turn. A challenge for training MIKe is that we usually have no labels for indicating initiative. To tackle this challenge, we devise an initiative-aware self-supervised learning scheme that helps MIKe to learn to discriminate the initiative type via a self-supervised task. Experimental results on two datasets show that MIKe significantly outperforms state-of-the-art methods in terms of both automatic and human evaluations, indicating that it can select more appropriate knowledge and generate more informative and engaging responses.

[1]  Jianxun Lian,et al.  Self-supervised Graph Learning for Recommendation , 2020, SIGIR.

[2]  Evangelos Kanoulas,et al.  Analysing the Effect of Clarifying Questions on Document Ranking in Conversational Search , 2020, ICTIR.

[3]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[4]  M. de Rijke,et al.  RefNet: A Reference-aware Network for Background Based Conversation , 2019, AAAI.

[5]  Zheng-Yu Niu,et al.  Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation , 2020, ACL.

[6]  Zhaochun Ren,et al.  Explicit State Tracking with Semi-Supervisionfor Neural Dialogue Generation , 2018, CIKM.

[7]  Xiaojiang Liu,et al.  Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation , 2020, ACL.

[8]  Mo Yu,et al.  Self-Supervised Learning for Contextualized Extractive Summarization , 2019, ACL.

[9]  Maarten de Rijke,et al.  Conversations with Search Engines , 2020, ArXiv.

[10]  Jianfeng Gao,et al.  Challenges in Building Intelligent Open-domain Dialog Systems , 2019, ACM Trans. Inf. Syst..

[11]  Marilyn A. Walker,et al.  Mixed Initiative in Dialogue: An Investigation into Discourse Segmentation , 1990, ACL.

[12]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[13]  Min-Yen Kan,et al.  Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures , 2018, ACL.

[14]  Alan W. Black,et al.  A Dataset for Document Grounded Conversations , 2018, EMNLP.

[15]  Minlie Huang,et al.  Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation , 2020, FINDINGS.

[16]  Nan Hua,et al.  Universal Sentence Encoder , 2018, ArXiv.

[17]  Xiaodong Liu,et al.  Conversing by Reading: Contentful Neural Conversation with On-demand Machine Reading , 2019, ACL.

[18]  Wen Zheng,et al.  Enhancing Conversational Dialogue Models with Grounded Knowledge , 2019, CIKM.

[19]  Wei Wu,et al.  Learning a Simple and Effective Model for Multi-turn Response Generation with Auxiliary Tasks , 2020, EMNLP.

[20]  Zheng-Yu Niu,et al.  Knowledge Aware Conversation Generation with Reasoning on Augmented Graph , 2019, ArXiv.

[21]  Zhonghai Wu,et al.  Diverse and Informative Dialogue Generation with Context-Specific Commonsense Knowledge Awareness , 2020, ACL.

[22]  Seungwhan Moon,et al.  OpenDialKG: Explainable Conversational Reasoning with Attention-based Walks over Knowledge Graphs , 2019, ACL.

[23]  Peijie Sun,et al.  Dual Learning for Explainable Recommendation: Towards Unifying User Preference Prediction and Review Generation , 2020, WWW.

[24]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[25]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[26]  M. de Rijke,et al.  An Analysis of Mixed Initiative and Collaboration in Information-Seeking Dialogues , 2020, SIGIR.

[27]  Ke Zhou,et al.  Approximation of Response Knowledge Retrieval in Knowledge-grounded Dialogue Generation , 2020, FINDINGS.

[28]  Jamie Callan,et al.  TREC CAsT 2019: The Conversational Assistance Track Overview , 2020, ArXiv.

[29]  Omer Levy,et al.  BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension , 2019, ACL.

[30]  Jie Zhou,et al.  Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation , 2020, EMNLP.

[31]  Sungjin Lee,et al.  Consistent Dialogue Generation with Self-supervised Feature Learning , 2019, ArXiv.

[32]  Hang Li,et al.  “ Tony ” DNN Embedding for “ Tony ” Selective Read for “ Tony ” ( a ) Attention-based Encoder-Decoder ( RNNSearch ) ( c ) State Update s 4 SourceVocabulary Softmax Prob , 2016 .

[33]  Ji-Rong Wen,et al.  S3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization , 2020, CIKM.

[34]  Tao Qin,et al.  Dual Learning for Machine Translation and Beyond , 2020 .

[35]  Wei Chu,et al.  Generating Informative Conversational Response using Recurrent Knowledge-Interaction and Knowledge-Copy , 2020, ACL.

[36]  Rui Yan,et al.  Learning to Detect Relevant Contexts and Knowledge for Response Selection in Retrieval-based Dialogue Systems , 2020, CIKM.

[37]  Ming-Wei Chang,et al.  A Knowledge-Grounded Neural Conversation Model , 2017, AAAI.

[38]  Xiaoyan Zhu,et al.  Commonsense Knowledge Aware Conversation Generation with Graph Attention , 2018, IJCAI.

[39]  Jason Weston,et al.  Wizard of Wikipedia: Knowledge-Powered Conversational agents , 2018, ICLR.

[40]  Yang Feng,et al.  Incremental Transformer with Deliberation Decoder for Document Grounded Conversations , 2019, ACL.

[41]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[42]  Byeongchang Kim,et al.  Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue , 2020, ICLR.

[43]  Filip Radlinski,et al.  A Theoretical Framework for Conversational Search , 2017, CHIIR.

[44]  William Yang Wang,et al.  Self-Supervised Dialogue Learning , 2019, ACL.

[45]  Yeonchan Ahn,et al.  Exploiting Text Matching Techniques for Knowledge-Grounded Conversation , 2020, IEEE Access.

[46]  Rongzhong Lian,et al.  Learning to Select Knowledge for Response Generation in Dialog Systems , 2019, IJCAI.

[47]  M. de Rijke,et al.  DukeNet: A Dual Knowledge Interaction Network for Knowledge-Grounded Conversation , 2020, SIGIR.

[48]  Le Wu,et al.  A Neural Influence Diffusion Model for Social Recommendation , 2019, SIGIR.

[49]  Jie Tang,et al.  Self-Supervised Learning: Generative or Contrastive , 2020, IEEE Transactions on Knowledge and Data Engineering.

[50]  M. de Rijke,et al.  Thinking Globally, Acting Locally: Distantly Supervised Global-to-Local Knowledge Selection for Background Based Conversation , 2019, AAAI.

[51]  Alon Lavie,et al.  Meteor Universal: Language Specific Translation Evaluation for Any Target Language , 2014, WMT@ACL.

[52]  Dongyan Zhao,et al.  Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues , 2020, AAAI.

[53]  Ruifang He,et al.  Multiple Knowledge Syncretic Transformer for Natural Dialogue Generation , 2020, WWW.

[54]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[55]  Mitesh M. Khapra,et al.  Towards Exploiting Background Knowledge for Building Conversation Systems , 2018, EMNLP.

[56]  Xiangnan He,et al.  Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems , 2020, WSDM.

[57]  Jianfeng Gao,et al.  A Diversity-Promoting Objective Function for Neural Conversation Models , 2015, NAACL.

[58]  Minlie Huang,et al.  KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation , 2020, ACL.

[59]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[60]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[61]  Wei Wu,et al.  Knowledge-Grounded Dialogue Generation with Pre-trained Language Models , 2020, EMNLP.

[62]  Wei Wu,et al.  Zero-Resource Knowledge-Grounded Dialogue Generation , 2020, NeurIPS.

[63]  M. de Rijke,et al.  Conversations Powered by Cross-Lingual Knowledge , 2021, SIGIR.

[64]  Le Wu,et al.  Attentive Recurrent Social Recommendation , 2018, SIGIR.

[65]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[66]  Dongyan Zhao,et al.  Low-Resource Knowledge-Grounded Dialogue Generation , 2020, ICLR.