Conversations with Search Engines

In this article, we address the problem of answering complex information needs by conducting conversations with search engines, in the sense that users can express their queries in natural language and directly receive the information they need from a short system response in a conversational manner. Recently, there have been some attempts towards a similar goal, e.g., studies on Conversational Agents (CAs) and Conversational Search (CS). However, they either do not address complex information needs in search scenarios or they are limited to the development of conceptual frameworks and/or laboratory-based user studies. We pursue two goals in this article: (1) the creation of a suitable dataset, the Search as a Conversation (SaaC) dataset, for the development of pipelines for conversations with search engines, and(2) the development of a state-of-the-art pipeline for conversations with search engines, Conversations with Search Engines (CaSE), using this dataset. SaaC is built based on a multi-turn conversational search dataset, where we further employ workers from a crowdsourcing platform to summarize each relevant passage into a short, conversational response. CaSE enhances the state-of-the-art by introducing a supporting token identification module and a prior-aware pointer generator, which enables us to generate more accurate responses. We carry out experiments to show that CaSE is able to outperform strong baselines. We also conduct extensive analyses on the SaaC dataset to show where there is room for further improvement beyond CaSE. Finally, we release the SaaC dataset and the code for CaSE and all models used for comparison to facilitate future research on this topic.

[1]  Stefan Ultes,et al.  MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling , 2018, EMNLP.

[2]  Jamie Callan,et al.  CAsT-19: A Dataset for Conversational Information Seeking , 2020, SIGIR.

[3]  Xiangnan He,et al.  Interactive Path Reasoning on Graph for Conversational Recommendation , 2020, KDD.

[4]  M. de Rijke,et al.  Query Resolution for Conversational Search with Limited Supervision , 2020, SIGIR.

[5]  Fabio Crestani,et al.  Harnessing Evolution of Multi-Turn Conversations for Effective Answer Retrieval , 2020, CHIIR.

[6]  W. Bruce Croft,et al.  User Intent Prediction in Information-seeking Conversations , 2019, CHIIR.

[7]  Ali Farhadi,et al.  Bidirectional Attention Flow for Machine Comprehension , 2016, ICLR.

[8]  Hamed Zamani,et al.  Macaw: An Extensible Conversational Information Seeking Platform , 2019, SIGIR.

[9]  M. de Rijke,et al.  RefNet: A Reference-aware Network for Background Based Conversation , 2019, AAAI.

[10]  Nan Duan,et al.  Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base , 2019, EMNLP.

[11]  Filip Radlinski,et al.  A Theoretical Framework for Conversational Search , 2017, CHIIR.

[12]  Yi Zhang,et al.  Conversational Recommender System , 2018, SIGIR.

[13]  M. de Rijke,et al.  Thinking Globally, Acting Locally: Distantly Supervised Global-to-Local Knowledge Selection for Background Based Conversation , 2019, AAAI.

[14]  Charles L. A. Clarke,et al.  Exploring Conversational Search With Humans, Assistants, and Wizards , 2017, CHI Extended Abstracts.

[15]  Ben Carterette,et al.  Overview of the TREC 2011 Session Track , 2011, TREC.

[16]  Martin Halvey,et al.  Conceptualizing agent-human interactions during the conversational search process , 2018 .

[17]  M. de Rijke,et al.  Leveraging Contextual Sentence Relations for Extractive Summarization Using a Neural Attention Model , 2017, SIGIR.

[18]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[19]  M. de Rijke,et al.  Conversational Exploratory Search via Interactive Storytelling , 2017, ArXiv.

[20]  Junji Tomita,et al.  Multi-style Generative Reading Comprehension , 2019, ACL.

[21]  Munindar P. Singh The Practical Handbook of Internet Computing , 2004 .

[22]  Jason Weston,et al.  Wizard of Wikipedia: Knowledge-Powered Conversational agents , 2018, ICLR.

[23]  Paul N. Bennett,et al.  Generating Clarifying Questions for Information Retrieval , 2020, WWW.

[24]  W. Bruce Croft,et al.  Analyzing and Characterizing User Intent in Information-seeking Conversations , 2018, SIGIR.

[25]  Ming-Wei Chang,et al.  The Value of Semantic Parse Labeling for Knowledge Base Question Answering , 2016, ACL.

[26]  Nicholas J. Belkin,et al.  Cases, scripts, and information-seeking strategies: On the design of interactive information retrieval systems , 1995 .

[27]  Igor Shalyminov,et al.  Fast Domain Adaptation for Goal-Oriented Dialogue Using a Hybrid Generative-Retrieval Transformer , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28]  Ming Zhou,et al.  Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base , 2018, NeurIPS.

[29]  Richard Socher,et al.  Dynamic Coattention Networks For Question Answering , 2016, ICLR.

[30]  Rui Yan,et al.  Deep Chit-Chat: Deep Learning for Chatbots , 2019, WWW.

[31]  Igor Shalyminov,et al.  Hybrid Generative-Retrieval Transformers for Dialogue Domain Adaptation , 2020, ArXiv.

[32]  Daniel McDuff,et al.  Style and Alignment in Information-Seeking Conversation , 2018, CHIIR.

[33]  Gerhard Weikum,et al.  Look before you Hop: Conversational Question Answering over Knowledge Graphs Using Judicious Context Expansion , 2019, CIKM.

[34]  Min-Yen Kan,et al.  Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures , 2018, ACL.

[35]  Chris Dyer,et al.  The NarrativeQA Reading Comprehension Challenge , 2017, TACL.

[36]  Joelle Pineau,et al.  The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-Turn Dialogue Systems , 2015, SIGDIAL Conference.

[37]  Danqi Chen,et al.  CoQA: A Conversational Question Answering Challenge , 2018, TACL.

[38]  Ryuichiro Higashinaka,et al.  Towards an open-domain conversational system fully based on natural language processing , 2014, COLING.

[39]  Mark Sanderson,et al.  Informing the Design of Spoken Conversational Search: Perspective Paper , 2018, CHIIR.

[40]  Ming-Wei Chang,et al.  Latent Retrieval for Weakly Supervised Open Domain Question Answering , 2019, ACL.

[41]  Yoshua Bengio,et al.  HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering , 2018, EMNLP.

[42]  Jianfeng Gao,et al.  A Human Generated MAchine Reading COmprehension Dataset , 2018 .

[43]  Xin Jiang,et al.  Neural Generative Question Answering , 2015, IJCAI.

[44]  W. Bruce Croft,et al.  I3R: A new approach to the design of document retrieval systems , 1987, J. Am. Soc. Inf. Sci..

[45]  Richard Socher,et al.  A Deep Reinforced Model for Abstractive Summarization , 2017, ICLR.

[46]  Eunsol Choi,et al.  TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension , 2017, ACL.

[47]  Rajarshi Das,et al.  Multi-step Retriever-Reader Interaction for Scalable Open-domain Question Answering , 2019, ICLR.

[48]  Xu Chen,et al.  Towards Conversational Search and Recommendation: System Ask, User Respond , 2018, CIKM.

[49]  Ming Zhou,et al.  S-Net: From Answer Extraction to Answer Synthesis for Machine Reading Comprehension , 2018, AAAI.

[50]  Filip Radlinski,et al.  Towards Conversational Recommender Systems , 2016, KDD.

[51]  Mitesh M. Khapra,et al.  Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph , 2018, AAAI.

[52]  W. Bruce Croft,et al.  Conversational Product Search Based on Negative Feedback , 2019, CIKM.

[53]  W. Bruce Croft,et al.  Asking Clarifying Questions in Open-Domain Information-Seeking Conversations , 2019, SIGIR.

[54]  Hideo Joho,et al.  Towards a Model for Spoken Conversational Search , 2019, Inf. Process. Manag..

[55]  Stefan Feuerriegel,et al.  Learning from On-Line User Feedback in Neural Question Answering on the Web , 2019, WWW.

[56]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[57]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[58]  Claudia Hauff,et al.  Introducing MANtIS: a novel Multi-Domain Information Seeking Dialogues Dataset , 2019, ArXiv.

[59]  Eunsol Choi,et al.  QuAC: Question Answering in Context , 2018, EMNLP.

[60]  Mitesh M. Khapra,et al.  Towards Exploiting Background Knowledge for Building Conversation Systems , 2018, EMNLP.

[61]  Xiangnan He,et al.  Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems , 2020, WSDM.

[62]  James C. Lester,et al.  Conversational Agents , 2004, The Practical Handbook of Internet Computing.

[63]  Niels Ole Bernsen,et al.  Designing interactive speech systems - from first ideas to user testing , 1998 .

[64]  W. Bruce Croft,et al.  Open-Retrieval Conversational Question Answering , 2020, SIGIR.

[65]  Jian Zhang,et al.  SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[66]  Ben Carterette,et al.  Million Query Track 2007 Overview , 2008, TREC.

[67]  Lihong Li,et al.  Neural Approaches to Conversational AI , 2019, Found. Trends Inf. Retr..

[68]  Pengjie Ren,et al.  ILPS at TREC 2019 Conversational Assistant Track , 2019, TREC.