Wizard of Search Engine: Access to Information Through Conversations with Search Engines

Conversational information seeking (CIS) is playing an increasingly important role in connecting people to information. Due to a lackof suitable resources, previous studies on CIS are limited to thestudy of conceptual frameworks, laboratory-based user studies, or a particular aspect of CIS (e.g., asking clarifying questions). In this work, we make three main contributions to facilitate research into CIS: (1) We formulate a pipeline for CIS with six subtasks: intent detection, keyphrase extraction, action prediction, query selection, passage selection, and response generation. (2) We release a benchmark dataset, called wizard of search engine (WISE), which allows for comprehensive and in-depth research on all aspects of CIS. (3) We design a neural architecture capable of training and evaluating both jointly and separately on the six sub-tasks, and devise a pre-train/fine-tune learning scheme, that can reduce the requirements of WISE in scale by making full use of available data. We report useful characteristics of the CIS task based on statistics of the WISE dataset. We also show that our best performing model variant is able to achieve effective CIS. We release the dataset, code as well as evaluation scripts to facilitate future research by measuring further improvements in this important research direction.

[1]  Maarten de Rijke,et al.  Retrospective and Prospective Mixture-of-Generators for Task-oriented Dialogue Response Generation , 2020, ECAI.

[2]  Filip Radlinski,et al.  A Theoretical Framework for Conversational Search , 2017, CHIIR.

[3]  Mark Sanderson,et al.  Informing the Design of Spoken Conversational Search: Perspective Paper , 2018, CHIIR.

[4]  Xinyan Xiao,et al.  DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications , 2017, QA@ACL.

[5]  Wei Wu,et al.  Deep Chit-Chat: Deep Learning for Chatbots , 2019, SIGIR.

[6]  Yi Zhang,et al.  Conversational Recommender System , 2018, SIGIR.

[7]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[8]  Chris Dyer,et al.  The NarrativeQA Reading Comprehension Challenge , 2017, TACL.

[9]  Matthias Hagen,et al.  Conversational Search - A Report from Dagstuhl Seminar 19461 , 2020, ArXiv.

[10]  Eunsol Choi,et al.  QuAC: Question Answering in Context , 2018, EMNLP.

[11]  Mitesh M. Khapra,et al.  Towards Exploiting Background Knowledge for Building Conversation Systems , 2018, EMNLP.

[12]  M. de Rijke,et al.  Conversations with Search Engines , 2020, ArXiv.

[13]  Stefan Feuerriegel,et al.  Learning from On-Line User Feedback in Neural Question Answering on the Web , 2019, WWW.

[14]  Minlie Huang,et al.  KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation , 2020, ACL.

[15]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[16]  W. Bruce Croft,et al.  Asking Clarifying Questions in Open-Domain Information-Seeking Conversations , 2019, SIGIR.

[17]  Hideo Joho,et al.  Towards a Model for Spoken Conversational Search , 2019, Inf. Process. Manag..

[18]  Jason Weston,et al.  Wizard of Wikipedia: Knowledge-Powered Conversational agents , 2018, ICLR.

[19]  Rui Yan,et al.  Deep Chit-Chat: Deep Learning for Chatbots , 2019, WWW.

[20]  Paul N. Bennett,et al.  Generating Clarifying Questions for Information Retrieval , 2020, WWW.

[21]  M. de Rijke,et al.  Thinking Globally, Acting Locally: Distantly Supervised Global-to-Local Knowledge Selection for Background Based Conversation , 2019, AAAI.

[22]  Gerhard Weikum,et al.  Look before you Hop: Conversational Question Answering over Knowledge Graphs Using Judicious Context Expansion , 2019, CIKM.

[23]  Danqi Chen,et al.  CoQA: A Conversational Question Answering Challenge , 2018, TACL.

[24]  Alan Ritter,et al.  Fluent Response Generation for Conversational Question Answering , 2020, ACL.

[25]  Mark Sanderson,et al.  Informing the Design of Spoken Conversational Search: Perspective Paper , 2018, CHIIR.

[26]  Kam-Fai Wong,et al.  Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning , 2017, EMNLP.

[27]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[28]  Filip Radlinski,et al.  Towards Conversational Recommender Systems , 2016, KDD.

[29]  Mitesh M. Khapra,et al.  Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph , 2018, AAAI.

[30]  Jianfeng Gao,et al.  A Human Generated MAchine Reading COmprehension Dataset , 2018 .

[31]  Navdeep Jaitly,et al.  Pointer Networks , 2015, NIPS.

[32]  Yiqun Liu,et al.  Temporal Relational Ranking for Stock Prediction , 2018, ACM Trans. Inf. Syst..

[33]  Jamie Callan,et al.  CAsT-19: A Dataset for Conversational Information Seeking , 2020, SIGIR.

[34]  Daniel McDuff,et al.  Style and Alignment in Information-Seeking Conversation , 2018, CHIIR.

[35]  Jamie Callan,et al.  TREC CAsT 2019: The Conversational Assistance Track Overview , 2020, ArXiv.

[36]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[37]  Ryen W. White,et al.  Exploratory Search: Beyond the Query-Response Paradigm , 2009, Exploratory Search: Beyond the Query-Response Paradigm.

[38]  M. de Rijke,et al.  Query Resolution for Conversational Search with Limited Supervision , 2020, SIGIR.

[39]  W. Bruce Croft,et al.  User Intent Prediction in Information-seeking Conversations , 2019, CHIIR.

[40]  Hamed Zamani,et al.  Macaw: An Extensible Conversational Information Seeking Platform , 2019, SIGIR.

[41]  Charles L. A. Clarke,et al.  Exploring Conversational Search With Humans, Assistants, and Wizards , 2017, CHI Extended Abstracts.

[42]  Anton Leuski,et al.  Agent Dialogue: A Platform for Conversational Information Seeking Experimentation , 2020, SIGIR.

[43]  Martin Halvey,et al.  Conceptualizing agent-human interactions during the conversational search process , 2018 .

[44]  M. de Rijke,et al.  Advances and Challenges in Conversational Recommender Systems: A Survey , 2021, AI Open.

[45]  Xiyuan Zhang,et al.  Proactive Human-Machine Conversation with Explicit Conversation Goal , 2019, ACL.

[46]  Peng Li,et al.  Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering , 2016, ArXiv.

[47]  Yiqun Liu,et al.  Investigating Passage-level Relevance and Its Role in Document-level Relevance Judgment , 2019, SIGIR.

[48]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.