Improving the Generalizability of the Dense Passage Retriever Using Generated Datasets

[1]  Oyvind Tafjord,et al.  General-Purpose Question-Answering with Macaw , 2021, ArXiv.

[2]  Iryna Gurevych,et al.  BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models , 2021, NeurIPS Datasets and Benchmarks.

[3]  Jannis Bulian,et al.  CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims , 2020, ArXiv.

[4]  Nick Craswell,et al.  Simulating Information Retrieval Test Collections , 2020, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[5]  Paul N. Bennett,et al.  Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval , 2020, ICLR.

[6]  Kirk Roberts,et al.  TREC-COVID , 2020, SIGIR Forum.

[7]  Hannaneh Hajishirzi,et al.  Fact or Fiction: Verifying Scientific Claims , 2020, EMNLP.

[8]  M. Zaharia,et al.  ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT , 2020, SIGIR.

[9]  Daniel S. Weld,et al.  SPECTER: Document-level Representation Learning using Citation-informed Transformers , 2020, ACL.

[10]  Danqi Chen,et al.  Dense Passage Retrieval for Open-Domain Question Answering , 2020, EMNLP.

[11]  Colin Raffel,et al.  Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..

[12]  Ming-Wei Chang,et al.  Natural Questions: A Benchmark for Question Answering Research , 2019, TACL.

[13]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[14]  Yoshua Bengio,et al.  HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering , 2018, EMNLP.

[15]  Benno Stein,et al.  Retrieval of the Best Counterargument without Prior Topic Knowledge , 2018, ACL.

[16]  André Freitas,et al.  WWW'18 Open Challenge: Financial Opinion Mining and Question Answering , 2018, WWW.

[17]  Andreas Vlachos,et al.  FEVER: a Large-scale Dataset for Fact Extraction and VERification , 2018, NAACL.

[18]  Krisztian Balog,et al.  DBpedia-Entity v2: A Test Collection for Entity Search , 2017, SIGIR.

[19]  Stefan Riezler,et al.  A Full-Text Learning to Rank Dataset for Medical Information Retrieval , 2016, ECIR.

[20]  Timothy Baldwin,et al.  CQADupStack: A Benchmark Data Set for Community Question-Answering Research , 2015, ADCS.

[21]  M. de Rijke,et al.  Pseudo test collections for training and tuning microblog rankers , 2013, SIGIR.

[22]  M. de Rijke,et al.  Generating Pseudo Test Collections for Learning to Rank Scientific Articles , 2012, CLEF.

[23]  Jimmy J. Lin,et al.  Pseudo test collections for learning web search ranking functions , 2011, SIGIR.

[24]  W. Bruce Croft,et al.  Retrieval experiments using pseudo-desktop collections , 2009, CIKM.

[25]  M. de Rijke,et al.  Automatic construction of known-item finding test beds , 2006, SIGIR.

[26]  Jean Tague-Sutcliffe,et al.  Problems in the simulation of bibliographic retrieval systems , 1980, SIGIR '80.

[27]  Martin Potthast,et al.  Overview of Touché 2022: Argument Retrieval , 2022, CLEF.

[28]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[29]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Report , 1999, TREC.