Improving the Generalizability of the Dense Passage Retriever Using Generated Datasets

M. de Rijke | Thilina C. Rajapakse

[1] Oyvind Tafjord, et al. General-Purpose Question-Answering with Macaw, 2021, ArXiv.

[2] Iryna Gurevych, et al. BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models, 2021, NeurIPS Datasets and Benchmarks.

[3] Jannis Bulian, et al. CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims, 2020, ArXiv.

[4] Nick Craswell, et al. Simulating Information Retrieval Test Collections, 2020, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[5] Paul N. Bennett, et al. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval, 2020, ICLR.

[6] Kirk Roberts, et al. TREC-COVID, 2020, SIGIR Forum.

[7] Hannaneh Hajishirzi, et al. Fact or Fiction: Verifying Scientific Claims, 2020, EMNLP.

[8] M. Zaharia, et al. ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERT, 2020, SIGIR.

[9] Daniel S. Weld, et al. SPECTER: Document-level Representation Learning using Citation-informed Transformers, 2020, ACL.

[10] Danqi Chen, et al. Dense Passage Retrieval for Open-Domain Question Answering, 2020, EMNLP.

[11] Colin Raffel, et al. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer, 2019, J. Mach. Learn. Res.

[12] Ming-Wei Chang, et al. Natural Questions: A Benchmark for Question Answering Research, 2019, TACL.

[13] Omer Levy, et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach, 2019, ArXiv.

[14] Yoshua Bengio, et al. HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering, 2018, EMNLP.

[15] Benno Stein, et al. Retrieval of the Best Counterargument without Prior Topic Knowledge, 2018, ACL.

[16] André Freitas, et al. WWW'18 Open Challenge: Financial Opinion Mining and Question Answering, 2018, WWW.

[17] Andreas Vlachos, et al. FEVER: a Large-scale Dataset for Fact Extraction and VERification, 2018, NAACL.

[18] Krisztian Balog, et al. DBpedia-Entity v2: A Test Collection for Entity Search, 2017, SIGIR.

[19] Stefan Riezler, et al. A Full-Text Learning to Rank Dataset for Medical Information Retrieval, 2016, ECIR.

[20] Timothy Baldwin, et al. CQADupStack: A Benchmark Data Set for Community Question-Answering Research, 2015, ADCS.

[21] M. de Rijke, et al. Pseudo test collections for training and tuning microblog rankers, 2013, SIGIR.

[22] M. de Rijke, et al. Generating Pseudo Test Collections for Learning to Rank Scientific Articles, 2012, CLEF.

[23] Jimmy J. Lin, et al. Pseudo test collections for learning web search ranking functions, 2011, SIGIR.

[24] W. Bruce Croft, et al. Retrieval experiments using pseudo-desktop collections, 2009, CIKM.

[25] M. de Rijke, et al. Automatic construction of known-item finding test beds, 2006, SIGIR.

[26] Jean Tague-Sutcliffe, et al. Problems in the simulation of bibliographic retrieval systems, 1980, SIGIR '80.

[27] Martin Potthast, et al. Overview of Touché 2022: Argument Retrieval, 2022, CLEF.

[28] Ming-Wei Chang, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2019, NAACL.

[29] Ellen M. Voorhees, et al. The TREC-8 Question Answering Track Report, 1999, TREC.