Meta Adaptive Neural Ranking with Contrastive Synthetic Supervision

Si Sun1∗, Yingzhuo Qian2∗, Zhenghao Liu, Chenyan Xiong, Kaitao Zhang, Jie Bao, Zhiyuan Liu, Paul Bennett Department of Electronic Engineering, Tsinghua University, Beijing, China Department of Computer Science and Technology, Tsinghua University, Beijing, China Institute for Artificial Intelligence, Tsinghua University, Beijing, China Beijing National Research Center for Information Science and Technology Microsoft Research, Redmond, USA {s-sun17, qyz17, liu-zh16, zkt18}@mails.tsinghua.edu.cn; {bao, liuzy}@tsinghua.edu.cn; {chenyan.xiong, Paul.N.Bennett}@microsoft.com Abstract

[1]  Bhaskar Mitra,et al.  Overview of the TREC 2019 deep learning track , 2020, ArXiv.

[2]  Tom M. Mitchell,et al.  Learning Data Manipulation for Augmentation and Weighting , 2019, NeurIPS.

[3]  Bin Yang,et al.  Learning to Reweight Examples for Robust Deep Learning , 2018, ICML.

[4]  Olivier Chapelle,et al.  Expected reciprocal rank for graded relevance , 2009, CIKM.

[5]  Kirk Roberts,et al.  TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19 , 2020, J. Am. Medical Informatics Assoc..

[6]  Jimmy J. Lin,et al.  Critically Examining the "Neural Hype": Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models , 2019, SIGIR.

[7]  W. Bruce Croft,et al.  Neural Ranking Models with Weak Supervision , 2017, SIGIR.

[8]  W. Bruce Croft,et al.  A Markov random field model for term dependencies , 2005, SIGIR '05.

[9]  Colin Raffel,et al.  Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..

[10]  Davis Liang,et al.  Embedding-based Zero-shot Retrieval through Query Generation , 2020, ArXiv.

[11]  Zhiyuan Liu,et al.  End-to-End Neural Ad-hoc Ranking with Kernel Pooling , 2017, SIGIR.

[12]  Qi Xie,et al.  Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting , 2019, NeurIPS.

[13]  Nazli Goharian,et al.  CEDR: Contextualized Embeddings for Document Ranking , 2019, SIGIR.

[14]  Jimmy J. Lin,et al.  Capreolus: A Toolkit for End-to-End Neural Ad Hoc Retrieval , 2020, WSDM.

[15]  Paul N. Bennett,et al.  Few-Shot Generative Conversational Query Rewriting , 2020, SIGIR.

[16]  Haggai Roitman,et al.  Ad-hoc Document Retrieval Using Weak-Supervision with BERT and GPT2 , 2020, EMNLP.

[17]  Yulia Tsvetkov,et al.  Balancing Training for Multilingual Neural Machine Translation , 2020, ACL.

[18]  Bhaskar Mitra,et al.  An Introduction to Neural Information Retrieval , 2018, Found. Trends Inf. Retr..

[19]  Ahmed Hassan Awadallah,et al.  Meta Label Correction for Learning with Weak Supervision , 2019, ArXiv.

[20]  Jacob Eisenstein,et al.  Sparse, Dense, and Attentional Representations for Text Retrieval , 2020, Transactions of the Association for Computational Linguistics.

[21]  Jimmy J. Lin,et al.  Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset , 2020, NLPCOVID19.

[22]  Jianfeng Gao,et al.  CMT in TREC-COVID Round 2: Mitigating the Generalization Gaps from Web to Special Domain Search , 2020, ArXiv.

[23]  Allan Hanbury,et al.  Local Self-Attention over Long Text for Efficient Document Retrieval , 2020, SIGIR.

[24]  Chenyan Xiong,et al.  Selective Weak Supervision for Neural Information Retrieval , 2020, WWW.

[25]  Andrew Yates,et al.  Content-Based Weak Supervision for Ad-Hoc Re-Ranking , 2017, SIGIR.

[26]  Ye Li,et al.  Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval , 2020, ArXiv.

[27]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[28]  Yiqun Liu,et al.  Investigating Weak Supervision in Deep Ranking , 2019, Data Inf. Manag..

[29]  M. de Rijke,et al.  A Neural Click Model for Web Search , 2016, WWW.

[30]  W. Bruce Croft,et al.  A Deep Relevance Matching Model for Ad-hoc Retrieval , 2016, CIKM.

[31]  Jamie Callan,et al.  Deeper Text Understanding for IR with Contextual Neural Language Modeling , 2019, SIGIR.

[32]  Ji Ma,et al.  Neural Passage Retrieval with Improved Negative Contrast , 2020, ArXiv.

[33]  Danqi Chen,et al.  Dense Passage Retrieval for Open-Domain Question Answering , 2020, EMNLP.

[34]  Nazli Goharian,et al.  SLEDGE: A Simple Yet Effective Baseline for COVID-19 Scientific Knowledge Search , 2020 .

[35]  Hugo Zaragoza,et al.  The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[36]  Zhiyuan Liu,et al.  Understanding the Behaviors of BERT in Ranking , 2019, ArXiv.

[37]  Ji Ma,et al.  Zero-shot Neural Retrieval via Domain-targeted Synthetic Query Generation , 2020, ArXiv.

[38]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[39]  Zhiyuan Liu,et al.  Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search , 2018, WSDM.

[40]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[41]  Gerard de Melo,et al.  PACRR: A Position-Aware Neural IR Model for Relevance Matching , 2017, EMNLP.

[42]  Kyunghyun Cho,et al.  Passage Re-ranking with BERT , 2019, ArXiv.

[43]  Bhaskar Mitra,et al.  An Axiomatic Approach to Regularizing Neural Ranking Models , 2019, SIGIR.

[44]  Md. Mustafizur Rahman,et al.  Neural information retrieval: at the end of the early years , 2017, Information Retrieval Journal.

[45]  W. Bruce Croft,et al.  Linear feature-based models for information retrieval , 2007, Information Retrieval.

[46]  Tie-Yan Liu,et al.  Word-Entity Duet Representations for Document Ranking , 2017, SIGIR.

[47]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[48]  Luyu Gao,et al.  Complementing Lexical Retrieval with Semantic Residual Embedding , 2020, ArXiv.

[49]  Sérgio Matos,et al.  Frugal neural reranking: evaluation on the Covid-19 literature , 2020, NLP4COVID@EMNLP.

[50]  Paul N. Bennett,et al.  Generic Intent Representation in Web Search , 2019, SIGIR.

[51]  Ming-Wei Chang,et al.  Latent Retrieval for Weakly Supervised Open Domain Question Answering , 2019, ACL.