Passage Ranking with Weak Supervision

In this paper, we propose a \textit{weak supervision} framework for neural ranking tasks based on the data programming paradigm \citep{Ratner2016}, which enables us to leverage multiple weak supervision signals from different sources. Empirically, we consider two sources of weak supervision signals: unsupervised ranking functions and semantic feature similarities. We train a BERT-based passage-ranking model (which achieves new state-of-the-art performance on two benchmark datasets with full supervision) in our weak supervision framework. Without using any ground-truth training labels, our BERT-PR models outperform the BM25 baseline by a large margin on all three datasets and even beat the previous state-of-the-art fully supervised results on two of them.
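In the data programming paradigm, each weak supervision source is expressed as a labeling function that votes on (or abstains from) a candidate label, and the votes are then combined into a single weak label. The sketch below illustrates this idea for query-passage pairs, using simple stand-ins for the two signal types named above: token overlap as a proxy for an unsupervised ranking function such as BM25, and bag-of-words cosine similarity as a proxy for embedding-based semantic similarity. The function names, thresholds, and the majority-vote combiner are illustrative assumptions, not the paper's actual implementation (which would learn to denoise the sources rather than take a plain vote).

```python
from collections import Counter
import math

ABSTAIN = -1  # labeling functions may abstain instead of voting

def lf_lexical(query: str, passage: str, threshold: float = 0.2) -> int:
    """Weak signal 1: unsupervised ranking proxy (token overlap stands in for BM25)."""
    q, p = set(query.lower().split()), set(passage.lower().split())
    if not q:
        return ABSTAIN
    overlap = len(q & p) / len(q)
    return 1 if overlap >= threshold else 0

def lf_semantic(query: str, passage: str, threshold: float = 0.5) -> int:
    """Weak signal 2: semantic similarity proxy (bag-of-words cosine stands in
    for a sentence-embedding similarity)."""
    qc, pc = Counter(query.lower().split()), Counter(passage.lower().split())
    dot = sum(qc[t] * pc[t] for t in qc)
    norm = (math.sqrt(sum(v * v for v in qc.values()))
            * math.sqrt(sum(v * v for v in pc.values())))
    if norm == 0:
        return ABSTAIN
    return 1 if dot / norm >= threshold else 0

def weak_label(query: str, passage: str) -> int:
    """Combine the labeling functions by majority vote; ties and
    unanimous abstentions yield ABSTAIN (the pair is left unlabeled)."""
    votes = [lf(query, passage) for lf in (lf_lexical, lf_semantic)]
    votes = [v for v in votes if v != ABSTAIN]
    if not votes:
        return ABSTAIN
    pos, neg = votes.count(1), votes.count(0)
    if pos > neg:
        return 1
    if neg > pos:
        return 0
    return ABSTAIN
```

The weak labels produced this way (with abstentions discarded) would then serve as training targets for the neural ranker in place of human relevance judgments.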

[1] Iryna Gurevych et al. Representation Learning for Answer Selection with LSTM-Based Importance Weighting, 2017, IWCS.

[2] Bowen Zhou et al. Applying deep learning to answer selection: A study and an open task, 2015, IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).

[3] Nick Craswell et al. Learning to Match using Local and Distributed Representations of Text for Web Search, 2016, WWW.

[4] Ming-Wei Chang et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, 2019, NAACL.

[5] W. Bruce Croft et al. aNMM: Ranking Short Answer Texts with Attention-Based Neural Matching Model, 2016, CIKM.

[6] Sheng Li et al. A Review on Deep Learning Techniques Applied to Answer Selection, 2018, COLING.

[7] Jason Weston et al. Reading Wikipedia to Answer Open-Domain Questions, 2017, ACL.

[8] Jianfeng Gao et al. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset, 2018.

[9] Filip Radlinski et al. TREC Complex Answer Retrieval Overview, 2018, TREC.

[10] W. Bruce Croft et al. A Deep Relevance Matching Model for Ad-hoc Retrieval, 2016, CIKM.

[11] Lei Yu et al. Deep Learning for Answer Sentence Selection, 2014, arXiv.

[12] Jascha Sohl-Dickstein et al. Measuring the Effects of Data Parallelism on Neural Network Training, 2018, J. Mach. Learn. Res.

[13] Nan Hua et al. Universal Sentence Encoder, 2018, arXiv.

[14] Andrew Yates et al. An Approach for Weakly-Supervised Deep Information Retrieval, 2017, arXiv.

[15] W. Bruce Croft et al. Neural Ranking Models with Weak Supervision, 2017, SIGIR.

[16] Christopher De Sa et al. Data Programming: Creating Large Training Sets, Quickly, 2016, NIPS.

[17] W. Bruce Croft et al. WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval, 2018, SIGIR.

[18] Kyunghyun Cho et al. Passage Re-ranking with BERT, 2019, arXiv.

[19] Gerard de Melo et al. Co-PACRR: A Context-Aware Neural IR Model for Ad-hoc Retrieval, 2017, WSDM.

[20] Zhiyuan Liu et al. End-to-End Neural Ad-hoc Ranking with Kernel Pooling, 2017, SIGIR.