Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval
Defu Lian | Chaozhuo Li | Hao Sun | Shitao Xiao | Xing Xie | Yingxia Shao | Qi Zhang | Zheng Liu | Weihao Han | Jianjin Zhang | Denvy Deng | Liangjie Zhang
[1] Yury Malkov,et al. Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors , 2018, ECCV.
[2] Oriol Vinyals,et al. Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.
[3] Ravishankar Krishnaswamy,et al. FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search , 2021, ArXiv.
[4] Tianyu Gao,et al. SimCSE: Simple Contrastive Learning of Sentence Embeddings , 2021, EMNLP.
[5] Yizhou Sun,et al. Differentiable Product Quantization for End-to-End Embedding Compression , 2019, ICML.
[6] Yoshua Bengio,et al. Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation , 2013, ArXiv.
[7] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.
[8] David G. Lowe,et al. Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.
[9] Minjia Zhang,et al. HM-ANN: Efficient Billion-Point Nearest Neighbor Search on Heterogeneous Memory , 2020, NeurIPS.
[10] King-Sun Fu,et al. IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[11] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.
[12] Yelong Shen,et al. Learning semantic representations using convolutional neural networks for web search , 2014, WWW.
[13] Nicole Immorlica,et al. Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.
[14] Jimmy J. Lin,et al. Distilling Dense Representations for Ranking using Tightly-Coupled Teachers , 2020, ArXiv.
[15] Ping Li,et al. MOBIUS: Towards the Next Generation of Query-Ad Matching in Baidu's Sponsored Search , 2019, KDD.
[16] Ming-Wei Chang,et al. REALM: Retrieval-Augmented Language Model Pre-Training , 2020, ICML.
[17] Linjun Yang,et al. Embedding-based Retrieval in Facebook Search , 2020, KDD.
[18] Cordelia Schmid,et al. Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[19] Shuaiqiang Wang,et al. Pre-trained Language Model for Web-scale Retrieval in Baidu Search , 2021, KDD.
[20] Hua Wu,et al. RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering , 2020, NAACL.
[21] Defu Lian,et al. Matching-oriented Product Quantization For Ad-hoc Retrieval , 2021, arXiv:2104.07858.
[22] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[23] Deng Cai,et al. Fast Approximate Nearest Neighbor Search With The Navigating Spreading-out Graph , 2017, Proc. VLDB Endow.
[24] Jacob Eisenstein,et al. Sparse, Dense, and Attentional Representations for Text Retrieval , 2021, Transactions of the Association for Computational Linguistics.
[25] Danqi Chen,et al. Dense Passage Retrieval for Open-Domain Question Answering , 2020, EMNLP.
[26] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.
[27] Ruofei Zhang,et al. TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval , 2020, CIKM.
[28] Kaiming He,et al. Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Suhas Jayaram Subramanya,et al. DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node , 2019.
[30] Ming-Wei Chang,et al. Latent Retrieval for Weakly Supervised Open Domain Question Answering , 2019, ACL.
[31] Yiming Yang,et al. XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.
[32] Jiafeng Guo,et al. Optimizing Dense Retrieval Model Training with Hard Negatives , 2021, SIGIR.
[33] David G. Lowe,et al. Shape indexing using approximate nearest-neighbour search in high-dimensional spaces , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[34] Bhaskar Mitra,et al. Overview of the TREC 2019 deep learning track , 2020, ArXiv.
[35] Omer Levy,et al. SpanBERT: Improving Pre-training by Representing and Predicting Spans , 2019, TACL.
[36] Kwan Hui Lim,et al. An Unsupervised Sentence Embedding Method by Mutual Information Maximization , 2020, EMNLP.
[37] Wei-Cheng Chang,et al. Pre-training Tasks for Embedding-based Large-scale Retrieval , 2020, ICLR.
[38] Ye Li,et al. Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval , 2020, ArXiv.
[39] Jian Sun,et al. Optimized Product Quantization , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[40] Matthijs Douze,et al. Searching in one billion vectors: Re-rank with source coding , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[41] Yury A. Malkov,et al. Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.