Utilizing Passages in Fusion-based Document Retrieval

The usage of passage-level information has been successfully demonstrated in many core IR tasks, and among such tasks, the task of passage-based document retrieval. In this work, we study the merits of utilizing similar information for the fusion-based document retrieval task. Overall, we show that such information can be highly useful for this task as well. To this end, we propose three passage-based fusion methods and show that their performance can transcend that of strong document-level fusion methods.

[1]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[2]  Shengli Wu,et al.  Data Fusion in Information Retrieval , 2012, Adaptation, Learning, and Optimization.

[3]  Maarten de Rijke,et al.  Manifold Learning for Rank Aggregation , 2018, WWW.

[4]  Luo Si,et al.  Discriminative probabilistic models for passage based retrieval , 2008, SIGIR '08.

[5]  Oren Kurland,et al.  A Probabilistic Fusion Framework , 2016, CIKM.

[6]  H. P. Young,et al.  An axiomatization of Borda's rule , 1974 .

[7]  W. Bruce Croft,et al.  WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval , 2018, SIGIR.

[8]  Charles L. A. Clarke,et al.  Reciprocal rank fusion outperforms condorcet and individual rank learning methods , 2009, SIGIR.

[9]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[10]  Oren Kurland,et al.  Utilizing relevance feedback in fusion-based retrieval , 2014, SIGIR.

[11]  Kyunghyun Cho,et al.  Passage Re-ranking with BERT , 2019, ArXiv.

[12]  W. Bruce Croft,et al.  Passage retrieval based on language models , 2002, CIKM '02.

[13]  Haggai Roitman Utilizing Pseudo-Relevance Feedback in Fusion-based Retrieval , 2018, ICTIR.

[14]  Oren Kurland,et al.  Predicting Query Performance by Query-Drift Estimation , 2009, TOIS.

[15]  W. Bruce Croft,et al.  Retrieving Passages and Finding Answers , 2014, ADCS '14.

[16]  Jimmy J. Lin,et al.  Quantitative evaluation of passage retrieval algorithms for question answering , 2003, SIGIR.

[17]  Justin Zobel,et al.  Passage retrieval revisited , 1997, SIGIR '97.

[18]  W. Bruce Croft,et al.  Beyond Factoid QA: Effective Methods for Non-factoid Answer Sentence Retrieval , 2016, ECIR.

[19]  Daniel Ortiz Arroyo,et al.  Applying Data Fusion Methods to Passage Retrieval in QAS , 2007, MCS.

[20]  Oren Kurland,et al.  Query performance prediction for IR , 2012, SIGIR '12.

[21]  Mathias Géry,et al.  BM25t: a BM25 extension for focused information retrieval , 2012, Knowledge and Information Systems.

[22]  Oren Kurland,et al.  Cluster-based fusion of retrieved lists , 2011, SIGIR.

[23]  Oren Kurland,et al.  A study of the integration of passage-, document-, and cluster-based information for re-ranking search results , 2011, Information Retrieval.

[24]  Ross Wilkinson,et al.  Effective retrieval of structured documents , 1994, SIGIR '94.

[25]  Oren Kurland,et al.  Predicting the performance of passage retrieval for question answering , 2012, CIKM.

[26]  Jun Xu,et al.  Modeling Diverse Relevance Patterns in Ad-hoc Retrieval , 2018, SIGIR.

[27]  W. Bruce Croft,et al.  Query performance prediction in web search environments , 2007, SIGIR.

[28]  Oren Kurland,et al.  From "Identical" to "Similar": Fusing Retrieved Lists Based on Inter-document Similarities , 2009, ICTIR.

[29]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[30]  Oren Kurland,et al.  Utilizing Passage-Based Language Models for Document Retrieval , 2008, ECIR.

[31]  J. Shane Culpepper,et al.  Fusion in Information Retrieval: SIGIR 2018 Half-Day Tutorial , 2018, SIGIR.

[32]  Jimmy J. Lin,et al.  End-to-End Open-Domain Question Answering with BERTserini , 2019, NAACL.

[33]  Haggai Roitman,et al.  An Extended Query Performance Prediction Framework Utilizing Passage-Level Information , 2018, ICTIR.

[34]  Ani Nenkova,et al.  A Survey of Text Summarization Techniques , 2012, Mining Text Data.

[35]  W. Bruce Croft,et al.  A Language Modeling Approach to Information Retrieval , 1998, SIGIR Forum.

[36]  Oren Kurland,et al.  From "Identical" to "Similar": Fusing Retrieved Lists Based on Inter-document Similarities , 2011, ICTIR.