A Bayesian learning approach to promoting diversity in ranking for biomedical information retrieval

In this paper, we propose a Bayesian learning approach to promoting diversity in biomedical information retrieval, together with a re-ranking model that improves retrieval performance in the biomedical domain. First, the re-ranking model computes the maximum posterior probability of the hidden property corresponding to each retrieved passage. It then iteratively groups the passages into subsets according to their properties. Finally, the passages are re-ranked from these subsets to produce the output. The proposed method does not require any external biomedical resource. We evaluate the approach through extensive experiments on the TREC 2004-2007 Genomics data sets. The results show that it is effective at promoting diversity in ranking for biomedical information retrieval across all four years of TREC data.
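The three steps described above (assign each passage its maximum a posteriori property, group passages by property, re-rank from the groups) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the posteriors are estimated or how passages are drawn from the subsets, so the sketch assumes the posterior over hidden properties is already given for each passage and uses a simple round-robin interleaving across groups. The function name `rerank_by_property` and the input layout are hypothetical.

```python
from collections import defaultdict


def rerank_by_property(passages, posteriors):
    """Re-rank passages so that different hidden properties are interleaved.

    passages:   list of (passage_id, retrieval_score) pairs (assumed input).
    posteriors: dict mapping passage_id -> {property: probability}
                (assumed to be estimated elsewhere).
    """
    # Step 1: assign each passage the property with maximum posterior probability.
    # Step 2: group passages into subsets by that property.
    groups = defaultdict(list)
    for passage_id, score in passages:
        probs = posteriors[passage_id]
        map_property = max(probs, key=probs.get)
        groups[map_property].append((passage_id, score))

    # Within each subset, keep the highest-scoring passages first.
    for members in groups.values():
        members.sort(key=lambda item: item[1], reverse=True)

    # Visit subsets in order of their best passage score.
    ordered = sorted(groups.values(), key=lambda m: m[0][1], reverse=True)

    # Step 3: round-robin over the subsets (an assumed selection policy).
    reranked = []
    depth = 0
    while len(reranked) < len(passages):
        for members in ordered:
            if depth < len(members):
                reranked.append(members[depth][0])
        depth += 1
    return reranked
```

For example, with four passages scored 0.9, 0.8, 0.7 and 0.6, where the third passage has a different most-probable property from the other three, the sketch moves that passage up to second place so that the top of the list covers both properties.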
