LDA based PSEUDO relevance feedback for cross language information retrieval

This paper introduced a LDA-based pseudo relevance feedback (PRF) model for cross language information retrieval. To validate the performance of PRF techniques in CLIR task, we conducted cross language query expansion experiments based on a self-constructed CLIR system, the LDA-based PRF model was applied before or after the query translating process, namely the pre-translation-PRF, the post-translation-PRF, and the combined-PRF strategy. We also compared this model with the classical VSM-based PRF algorithm. Experiment results showed that the proposed LDA-based PRF method was effective for improving the performance of CLIR.

[1]  David A. Evans,et al.  The Effect of Pseudo Relevance Feedback on MT-Based CLIR , 2000, RIAO.

[2]  W. Bruce Croft,et al.  Statistical Methods for Cross-Language Information Retrieval , 1998 .

[3]  C. J. van Rijsbergen,et al.  Phrase Identification in Cross-Language Information Retrieval , 2000, RIAO.

[4]  Jian-Yun Nie,et al.  Cross-language information retrieval based on parallel texts and automatic mining of parallel texts from the Web , 1999, SIGIR '99.

[5]  Jianqiang Wang,et al.  User-assisted query translation for interactive cross-language information retrieval , 2008, Inf. Process. Manag..

[6]  Gary Marchionini,et al.  Examining the effectiveness of real-time query expansion , 2007, Inf. Process. Manag..

[7]  Rudolf Kruse,et al.  Relevance Feedback for Association Rules by Leveraging Concepts from Information Retrieval , 2007, SGAI Conf..

[8]  John D. Lafferty,et al.  Model-based feedback in the language modeling approach to information retrieval , 2001, CIKM '01.

[9]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[10]  Donna K. Harman,et al.  Relevance feedback revisited , 1992, SIGIR '92.

[11]  W. Bruce Croft,et al.  Resolving ambiguity for cross-language retrieval , 1998, SIGIR '98.

[12]  Wei Wang,et al.  Cross language information retrieval based on LDA , 2009, 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems.

[13]  Max Welling,et al.  Fast collapsed gibbs sampling for latent dirichlet allocation , 2008, KDD.

[14]  Hongfei Lin,et al.  Finding a good query-related topic for boosting pseudo-relevance feedback , 2011, J. Assoc. Inf. Sci. Technol..

[15]  Mounia Lalmas,et al.  A survey on the use of relevance feedback for information access systems , 2003, The Knowledge Engineering Review.

[16]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[17]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[18]  W. Bruce Croft,et al.  Phrasal translation and query expansion techniques for cross-language information retrieval , 1997, SIGIR '97.