Query Rewriting for Archived Information Retrieval

Word mismatch is a fundamental issue in the field of information retrieval. To address this problem, one of the major approaches is query formulation. Previous work mainly focused on the study of synonym based term replacement and the query paraphrasing. However, the word level replacement overlooked the context of terms and the query level paraphrasing yielded on the low performance. In this paper, we proposed a phrase-rewriting scheme for query formulation in Web search by exploiting multiple online search engines. Through the use of phrases, we actually explore the term context in the query. Meanwhile, phrase rewriting was employed to explore the semantic similarity of phrases. Experimental results showed that the proposed approach outperformed the baselines significantly.

[1]  Zhendong Niu,et al.  Concept Based Query Expansion , 2013, 2013 Ninth International Conference on Semantics, Knowledge and Grids.

[2]  Yi Liu,et al.  Statistical Machine Translation for Query Expansion in Answer Retrieval , 2007, ACL.

[3]  Xuehua Shen,et al.  Context-sensitive information retrieval using implicit feedback , 2005, SIGIR '05.

[4]  W. Bruce Croft,et al.  Quary Expansion Using Local and Global Document Analysis , 1996, SIGIR Forum.

[5]  Ming Zhou,et al.  Learning Question Paraphrases for QA from Encarta Logs , 2007, IJCAI.

[6]  K. Sparck Jones,et al.  What makes an automatic keyword classification effective , 1971 .

[7]  Haifeng Wang,et al.  Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora , 2008, ACL.

[8]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[9]  W. Bruce Croft,et al.  Improving the effectiveness of information retrieval with local context analysis , 2000, TOIS.

[10]  France T́elécom Learning Paraphrases to Improve a Question-Answering System , 2003 .

[11]  Ming Zhou,et al.  Combining Multiple Resources to Improve SMT-based Paraphrasing Model , 2008, ACL.

[12]  Karen Sparck Jones Automatic keyword classification for information retrieval , 1971 .

[13]  Matthew W. Bilotti,et al.  Query expansion techniques for question answering , 2004 .

[14]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[15]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[16]  Iryna Gurevych,et al.  Answering Learners’ Questions by Retrieving Question Paraphrases from Social Q&A Sites , 2008 .

[17]  Eduard H. Hovy,et al.  The Use of External Knowledge of Factoid QA , 2001, TREC.

[18]  Haifeng Wang,et al.  Leveraging Multiple MT Engines for Paraphrase Generation , 2010, COLING.

[19]  Chris Callison-Burch,et al.  Paraphrasing with Bilingual Parallel Corpora , 2005, ACL.

[20]  Paolo Rosso,et al.  A WordNet-based Query Expansion Method for Geographical Information Retrieval , 2005, CLEF.

[21]  Chris Callison-Burch,et al.  Syntactic Constraints on Paraphrases Extracted from Parallel Corpora , 2008, EMNLP.

[22]  Kevin Knight Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics , 2005 .