A Position-Aware Deep Model for Relevance Matching in Information Retrieval

In order to adopt deep learning for ad-hoc information retrieval, it is essential to establish suitable representations of query-document pairs and to design neural architectures that are able to digest such representations. In particular, they ought to capture all relevant information required to assess the relevance of a document for a given user query, including term overlap as well as positional information such as proximity and term dependencies. While previous work has successfully captured unigram term matches, none has successfully used position-dependent information on a standard benchmark test collection. In this work, we address this gap by encoding the relevance matching in terms of similarity matrices and using a deep model to digest such matrices. We present a novel model architecture consisting of convolutional layers to capture term dependencies and proximity among query term occurrences, followed by a recurrent layer to capture relevance over di‚erent query terms. Extensive experiments on Trec Web Track data con€rm that the proposed model with similarity matrix representations yields improved search results.

[1]  Larry P. Heck,et al.  Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[2]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[3]  W. Bruce Croft,et al.  A Markov random field model for term dependencies , 2005, SIGIR '05.

[4]  W. Bruce Croft,et al.  Quary Expansion Using Local and Global Document Analysis , 1996, SIGIR Forum.

[5]  W. Bruce Croft,et al.  A Comparison of Retrieval Models using Term Dependencies , 2014, CIKM.

[6]  Olivier Chapelle,et al.  Expected reciprocal rank for graded relevance , 2009, CIKM.

[7]  Ellen M. Voorhees,et al.  TREC 2014 Web Track Overview , 2015, TREC.

[8]  Ellen M. Voorhees,et al.  Variations in relevance judgments and the measurement of retrieval effectiveness , 1998, SIGIR '98.

[9]  W. Bruce Croft,et al.  A framework for selective query expansion , 2004, CIKM '04.

[10]  Tao Tao,et al.  An exploration of proximity measures in information retrieval , 2007, SIGIR.

[11]  David Maxwell Chickering,et al.  Here or there: preference judgments for relevance , 2008 .

[12]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[13]  John D. Lafferty,et al.  A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.

[14]  Xueqi Cheng,et al.  A Study of MatchPyramid Models on Ad-hoc Retrieval , 2016, ArXiv.

[15]  W. Bruce Croft,et al.  Adaptability of Neural Networks on Varying Granularity IR Tasks , 2016, ArXiv.

[16]  Mark Sanderson,et al.  Relevance judgments between TREC and Non-TREC assessors , 2008, SIGIR '08.

[17]  W. Bruce Croft,et al.  A Deep Relevance Matching Model for Ad-hoc Retrieval , 2016, CIKM.

[18]  Xueqi Cheng,et al.  Text Matching as Image Recognition , 2016, AAAI.

[19]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[20]  Omar Alonso,et al.  Using crowdsourcing for TREC relevance assessment , 2012, Inf. Process. Manag..

[21]  Yelong Shen,et al.  A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval , 2014, CIKM.

[22]  Nir Ailon,et al.  Ranking from pairs and triplets: information quality, evaluation methods and query complexity , 2011, WSDM '11.

[23]  Nick Craswell,et al.  Learning to Match using Local and Distributed Representations of Text for Web Search , 2016, WWW.

[24]  Yelong Shen,et al.  Learning semantic representations using convolutional neural networks for web search , 2014, WWW.

[25]  Alessandro Moschitti,et al.  Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks , 2015, SIGIR.