Learning Better Representations for Neural Information Retrieval with Graph Information

Neural ranking models have recently gained much attention in Information Retrieval community and obtain good ranking performance. However, most of these retrieval models focus on capturing the textual matching signals between query and document but do not consider user behavior information that may be helpful for the retrieval task. Specifically, users' click and query reformulation behavior can be represented by a click-through bipartite graph and a session-flow graph, respectively. Such graph representations contain rich user behavior information and may help us better understand users' search intent beyond the textual information. In this study, we aim to incorporate this rich information encoded in these two graphs into existing neural ranking models. We present two graph-based neural ranking models (\emphEmbRanker and AggRanker ) to enrich learned text representations with graph information that captures rich users' interaction behavior information. Experimental results on a large-scale publicly available benchmark dataset show that the two models outperform most existing neural ranking models that only consider textual information, which illustrates the effectiveness of integrating graph information with textual information. Further analyses show how graph information complements text matching signals and examine whether these two models can be adopted in practical applications.

[1]  Yiqun Liu,et al.  Hierarchical Attention Network for Context-Aware Query Suggestion , 2018, AIRS.

[2]  Yisong Yue,et al.  Beyond position bias: examining result attractiveness as a source of presentation bias in clickthrough data , 2010, WWW '10.

[3]  Yuchen Zhang,et al.  A noise-aware click model for web search , 2012, WSDM '12.

[4]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[5]  Jure Leskovec,et al.  Graph Convolutional Neural Networks for Web-Scale Recommender Systems , 2018, KDD.

[6]  Yuan Luo,et al.  Graph Convolutional Networks for Text Classification , 2018, AAAI.

[7]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[8]  Larry P. Heck,et al.  Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[9]  ChengXiang Zhai,et al.  Learning Query and Document Relevance from a Web-scale Click Graph , 2016, SIGIR.

[10]  Yiqun Liu,et al.  TianGong-ST: A New Dataset with Large-scale Refined Real-world Web Search Sessions , 2019, CIKM.

[11]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[12]  Florian Boudin,et al.  TopicRank: Graph-Based Topic Ranking for Keyphrase Extraction , 2013, IJCNLP.

[13]  Nitesh V. Chawla,et al.  CoupledLP: Link Prediction in Coupled Networks , 2015, KDD.

[14]  Dong Wang,et al.  Neural IR Meets Graph Embedding: A Ranking Model for Product Search , 2019, WWW.

[15]  M. de Rijke,et al.  Click Models for Web Search , 2015, Click Models for Web Search.

[16]  Zhiyuan Liu,et al.  Understanding the Behaviors of BERT in Ranking , 2019, ArXiv.

[17]  Yiqun Liu,et al.  Incorporating Non-sequential Behavior into Click Models , 2015, SIGIR.

[18]  W. Bruce Croft,et al.  A Deep Look into Neural Ranking Models for Information Retrieval , 2019, Inf. Process. Manag..

[19]  Zhiyuan Liu,et al.  End-to-End Neural Ad-hoc Ranking with Kernel Pooling , 2017, SIGIR.

[20]  Xueqi Cheng,et al.  Text Matching as Image Recognition , 2016, AAAI.

[21]  Thang D. Bui,et al.  Neural Graph Learning: Training Neural Networks Using Graphs , 2018, WSDM.

[22]  Jian-Yun Nie,et al.  Empirical Study of Multi-level Convolution Models for IR Based on Representations and Interactions , 2018, ICTIR.

[23]  Michael R. Lyu,et al.  Learning latent semantic relations from clickthrough data for query suggestion , 2008, CIKM '08.

[24]  Philip S. Yu,et al.  A Comprehensive Survey on Graph Neural Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[26]  Ben Carterette,et al.  Evaluating Retrieval over Sessions: The TREC Session Track 2011-2014 , 2016, SIGIR.

[27]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[28]  Yiqun Liu,et al.  Building a click model: From idea to practice , 2016, CAAI Trans. Intell. Technol..

[29]  Nick Craswell,et al.  Learning to Match using Local and Distributed Representations of Text for Web Search , 2016, WWW.

[30]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[31]  M. de Rijke,et al.  An Introduction to Click Models for Web Search: SIGIR 2015 Tutorial , 2015, SIGIR.

[32]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[33]  Jiang Guo,et al.  A General Framework for Content-enhanced Network Representation Learning , 2016, ArXiv.

[34]  Philip S. Yu,et al.  PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks , 2013, TKDD.

[35]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[36]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[37]  Xueqi Cheng,et al.  A Deep Investigation of Deep IR Models , 2017, ArXiv.

[38]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[39]  Yiqun Liu,et al.  Teach Machine How to Read: Reading Behavior Inspired Relevance Estimation , 2019, SIGIR.

[40]  Hang Li,et al.  Convolutional Neural Network Architectures for Matching Natural Language Sentences , 2014, NIPS.

[41]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[42]  W. Bruce Croft,et al.  Estimating Embedding Vectors for Queries , 2016, ICTIR.