Cross-Positional Attention for Debiasing Clicks

A well-known challenge in leveraging implicit user feedback like clicks to improve real-world search services and recommender systems is its inherent bias. Most existing click models are based on the examination hypothesis in user behaviors and differ in how to model such an examination bias. However, they are constrained by assuming a simple position-based bias or enforcing a sequential order in user examination behaviors. These assumptions are insufficient to capture complex real-world user behaviors and hardly generalize to modern user interfaces (UI) in web applications (e.g., results shown in a grid view). In this work, we propose a fully data-driven neural model for the examination bias, Cross-Positional Attention (XPA), which is more flexible in fitting complex user behaviors. Our model leverages the attention mechanism to effectively capture cross-positional interactions among displayed items and is applicable to arbitrary UIs. We employ XPA in a novel neural click model that can both predict clicks and estimate relevance. Our experiments on offline synthetic data sets show that XPA is robust among different click generation processes. We further apply XPA to a large-scale real-world recommender system, showing significantly better results than baselines in online A/B experiments that involve millions of users. This validates the necessity to model more complex user behaviors than those proposed in the literature.

[1]  Yi Tay,et al.  Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees? , 2021, ICLR.

[2]  W. Bruce Croft,et al.  Unbiased Learning to Rank with Unbiased Propensity Estimation , 2018, SIGIR.

[3]  M. de Rijke,et al.  Click Models for Web Search , 2015, Click Models for Web Search.

[4]  Chao Liu,et al.  Efficient multiple-click models in web search , 2009, WSDM '09.

[5]  Yiqun Liu,et al.  A Context-Aware Click Model for Web Search , 2020, WSDM.

[6]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[7]  Michael Bendersky,et al.  Permutation Equivariant Document Interaction Network for Neural Learning to Rank , 2020, ICTIR.

[8]  Zhen Qin,et al.  Do RNN and LSTM have Long Memory? , 2020, ICML.

[9]  Jian Xu,et al.  A Deep Recurrent Survival Model for Unbiased Ranking , 2020, SIGIR.

[10]  M. de Rijke,et al.  Ranking for Relevance and Display Preferences in Complex Presentation Layouts , 2018, SIGIR.

[11]  Thorsten Joachims,et al.  Unbiased Learning-to-Rank with Biased Feedback , 2016, WSDM.

[12]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.

[13]  Zhen Qin,et al.  Attribute-based Propensity for Unbiased Learning in Recommender Systems: Algorithm and Case Studies , 2020, KDD.

[14]  Li Wei,et al.  Recommending what video to watch next: a multitask ranking system , 2019, RecSys.

[15]  Jun Xu,et al.  SetRank: Learning a Permutation-Invariant Ranking Model for Information Retrieval , 2020, SIGIR.

[16]  Xiuqiang He,et al.  A General Knowledge Distillation Framework for Counterfactual Recommendation via Uniform Data , 2020, SIGIR.

[17]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[18]  Razvan Pascanu,et al.  On the difficulty of training recurrent neural networks , 2012, ICML.

[19]  Chao Liu,et al.  Click chain model in web search , 2009, WWW '09.

[20]  Yiqun Liu,et al.  Incorporating Non-sequential Behavior into Click Models , 2015, SIGIR.

[21]  Marc Najork,et al.  Learning to Rank with Selection Bias in Personal Search , 2016, SIGIR.

[22]  Yang Wang,et al.  Unbiased LambdaMART: An Unbiased Pairwise Learning-to-Rank Algorithm , 2018, WWW.

[23]  M. de Rijke,et al.  A Neural Click Model for Web Search , 2016, WWW.

[24]  Benjamin Piwowarski,et al.  A user browsing model to predict search engine click data from past observations. , 2008, SIGIR '08.

[25]  Marc Najork,et al.  Position Bias Estimation for Unbiased Learning to Rank in Personal Search , 2018, WSDM.

[26]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[27]  Thorsten Joachims,et al.  Estimating Position Bias without Intrusive Interventions , 2018, WSDM.

[28]  Fernando Diaz,et al.  Robust models of mouse movement on dynamic web search results pages , 2013, CIKM.

[29]  Meng Wang,et al.  Investigating Examination Behavior of Image Search Users , 2017, SIGIR.

[30]  Shaoping Ma,et al.  Constructing an Interaction Behavior Model for Web Image Search , 2018, SIGIR.

[31]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[32]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[33]  Thorsten Joachims,et al.  Intervention Harvesting for Context-Dependent Examination-Bias Estimation , 2018, SIGIR.

[34]  M. de Rijke,et al.  A Click Sequence Model for Web Search , 2018, SIGIR.

[35]  Shaoping Ma,et al.  Constructing Click Models for Mobile Search , 2018, SIGIR.

[36]  Weiwei Deng,et al.  Model Ensemble for Click Prediction in Bing Search Ads , 2017, WWW.

[37]  Yi Chang,et al.  Yahoo! Learning to Rank Challenge Overview , 2010, Yahoo! Learning to Rank Challenge.

[38]  Nan Wang,et al.  Non-Clicks Mean Irrelevant? Propensity Ratio Scoring As a Correction , 2021, WSDM.

[39]  Michael Bendersky,et al.  Feature Transformation for Neural Ranking Models , 2020, SIGIR.

[40]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[41]  Huifeng Guo,et al.  PAL: a position-bias aware learning framework for CTR prediction in live recommender systems , 2019, RecSys.

[42]  M. de Rijke,et al.  Cascade Model-based Propensity Estimation for Counterfactual Learning to Rank , 2020, SIGIR.

[43]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.