Online Expectation-Maximization for Click Models

Click models allow us to interpret user click behavior in search interactions and to remove various types of bias from user clicks. Existing studies on click models consider a static scenario where user click behavior does not change over time. We show empirically that click models deteriorate over time if retraining is avoided. We then adapt online expectation-maximization (EM) techniques to efficiently incorporate new click/skip observations into a trained click model. Our instantiation of Online EM for click models is orders of magnitude more efficient than retraining the model from scratch using standard EM, while loosing little in quality. To deal with outdated click information, we propose a variant of online EM called EM with Forgetting, which surpasses the performance of complete retraining while being as efficient as Online EM.

[1]  Chao Liu,et al.  Bayesian Browsing Model: Exact Inference of Document Relevance from Petabyte-Scale Data , 2010, TKDD.

[2]  Benjamin Piwowarski,et al.  A user browsing model to predict search engine click data from past observations. , 2008, SIGIR '08.

[3]  Bo Thiesson,et al.  Accelerating EM for Large Databases , 2001, Machine Learning.

[4]  Chao Liu,et al.  Click chain model in web search , 2009, WWW '09.

[5]  Maarten de Rijke,et al.  Evaluating and Analyzing Click Simulation in Web Search , 2017, ICTIR.

[6]  M. de Rijke,et al.  A Comparative Study of Click Models for Web Search , 2015, CLEF.

[7]  M. de Rijke,et al.  Click Models for Web Search , 2015, Click Models for Web Search.

[8]  Ciya Liao,et al.  A model to estimate intrinsic document relevance from the clickthrough logs of a web search engine , 2010, WSDM '10.

[9]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[10]  Milad Shokouhi,et al.  Behavioral dynamics on the web: Learning, modeling, and prediction , 2013, TOIS.

[11]  Mike Thelwall,et al.  Synthesis Lectures on Information Concepts, Retrieval, and Services , 2009 .

[12]  Dan Klein,et al.  Online EM for Unsupervised Models , 2009, NAACL.

[13]  M. de Rijke,et al.  Click model-based information retrieval metrics , 2013, SIGIR.

[14]  Chao Liu,et al.  BBM: bayesian browsing model from petabyte-scale data , 2009, KDD.

[15]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[16]  Maarten de Rijke,et al.  Vertical-Aware Click Model-Based Effectiveness Metrics , 2014, CIKM.

[17]  Yiqun Liu,et al.  Incorporating user preferences into click models , 2013, CIKM.