Incorporating Position Bias into Click-Through Bipartite Graph

Click-through bipartite graph has been regarded as an effective method in search user behavior analysis researches. In most existing bipartite graph construction studies, user clicks are treated as equally important. However, considering the existence of position bias factor in user click-through behavior, clicks on results in different ranking positions should be treated separately. In this work, we choose a classical click-through bipartite graph model, which named label propagation model, and evaluate the improvement of performance by considering the effect of position bias. We propose three hypotheses to explain the influence of position bias, and modify the formulas of label propagation algorithm. We use AUC as the evaluation metric, which express the effectiveness of spam URLs identification by label propagation algorithm and its improved methods. The experimental results demonstrate that the proposed methods work better than the baseline method.

[1]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[2]  Yiqun Liu,et al.  Fighting against web spam: a novel propagation method based on click-through data , 2012, SIGIR '12.

[3]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[4]  Farzin Maghoul,et al.  Query clustering using click-through graph , 2009, WWW '09.

[5]  Wei Wu,et al.  Learning query and document similarities from click-through bipartite graph with metadata , 2013, WSDM.

[6]  Yiqun Liu,et al.  Pornography Detection with the Wisdom of Crowds , 2013, AIRS.

[7]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[8]  Thorsten Joachims,et al.  Accurately interpreting clickthrough data as implicit feedback , 2005, SIGIR '05.

[9]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[10]  Wei-Ying Ma,et al.  Optimizing web search using web click-through data , 2004, CIKM '04.

[11]  Xiao Li,et al.  Learning query intent from regularized click graphs , 2008, SIGIR '08.

[12]  ChengXiang Zhai,et al.  Learning Query and Document Relevance from a Web-scale Click Graph , 2016, SIGIR.

[13]  Ioannis Antonellis,et al.  Simrank++: query rewriting through link analysis of the clickgraph (poster) , 2007, Proc. VLDB Endow..

[14]  Enhong Chen,et al.  Context-aware query suggestion by mining click-through and session data , 2008, KDD.

[15]  Susan T. Dumais,et al.  Learning user interaction models for predicting web search result preferences , 2006, SIGIR.