Eliminating Search Intent Bias in Learning to Rank

Click-through data has proven to be a valuable resource for improving search-ranking quality. Search engines can easily collect click data, but biases introduced in the data can make it difficult to use the data effectively. In order to measure the effects of biases, many click models have been proposed in the literature. However, none of the models can explain the observation that users with different search intent (e.g., informational, navigational, etc.) have different click behaviors. In this paper, we study how differences in user search intent can influence click activities and determined that there exists a bias between user search intent and the relevance of the document relevance. Based on this observation, we propose a search intent bias hypothesis that can be applied to most existing click models to improve their ability to learn unbiased relevance. Experimental results demonstrate that after adopting the search intent hypothesis, click models can better interpret user clicks and substantially improve retrieval performance.

[1]  Kenneth A. Loparo,et al.  A Common Gene Expression Signature Analysis Method for Multiple Types of Cancer , 2019, ICDM.

[2]  Kenneth A. Loparo,et al.  Information Extraction from Free Text in Clinical Trials with Knowledge-Based Distant Supervision , 2019, 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC).

[3]  Benjamin Piwowarski,et al.  A user browsing model to predict search engine click data from past observations. , 2008, SIGIR '08.

[4]  Li Qingshan,et al.  User personalization mechanism in Agent-based Meta search engine , 2012 .

[5]  Michael Bendersky,et al.  Addressing Trust Bias for Unbiased Learning-to-Rank , 2019, WWW.

[6]  Kenneth A. Loparo,et al.  Context Aware Image Annotation in Active Learning with Batch Mode , 2019, 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC).

[7]  Homa B. Hashemi,et al.  Query Intent Detection using Convolutional Neural Networks , 2016 .

[8]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[9]  Xin Li,et al.  Coupling feature selection and machine learning methods for navigational query identification , 2006, CIKM '06.

[10]  Ben Carterette,et al.  Estimating Clickthrough Bias in the Cascade Model , 2018, CIKM.

[11]  Yiqun Liu,et al.  Automatic Query Type Identification Based on Click Through Information , 2006, AIRS.

[12]  Zou Yan-xin,et al.  Ontology based user personalization mechanism in meta search engine , 2012, 2012 2nd International Conference on Uncertainty Reasoning and Knowledge Engineering.

[13]  Zhihua Zhang,et al.  Learning click models via probit bayesian inference , 2010, CIKM.

[14]  Kenneth A. Loparo,et al.  Knowledge-guided Text Structuring in Clinical Trials , 2019, ICDM.

[15]  李青山,et al.  Agent-based intelligent meta search engine system , 2012 .

[16]  Kenneth A. Loparo,et al.  Learning - based Adaptation Framework for Elastic Software Systems , 2019, SEKE.

[17]  Li Qingshan,et al.  Complex query recognition based on dynamic learning mechanism , 2012 .

[18]  Shaoping Ma,et al.  Constructing Click Models for Mobile Search , 2018, SIGIR.

[19]  Kenneth A. Loparo,et al.  Topic Shift Detection in Online Discussions using Structural Context , 2019, 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC).

[20]  Marc Najork,et al.  Learning to Rank with Selection Bias in Personal Search , 2016, SIGIR.

[21]  Yingcheng Sun,et al.  Conversational Structure Aware and Context Sensitive Topic Model for Online Discussions , 2020, 2020 IEEE 14th International Conference on Semantic Computing (ICSC).

[22]  Yifan Guo,et al.  Quantized Adversarial Training: An Iterative Quantized Local Search Approach , 2019, 2019 IEEE International Conference on Data Mining (ICDM).

[23]  Yiqun Liu,et al.  Training Deep Ranking Model with Weak Relevance Labels , 2017, ADC.

[24]  Yiqun Liu,et al.  Incorporating vertical results into search click models , 2013, SIGIR.

[25]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[26]  Chao Liu,et al.  Efficient multiple-click models in web search , 2009, WSDM '09.

[27]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[28]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[29]  Alejandro Figueroa,et al.  Exploring effective features for recognizing the user intent behind web queries , 2015, Comput. Ind..

[30]  Marc Najork,et al.  Position Bias Estimation for Unbiased Learning to Rank in Personal Search , 2018, WSDM.

[31]  Yuchen Zhang,et al.  User-click modeling for understanding and predicting search-behavior , 2011, KDD.

[32]  Yingcheng Sun,et al.  Opinion Spam Detection Based on Heterogeneous Information Network , 2019, 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI).

[33]  Kenneth A. Loparo,et al.  A Clicked-URL Feature for Transactional Query Identification , 2019, 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC).

[34]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[35]  Thorsten Joachims,et al.  Unbiased Learning-to-Rank with Biased Feedback , 2016, WSDM.

[36]  Mike Thelwall,et al.  Synthesis Lectures on Information Concepts, Retrieval, and Services , 2009 .

[37]  Yuchen Zhang,et al.  Characterizing search intent diversity into click models , 2011, WWW.

[38]  M. de Rijke,et al.  Using Intent Information to Model User Behavior in Diversified Search , 2013, DIR.

[39]  Edward Cutrell,et al.  An eye tracking study of the effect of target rank on web search , 2007, CHI.

[40]  Evgeniy Gabrilovich,et al.  Wikipedia-based Semantic Interpretation for Natural Language Processing , 2014, J. Artif. Intell. Res..

[41]  Yifan Guo,et al.  Differentially Private Community Detection in Attributed Social Networks , 2019, ACML.

[42]  Qingshan Li,et al.  An Agent Based Intelligent Meta Search Engine , 2012, WISM.

[43]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[44]  Ruohui Wang,et al.  Edge Detection Using Convolutional Neural Network , 2016, ISNN.

[45]  Nina Mishra,et al.  Domain bias in web search , 2012, WSDM '12.

[46]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[47]  Peter Mika,et al.  Ad-hoc object retrieval in the web of data , 2010, WWW '10.

[48]  Yingcheng Sun,et al.  Context Aware Image Annotation in Active Learning , 2020, ArXiv.