Multi-view random walk framework for search task discovery from click-through log

Search engine users often have clear search tasks hidden behind their queries. Inspired by this, the modern search engines are providing an increasing number of services to help users simplify their key tasks. However, the problem of what are the major user search tasks with high traffic for which search engines should design special services is still underexplored. In this paper, we propose a novel Multi-view Random Walk (MRW) algorithm to measure the search task oriented similarity between queries, and then group search queries with similar tasks so that the major search tasks of users can be identified from search engine click-through log. The proposed MRW, which is a general framework to combine knowledge from different views in a random walk process, allows the random surfer to walk across different views to integrate information for search task discovery. Experimental results on click-through log of a commonly used commercial search engine show that our proposed MRW algorithm can effectively discover user search tasks.

[1]  Gang Wang,et al.  Understanding user's query intent with wikipedia , 2009, WWW '09.

[2]  W. Reed The Pareto, Zipf and other power laws , 2001 .

[3]  Tie-Yan Liu,et al.  Actively predicting diverse search intent from user browsing behaviors , 2010, WWW '10.

[4]  Koby Crammer,et al.  Resolving Identity Uncertainty with Learned Random Walks , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[5]  Xiaoxin Yin,et al.  Building taxonomy of web search intents for name entity queries , 2010, WWW '10.

[6]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[7]  Christopher J. C. Burges,et al.  Spectral clustering and transductive learning with multiple views , 2007, ICML '07.

[8]  Hongyan Liu,et al.  TagClus: a random walk-based method for tag clustering , 2010, Knowledge and Information Systems.

[9]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[10]  Aristides Gionis,et al.  Query similarity by projecting the query-flow graph , 2010, SIGIR.

[11]  Shui-Lung Chuang,et al.  Towards automatic generation of query taxonomy: a hierarchical query clustering approach , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[12]  Qiang Yang,et al.  Building bridges for web query classification , 2006, SIGIR.

[13]  Ryen W. White,et al.  Studying the use of popular destinations to enhance web search interaction , 2007, SIGIR.

[14]  William W. Cohen,et al.  Learning Graph Walk Based Similarity Measures for Parsed Text , 2008, EMNLP.

[15]  Zhenyu Liu,et al.  Automatic identification of user goals in Web search , 2005, WWW '05.

[16]  Lu Wang,et al.  Clustering query refinements by user intent , 2010, WWW '10.

[17]  Hang Li,et al.  Named entity mining from click-through data using weakly supervised latent dirichlet allocation , 2009, KDD.