Mining, Ranking and Recommending Entity Aspects

Entity queries constitute a large fraction of web search queries and most of these queries are in the form of an entity mention plus some context terms that represent an intent in the context of that entity. We refer to these entity-oriented search intents as entity aspects. Recognizing entity aspects in a query can improve various search applications such as providing direct answers, diversifying search results, and recommending queries. In this paper we focus on the tasks of identifying, ranking, and recommending entity aspects, and propose an approach that mines, clusters, and ranks such aspects from query logs. We perform large-scale experiments based on users' search sessions from actual query logs to evaluate the aspect ranking and recommendation tasks. In the aspect ranking task, we aim to satisfy most users' entity queries, and evaluate this task in a query-independent fashion. We find that entropy-based methods achieve the best performance compared to maximum likelihood and language modeling approaches. In the aspect recommendation task, we recommend other aspects related to the aspect currently being queried. We propose two approaches based on semantic relatedness and aspect transitions within user sessions and find that a combined approach gives the best performance. As an additional experiment, we utilize entity aspects for actual query recommendation and find that our approach improves the effectiveness of query recommendations built on top of the query-flow graph.

[1]  ChengXiang Zhai,et al.  Mining entity attribute synonyms via compact clustering , 2013, CIKM.

[2]  Maarten de Rijke,et al.  Identifying entity aspects in microblog posts , 2012, SIGIR '12.

[3]  Peter Mika,et al.  Ad-hoc object retrieval in the web of data , 2010, WWW '10.

[4]  Ryen W. White,et al.  Supporting Complex Search Tasks , 2014, CIKM.

[5]  Fabrizio Silvestri,et al.  Efficient query recommendations in the long tail via center-piece subgraphs , 2012, SIGIR '12.

[6]  Fabrizio Silvestri,et al.  Identifying task-based sessions in search engine query logs , 2011, WSDM '11.

[7]  Emine Yilmaz,et al.  Entity Oriented Task Extraction from Query Logs , 2014, CIKM.

[8]  Aristides Gionis,et al.  Improving recommendation for long-tail queries via templates , 2011, WWW.

[9]  Michael Gamon,et al.  Active objects: actions for entity-centric search , 2012, WWW.

[10]  ChengXiang Zhai,et al.  Unsupervised identification of synonymous query intent templates for attribute intents , 2013, CIKM.

[11]  Wei Chu,et al.  Learning to extract cross-session search tasks , 2013, WWW.

[12]  Patrick Pantel,et al.  Jigs and Lures: Associating Web Queries with Structured Entities , 2011, ACL.

[13]  Benjamin Van Durme,et al.  What You Seek Is What You Get: Extraction of Class Attributes from Query Logs , 2007, IJCAI.

[14]  M. de Rijke,et al.  Adding semantics to microblog posts , 2012, WSDM '12.

[15]  Francesco Bonchi,et al.  From machu_picchu to "rafting the urubamba river": anticipating information needs via the entity-query graph , 2013, WSDM '13.

[16]  Hongbo Deng,et al.  Identifying and labeling search tasks via query-based hawkes processes , 2014, KDD.

[17]  Yang Song,et al.  Evaluating the effectiveness of search task trails , 2012, WWW.

[18]  Qinghua Zheng,et al.  Mining query subtopics from search log data , 2012, SIGIR '12.

[19]  Julio Gonzalo,et al.  A comparison of extrinsic clustering evaluation metrics based on formal constraints , 2009, Information Retrieval.

[20]  James Allan,et al.  Task-aware query recommendation , 2013, SIGIR.

[21]  Niranjan Balasubramanian,et al.  Topic Pages: An Alternative to the Ten Blue Links , 2010, 2010 IEEE Fourth International Conference on Semantic Computing.

[22]  Aristides Gionis,et al.  The query-flow graph: model and applications , 2008, CIKM '08.

[23]  Michael Gamon,et al.  Mining Entity Types from Query Logs via User Intent Modeling , 2012, ACL.

[24]  Xiaoxin Yin,et al.  Building taxonomy of web search intents for name entity queries , 2010, WWW '10.

[25]  Jackie Chi Kit Cheung,et al.  Sequence clustering and labeling for unsupervised query intent discovery , 2012, WSDM '12.

[26]  Fabrizio Silvestri,et al.  Discovering tasks from search engine query logs , 2013, TOIS.

[27]  Giuseppe Ottaviano,et al.  Fast and Space-Efficient Entity Linking for Queries , 2015, WSDM.

[28]  Fabrizio Silvestri,et al.  Modeling and predicting the task-by-task behavior of search engine users , 2013, OAIR.

[29]  M. de Rijke,et al.  Using Intent Information to Model User Behavior in Diversified Search , 2013, DIR.

[30]  Rosie Jones,et al.  Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs , 2008, CIKM '08.

[31]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[32]  Soumen Chakrabarti,et al.  Learning joint query interpretation and response ranking , 2013, WWW '13.

[33]  M. de Rijke,et al.  Mapping queries to the Linking Open Data cloud: A case study using DBpedia , 2011, J. Web Semant..

[34]  Roi Blanco,et al.  Entity Recommendations in Web Search , 2013, SEMWEB.

[35]  Maarten de Rijke,et al.  Prior-informed Distant Supervision for Temporal Evidence Classification , 2014, COLING.

[36]  Roi Blanco,et al.  Web usage mining with semantic analysis , 2013, WWW.

[37]  Wei Song,et al.  Multi-aspect query summarization by composite query , 2012, SIGIR '12.

[38]  Jayant Madhavan,et al.  Identifying Aspects for Web-Search Queries , 2011, J. Artif. Intell. Res..