From "Selena Gomez" to "Marlon Brando": Understanding Explorative Entity Search

Consider a user who submits a search query "Shakira" having a specific search goal in mind (such as her age) but at the same time willing to explore information for other entities related to her, such as comparable singers. In previous work, a system called Spark, was developed to provide such search experience. Given a query submitted to the Yahoo search engine, Spark provides related entity suggestions for the query, exploiting, among else, public knowledge bases from the Semantic Web. We refer to this search scenario as explorative entity search. The effectiveness and efficiency of the approach has been demonstrated in previous work. The way users interact with these related entity suggestions and whether this interaction can be predicted have however not been studied. In this paper, we perform a large-scale analysis into how users interact with the entity results returned by Spark. We characterize the users, queries and sessions that appear to promote an explorative behavior. Based on this analysis, we develop a set of query and user-based features that reflect the click behavior of users and explore their effectiveness in the context of a prediction task.

[1]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[2]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[3]  Kevin Chen-Chuan Chang,et al.  EntityRank: Searching Entities Directly and Holistically , 2007, VLDB.

[4]  Filip Radlinski,et al.  How does clickthrough data reflect retrieval quality? , 2008, CIKM '08.

[5]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[6]  Yi Chang,et al.  Ranking related entities for web search queries , 2011, WWW.

[7]  Ian H. Witten,et al.  Learning to link with wikipedia , 2008, CIKM '08.

[8]  Ben Carterette,et al.  Simulating simple user behavior for system effectiveness evaluation , 2011, CIKM '11.

[9]  Susan T. Dumais,et al.  Modeling and predicting behavioral dynamics on the web , 2012, WWW.

[10]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[11]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[12]  Morgan Harvey,et al.  Workshop on searching for fun 2014 , 2014, IIiX.

[13]  Doug Downey,et al.  Models of Searching and Browsing: Languages, Studies, and Application , 2007, IJCAI.

[14]  Edgar Meij,et al.  Investigating the Semantic Gap through Query Log Analysis , 2009, SEMWEB.

[15]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[16]  James Allan,et al.  Predicting searcher frustration , 2010, SIGIR.

[17]  Erick Cantú-Paz,et al.  Personalized click prediction in sponsored search , 2010, WSDM '10.

[18]  Huajun Chen,et al.  The Semantic Web , 2011, Lecture Notes in Computer Science.

[19]  Ryen W. White,et al.  Characterizing and predicting search engine switching behavior , 2009, CIKM.

[20]  Roi Blanco,et al.  Effective and Efficient Entity Search in RDF Data , 2011, SEMWEB.

[21]  Xiaojun Yuan,et al.  Building the trail best traveled: effects of domain knowledge on web search trailblazing , 2012, CHI.

[22]  Xiaoxin Yin,et al.  Building taxonomy of web search intents for name entity queries , 2010, WWW '10.

[23]  Ryen W. White,et al.  Task tours: helping users tackle complex search tasks , 2012, CIKM.

[24]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[25]  Amanda Spink,et al.  Determining the user intent of web search engine queries , 2007, WWW '07.

[26]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[27]  Malcolm Slaney,et al.  Precision-Recall Is Wrong for Multimedia , 2011, IEEE MultiMedia.

[28]  Michael Gamon,et al.  Active objects: actions for entity-centric search , 2012, WWW.

[29]  Ganesh Ramakrishnan,et al.  Collective annotation of Wikipedia entities in web text , 2009, KDD.

[30]  Yang Song,et al.  Evaluating and predicting user engagement change with degraded search relevance , 2013, WWW.

[31]  Ryen W. White,et al.  Exploratory Search: Beyond the Query-Response Paradigm , 2009, Exploratory Search: Beyond the Query-Response Paradigm.

[32]  Benjamin Rey,et al.  Generating query substitutions , 2006, WWW '06.

[33]  Roi Blanco,et al.  Entity Recommendations in Web Search , 2013, SEMWEB.

[34]  Roi Blanco,et al.  Web usage mining with semantic analysis , 2013, WWW.

[35]  Roi Blanco,et al.  Enhanced results for web search , 2011, SIGIR.

[36]  Kuansan Wang,et al.  PSkip: estimating relevance ranking quality from web search clickthrough data , 2009, KDD.

[37]  Gerhard Weikum,et al.  Robust Disambiguation of Named Entities in Text , 2011, EMNLP.