Identifying Aspects for Web-Search Queries

Many web-search queries serve as the beginning of an exploration of an unknown space of information, rather than looking for a specific web page. To answer such queries effectively, the search engine should attempt to organize the space of relevant information in a way that facilitates exploration. We describe the ASPECTOR system that computes aspects for a given query. Each aspect is a set of search queries that together represent a distinct information need relevant to the original search query. To serve as an effective means to explore the space, ASPECTOR computes aspects that are orthogonal to each other and to have high combined coverage. ASPECTOR combines two sources of information to compute aspects. We discover candidate aspects by analyzing query logs, and cluster them to eliminate redundancies. We then use a mass-collaboration knowledge base (e.g., Wikipedia) to compute candidate aspects for queries that occur less frequently and to group together aspects that are likely to be "semantically" related. We present a user study that indicates that the aspects we compute are rated favorably against three competing alternatives - related searches proposed by Google, cluster labels assigned by the Clusty search engine, and navigational searches proposed by Bing.

[1]  Ron Weiss,et al.  Fast and effective query refinement , 1997, SIGIR '97.

[2]  Fuchun Peng,et al.  Unsupervised query segmentation using generative language models and wikipedia , 2008, WWW.

[3]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[4]  Aristides Gionis,et al.  Topical query decomposition , 2008, KDD.

[5]  Daniela Rus,et al.  Journal of Graph Algorithms and Applications the Star Clustering Algorithm for Static and Dynamic Information Organization , 2022 .

[6]  Fernando Pereira,et al.  Lightly-Supervised Attribute Extraction , 2007 .

[7]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[8]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[9]  Wei-Ying Ma,et al.  Learning to cluster web search results , 2004, SIGIR '04.

[10]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[11]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[12]  Qi Su,et al.  Internet-scale collection of human-reviewed data , 2007, WWW '07.

[13]  ChengXiang Zhai,et al.  Learn from web search logs to organize search results , 2007, SIGIR.

[14]  Ryen W. White,et al.  Query suggestion based on user landing pages , 2007, SIGIR.

[15]  Chong Wang,et al.  Continuous Time Dynamic Topic Models , 2008, UAI.

[16]  Kevin Li,et al.  Faceted metadata for image search and browsing , 2003, CHI '03.

[17]  Benjamin Van Durme,et al.  What You Seek Is What You Get: Extraction of Class Attributes from Query Logs , 2007, IJCAI.

[18]  Shrikanth Narayanan,et al.  Collaborative classification applications in sensor networks , 2002, Sensor Array and Multichannel Signal Processing Workshop Proceedings, 2002.

[19]  Xiaoying Gao,et al.  Query Directed Web Page Clustering , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[20]  Qin Iris Wang,et al.  Learning Noun Phrase Query Segmentation , 2007, EMNLP.

[21]  Benjamin Rey,et al.  Generating query substitutions , 2006, WWW '06.

[22]  James Allan,et al.  Web Page Clustering Using Heuristic Search in the Web Graph , 2007, IJCAI.

[23]  Reiner Kraft,et al.  Mining anchor text for query refinement , 2004, WWW '04.

[24]  Deepayan Chakrabarti,et al.  Mining broad latent query aspects from search sessions , 2009, KDD.

[25]  Shankar Kumar,et al.  Video suggestion and discovery for youtube: taking random walks through the view graph , 2008, WWW.

[26]  Ryen W. White,et al.  Exploratory Search: Beyond the Query-Response Paradigm , 2009, Exploratory Search: Beyond the Query-Response Paradigm.

[27]  John Battelle The Search: How Google and Its Rivals Rewrote the Rules of Business and Transformed Our Culture , 2005 .