Functional Faceted Web Query Analysis

We propose a faceted classification scheme for web queries. Unlike previous work, our functional scheme ties its classification to actionable strategies for search engines to take. Our scheme consists of four facets of ambiguity, authority sensitivity, temporal sensitivity and spatial sensitivity. We hypothesize that the classification of queries into such facets yields insight on user intent and information needs. To validate our classification scheme, we asked users to annotate queries with respect to our facets and obtained high agreement. We also assess the coverage of our faceted classification on a random sample of queries from logs. Finally, we discuss the algorithmic approaches we take in our current work to automate such faceted classification.

[1]  Zhenyu Liu,et al.  Automatic identification of user goals in Web search , 2005, WWW '05.

[2]  Jimmy J. Lin,et al.  The role of context in question answering systems , 2003, CHI Extended Abstracts.

[3]  Bernard J. Jansen,et al.  A review of web searching studies and a framework for future research , 2001 .

[4]  Ellen M. Voorhees,et al.  Overview of the TREC 2004 Novelty Track. , 2005 .

[5]  Jimmy J. Lin,et al.  What Makes a Good Answer? The Role of Context in Question Answering , 2003, INTERACT.

[6]  In-Ho Kang,et al.  Query type classification for web document retrieval , 2003, SIGIR.

[7]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[8]  Steve Chien,et al.  Semantic similarity between search engine queries using temporal correlation , 2005, WWW '05.

[9]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[10]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[11]  Ying Li,et al.  Detecting dominant locations from search queries , 2005, SIGIR '05.

[12]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[13]  Luis Gravano,et al.  Categorizing web queries according to geographical locality , 2003, CIKM '03.

[14]  Min-Yen Kan,et al.  Detecting and supporting known item queries in online public access catalogs , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[15]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[16]  Charles L. A. Clarke,et al.  Question Answering By Passage Selection , 2008 .

[17]  Nenad Stojanovic,et al.  On Analysing Query Ambiguity for Query Refinement: The Librarian Agent Approach , 2003, ER.

[18]  Eduard Hovy,et al.  A question/answer typology with surface text patterns , 2002 .