A Multi-faceted Approach to Query Intent Classification

In this paper we report results for automatic classification of queries in a wide set of facets that are useful to the identification of query intent. Our hypothesis is that the performance of single-faceted classification of queries can be improved by introducing information of multi-faceted training samples into the learning process. We test our hypothesis by performing a multi-faceted classification of queries based on the combination of correlated facets. Our experimental results show that this idea can significantly improve the quality of the classification. Since most of previous works in query intent classification are oriented to the study of single facets, these results are a first step to an integrated query intent classification model.

[1]  Amanda Spink,et al.  Determining the informational, navigational, and transactional intent of Web queries , 2008, Inf. Process. Manag..

[2]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[3]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[4]  Yu-ping Qin,et al.  Study on Multi-label Text Classification Based on SVM , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[5]  Ricardo A. Baeza-Yates,et al.  The Intention Behind Web Queries , 2006, SPIRE.

[6]  Yiqun Liu,et al.  Automatic Query Type Identification Based on Click Through Information , 2006, AIRS.

[7]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[8]  Fang Liu,et al.  Automatic user goals identification based on anchor text and click-through data , 2008, Wuhan University Journal of Natural Sciences.

[9]  Xiao Li,et al.  Learning with click graph for query intent classification , 2010, TOIS.

[10]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[11]  Zhenyu Liu,et al.  Automatic identification of user goals in Web search , 2005, WWW '05.

[12]  Ricardo Baeza-Yates,et al.  Towards a Deeper Understanding of the User’s Query Intent , 2010 .

[13]  Roi Blanco,et al.  Probabilistic static pruning of inverted files , 2010, TOIS.

[14]  Parthasarathy Ramachandran,et al.  Intent based clustering of search engine query log , 2009, 2009 IEEE International Conference on Automation Science and Engineering.

[15]  Min-Yen Kan,et al.  Functional Faceted Web Query Analysis , 2007 .

[16]  Krishna Bharat,et al.  Diversifying web search results , 2010, WWW '10.

[17]  Marco Cristo,et al.  Exploring features for the automatic identification of user goals in web search , 2010, Inf. Process. Manag..

[18]  Luis Gravano,et al.  Categorizing web queries according to geographical locality , 2003, CIKM '03.

[19]  Qiang Yang,et al.  Building bridges for web query classification , 2006, SIGIR.

[20]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[21]  Fuchun Peng,et al.  Improving search relevance for implicitly temporal queries , 2009, SIGIR.

[22]  Yong Yu,et al.  Identification of ambiguous queries in web search , 2009, Inf. Process. Manag..

[23]  Ji-Rong Wen,et al.  Multi-dimensional search result diversification , 2011, WSDM '11.

[24]  Susan T. Dumais,et al.  To personalize or not to personalize: modeling queries with variation in user intent , 2008, SIGIR '08.

[25]  Wei Vivian Zhang,et al.  Geographic intention and modification in web search , 2008, Int. J. Geogr. Inf. Sci..

[26]  Jaime Teevan,et al.  Query log analysis: social and technological challenges , 2007, SIGF.

[27]  Ophir Frieder,et al.  Improving automatic query classification via semi-supervised learning , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[28]  Fernando Diaz,et al.  Temporal profiles of queries , 2007, TOIS.

[29]  N. Belkin,et al.  A classification of interactions with information , 2002 .

[30]  Wojciech Rytter,et al.  Extracting Powers and Periods in a String from Its Runs Structure , 2010, SPIRE.