Subtopic Mining Based on Head-Modifier Relation and Co-occurrence of Intents Using Web Documents

This paper proposes a method that mines subtopics using the head-modifier relation and co-occurrence of users' intents from web documents in Japanese. We extracted subtopics using the simple patterns based on the head-modifier relation between the query and its adjacent words, and returned the ranked list of subtopics by the proposed score equation. We re-ranked subtopics according to the intent co-occurrence measure. Our method achieved good performance than the baseline methods and suggested queries from the major web search engine. The results of our method will be useful in various search scenarios, such as query suggestion and result diversification.

[1]  Claudio Carpineto,et al.  An information-theoretic approach to automatic query expansion , 2001, TOIS.

[2]  Yen-Jen Oyang,et al.  Relevant term suggestion in interactive web search based on contextual information in query session logs , 2003, J. Assoc. Inf. Sci. Technol..

[3]  Craig MacDonald,et al.  Exploiting query reformulations for web search result diversification , 2010, WWW '10.

[4]  Gareth J. F. Jones,et al.  Applying summarization techniques for term selection in relevance feedback , 2001, SIGIR '01.

[5]  Sumio Fujita,et al.  Click-graph modeling for facet attribute estimation of web search queries , 2010, RIAO.

[6]  Craig MacDonald,et al.  University of Glasgow at the NTCIR-9 Intent task: Experiments with Terrier on Subtopic Mining and Document Ranking , 2011, NTCIR.

[7]  Tetsuya Sakai,et al.  Microsoft Research Asia at the NTCIR-10 Intent Task , 2013, NTCIR.

[8]  Xianpei Han,et al.  ISCAS at Subtopic Mining Task in NTCIR9 , 2011, NTCIR.

[9]  Benjamin Rey,et al.  Generating query substitutions , 2006, WWW '06.

[10]  Tetsuya Sakai,et al.  Constructing a Test Collection with Multi-Intent Queries , 2010, EVIA@NTCIR.

[11]  Mark Sanderson,et al.  Ambiguous queries: test collections need more sense , 2008, SIGIR '08.

[12]  Wei-Ying Ma,et al.  Learning to cluster web search results , 2004, SIGIR '04.

[13]  Prasenjit Mitra,et al.  Query suggestions in the absence of query logs , 2011, SIGIR.

[14]  Ricardo A. Baeza-Yates,et al.  Query Recommendation Using Query Logs in Search Engines , 2004, EDBT Workshops.

[15]  Doug Beeferman,et al.  Agglomerative clustering of a search engine query log , 2000, KDD '00.

[16]  W. Bruce Croft,et al.  Query reformulation using anchor text , 2010, WSDM '10.