Suggestions for Fresh Search Queries by Mining Microblog Topics

Query suggestion in Web search is an effective way to help users express their information needs quickly and retrieve the information they want more accurately. All major Web search engines, and most proposed query suggestion methods, rely on search engine query logs to determine possible suggestions. However, it is much more difficult for a search system to suggest relevant queries for a fresh search query, that is, one with little or no historical evidence in the query logs. In this paper, we propose a suggestion approach for fresh queries that mines a new form of social network media, i.e., microblog topics. We leverage the comment information in microblog topics to mine potential suggestions, and we use word frequency statistics to extract an ordered set of candidate words. As soon as a user starts typing a query word, the candidate words that match the partially typed word are selected as its completions and offered as query suggestions. We collect a dataset from Sina microblog topics and compare the final results obtained with different suggestion context sources. The experimental results clearly demonstrate the effectiveness of our approach in suggesting high-quality queries. Our conclusion is that a suggestion context source consisting of a topic's tweets from authenticated Sina users is more effective than one consisting of the tweets from all Sina users.
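The two steps described above (frequency-ordered candidate extraction, then prefix completion) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `build_candidates` and `suggest`, the whitespace tokenization, the optional stopword set, and the alphabetical tie-breaking are all assumptions made for illustration. Real Sina microblog text is Chinese and would need a word segmenter in place of `split()`.

```python
from collections import Counter


def build_candidates(comments, stopwords=frozenset()):
    """Return candidate words ordered by descending frequency.

    Assumption: words are whitespace-separated. Chinese text would
    need a proper word segmenter here instead of str.split().
    """
    counts = Counter()
    for comment in comments:
        for word in comment.lower().split():
            if word not in stopwords:
                counts[word] += 1
    # Most frequent first; ties broken alphabetically (an assumption).
    return sorted(counts, key=lambda w: (-counts[w], w))


def suggest(prefix, candidates, k=5):
    """Return up to k candidates that complete the partial query word."""
    prefix = prefix.lower()
    if not prefix:
        return []
    # candidates is already ranked, so the first k matches are the top k.
    return [w for w in candidates if w.startswith(prefix)][:k]


if __name__ == "__main__":
    # Hypothetical comments attached to one microblog topic.
    comments = [
        "earthquake relief donation",
        "earthquake rescue team",
        "rescue earth",
    ]
    ranked = build_candidates(comments)
    print(suggest("ea", ranked))
    print(suggest("re", ranked))
```

Because the candidate list is ranked once per topic, each keystroke only requires a linear scan for matching prefixes. For large vocabularies, a trie or a sorted index would be a natural replacement, though the abstract does not say which structure the authors used.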
