Clustering Search Engine Suggests by Modeling Topics of Web Pages collected with Suggests

In this paper, we address the issue of how to overview the knowledge of a given query keyword. We especially focus on concerns of those who search for Web pages with a given query keyword, and study how to efficiently overview the whole list of Web search information needs of a given query keyword. First, we collect Web search information needs of a given query keyword through search engine suggests. Although we collect up to around 1,000 suggests given a query keyword, some of them are redundant in that they originate from almost the same Web search information needs. In order to aggregate such redundant search engine suggests, we take an approach of clustering search engine suggests based on a topic model. Evaluation result shows that the proposed clustering approach proves to be quite useful for efficiently overviewing Web search information needs of a given query keyword. We also develop an interface system for overviewing those aggregated search engine suggests of a given query keyword as well as links to top ranked Web pages that are closely related to those aggregated search engine suggests.

[1]  Michael R. Lyu,et al.  Learning latent semantic relations from clickthrough data for query suggestion , 2008, CIKM '08.

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  Sadao Kurohashi,et al.  Web Information Organization Using Keyword Distillation Based Clustering , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[4]  Ryoji Kataoka,et al.  Search Result Clustering Using Informatively Named Entities , 2007, Int. J. Hum. Comput. Interact..

[5]  M. de Rijke,et al.  Identifying Facets in Query-Biased Sets of Blog Posts , 2007, ICWSM.

[6]  Sadao Kurohashi,et al.  Summarizing Search Results using PLSI , 2010 .

[7]  Xueqi Cheng,et al.  A structured approach to query recommendation with social annotation data , 2010, CIKM.

[8]  Kenneth Wai-Ting Leung,et al.  Personalized Concept-Based Clustering of Search Engine Queries , 2008, IEEE Transactions on Knowledge and Data Engineering.