Using digest pages to increase user result space: Preliminary designs

It is well known that in web search, users access only a small fraction of the presented results. Increasing the result space of web users to provide them more relevant information but without expecting them to access more results is thus important as the amount of information published on the web is continuously growing. In this paper, we introduce the concept of a digest page, which is a fictitious document built from the clustering of result documents returned by a search engine as answers to a query, and where each cluster has its documents summarized into the digest page. This paper presents preliminary designs regarding the construction, the presentation, and the ranking of digest pages. It also shows how digest pages can be used to capture the context of a query through the concept of an aggregated digest page, which is based on the aggregated search paradigm offered by some search engines.

[1]  Amanda Spink,et al.  From E-Sex to E-Commerce: Web Search Changes , 2002, Computer.

[2]  Amanda Spink,et al.  An analysis of document viewing patterns of Web search engine users , 2005 .

[3]  Filip Radlinski,et al.  Improving personalized web search using result diversification , 2006, SIGIR.

[4]  David Evans,et al.  Tracking and summarizing news on a daily basis with Columbia's Newsblaster , 2002 .

[5]  Weiguo Fan,et al.  WebInEssence: A Personalized Web-Based Multi-Document Summarization and Recommendation System , 2008 .

[6]  SpinkAmanda,et al.  Real life, real users, and real needs , 2000 .

[7]  Dianne P. O'Leary,et al.  QCS: A system for querying, clustering and summarizing documents , 2007, Inf. Process. Manag..

[8]  Susan T. Dumais,et al.  Optimizing search by showing results in context , 2001, CHI.

[9]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[10]  Wei-Ying Ma,et al.  Learning to cluster web search results , 2004, SIGIR '04.

[11]  Hua Li,et al.  Improving web search results using affinity graph , 2005, SIGIR '05.

[12]  Barry Smyth,et al.  On the Importance of Being Diverse: Analysing Similarity and Diversity in Web Search , 2004, Intelligent Information Processing.

[13]  David McSherry,et al.  Diversity-Conscious Retrieval , 2002, ECCBR.

[14]  Susan T. Dumais,et al.  Beyond the Commons: Investigating the Value of Personalizing Web Search , 2005 .

[15]  Amanda Spink,et al.  How are we searching the World Wide Web? A comparison of nine search engine transaction logs , 2006, Inf. Process. Manag..

[16]  Amanda Spink,et al.  An Analysis of Web Documents Retrieved and Viewed , 2003, International Conference on Internet Computing.

[17]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[18]  Peter Ingwersen,et al.  The development of a method for the evaluation of interactive information retrieval systems , 1997, J. Documentation.

[19]  Dragomir R. Radev,et al.  NewsInEssence: summarizing online news topics , 2005, Commun. ACM.

[20]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[21]  Stephen E. Robertson,et al.  Deciphering cluster representations , 2001, Inf. Process. Manag..