Current Approaches to Search Result Diversification

With the growth of the Web and the increasing variety of search engine users, diversifying search results can improve Web search effectiveness and user satisfaction. This paper surveys recent approaches to search result diversification in both full-text and structured content search. We identify commonalities in the proposed methods and describe an overall framework for result diversification. We discuss different diversity dimensions and measures, as well as possible ways of handling the relevance/diversity trade-off. We also summarise existing efforts to evaluate diversity in search. Moreover, for each of these steps, we point out aspects that are missing in current approaches as possible directions for future work.
