Diversifying search results of controversial queries

Diversifying search results of queries seeking for different view points about controversial topics is key to improving satisfaction of users. The challenge for finding different opinions is how to maximize the number of discussed arguments without being biased against specific sentiments. This paper addresses the issue by first introducing a new model that represents the patterns occurring in documents about controversial topics. Second, proposing an opinion diversification model that uses (1) relevance of documents, (2) semantic diversification to capture different arguments and (3) sentiment diversification to identify positive, negative and neutral sentiments about the query topic. We have conducted our experiments using queries on various controversial topics and applied our diversification model on the set of documents returned by Google search engine. The results show that our model outperforms the native ranking of Web pages about controversial topics by a significant margin.

[1]  Charles L. A. Clarke,et al.  Novelty and diversity in information retrieval evaluation , 2008, SIGIR '08.

[2]  Jun Wang,et al.  Portfolio theory of information retrieval , 2009, SIGIR.

[3]  Gianluca Demartini,et al.  ARES: A Retrieval Engine Based on Sentiments - Sentiment-Based Search Result Annotation and Diversification , 2011, ECIR.

[4]  Janyce Wiebe,et al.  Just How Mad Are You? Finding Strong and Weak Opinion Clauses , 2004, AAAI.

[5]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[6]  Sreenivas Gollapudi,et al.  An axiomatic approach for result diversification , 2009, WWW '09.

[7]  Refael Hassin,et al.  Approximation algorithms for maximum dispersion , 1997, Oper. Res. Lett..

[8]  Ben Carterette,et al.  Probabilistic models of ranking novel documents for faceted topic retrieval , 2009, CIKM.

[9]  Filip Radlinski,et al.  Improving personalized web search using result diversification , 2006, SIGIR.

[10]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[11]  B. Korte,et al.  An Analysis of the Greedy Heuristic for Independence Systems , 1978 .

[12]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[13]  Xiaoyan Zhu,et al.  Movie review mining and summarization , 2006, CIKM '06.

[14]  Craig MacDonald,et al.  Selectively diversifying web search results , 2010, CIKM.

[15]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[16]  John D. Lafferty,et al.  A risk minimization framework for information retrieval , 2006, Inf. Process. Manag..

[17]  Hua Xu,et al.  Clustering product features for opinion mining , 2011, WSDM '11.

[18]  Gerhard Weikum,et al.  Language-model-based pro/con classification of political text , 2010, SIGIR.

[19]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.