Ranking with Diverse Intents and Correlated Contents

We consider the following document ranking problem: We have a collection of documents, each containing some topics (e.g. sports, politics, economics). We also have a set of users with diverse interests. Assume that user u is interested in a subset I_u of topics. Each user u is also associated with a positive integer K_u, which indicates that u can be satisfied by any K_u topics in I_u. Each document s contains information for a subset C_s of topics. The objective is to pick one document at a time such that the average satisfying time is minimized, where a user's satisfying time is the first time that at least K_u topics in I_u are covered in the documents selected so far. Our main result is an O({\rho})-approximation algorithm for the problem, where {\rho} is the algorithmic integrality gap of the linear programming relaxation of the set cover instance defined by the documents and topics. This result generalizes the constant approximations for generalized min-sum set cover and ranking with unrelated intents and the logarithmic approximation for the problem of ranking with submodular valuations (when the submodular function is the coverage function), and can be seen as an interpolation between these results. We further extend our model to the case when each user may interest in more than one sets of topics and when the user's valuation function is XOS, and obtain similar results for these models.

[1]  D. Atkin OR scheduling algorithms. , 2000, Anesthesiology.

[2]  Jian Li,et al.  A unified approach to ranking in probabilistic databases , 2009, The VLDB Journal.

[3]  Nikhil Bansal,et al.  A constant factor approximation algorithm for generalized min-sum set cover , 2010, SODA '10.

[4]  Thomas Erlebach,et al.  Constant-Factor Approximation for Minimum-Weight (Connected) Dominating Sets in Unit Disk Graphs , 2006, APPROX-RANDOM.

[5]  Jun Wang,et al.  Portfolio theory of information retrieval , 2009, SIGIR.

[6]  Atri Rudra,et al.  Approximating Matches Made in Heaven , 2009, ICALP.

[7]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[8]  S. Robertson The probability ranking principle in IR , 1997 .

[9]  S. Boucheron,et al.  A sharp concentration inequality with applications , 1999, Random Struct. Algorithms.

[10]  Matt Gibson,et al.  Algorithms for Dominating Set in Disk Graphs: Breaking the logn Barrier - (Extended Abstract) , 2010, ESA.

[11]  Subhash Khot,et al.  Optimal Long Code Test with One Free Bit , 2009, 2009 50th Annual IEEE Symposium on Foundations of Computer Science.

[12]  Yossi Azar,et al.  Multiple intents re-ranking , 2009, STOC '09.

[13]  Kasturi R. Varadarajan Weighted geometric set cover via quasi-uniform sampling , 2010, STOC '10.

[14]  Chandra Chekuri,et al.  Dependent Randomized Rounding for Matroid Polytopes and Applications , 2009, 0909.4348.

[15]  Sreenivas Gollapudi,et al.  An axiomatic approach for result diversification , 2009, WWW '09.

[16]  Yossi Azar,et al.  Ranking with submodular valuations , 2010, SODA '11.

[17]  Refael Hassin,et al.  An Approximation Algorithm for the Minimum Latency Set Cover Problem , 2005, ESA.

[18]  Yuli Ye,et al.  Max-Sum diversification, monotone submodular functions and dynamic updates , 2012, PODS '12.

[19]  Russ Bubley,et al.  Randomized algorithms , 1995, CSUR.

[20]  Jade Goldstein-Stewart,et al.  The use of MMR, diversity-based reranking for reordering documents and producing summaries , 1998, SIGIR '98.

[21]  Edith Cohen,et al.  Efficient sequences of trials , 2003, SODA '03.

[22]  Nabil H. Mustafa,et al.  PTAS for geometric hitting set problems via local search , 2009, SCG '09.

[23]  David P. Williamson,et al.  A note on the generalized min-sum set cover problem , 2011, Oper. Res. Lett..

[24]  Vijay V. Vazirani,et al.  Approximation Algorithms , 2001, Springer Berlin Heidelberg.

[25]  Maxim Sviridenko,et al.  Preemptive and non-preemptive generalized min sum set cover , 2014, Math. Program..

[26]  László Lovász,et al.  Approximating Min Sum Set Cover , 2004, Algorithmica.

[27]  Daniel Lehmann,et al.  Combinatorial auctions with decreasing marginal utilities , 2001, EC '01.

[28]  Dror Rawitz,et al.  Hitting sets when the VC-dimension is small , 2005, Inf. Process. Lett..

[29]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[30]  Atri Rudra,et al.  When LP Is the Cure for Your Matching Woes: Improved Bounds for Stochastic Matchings , 2010, Algorithmica.

[31]  Timothy M. Chan,et al.  Approximation Algorithms for Maximum Independent Set of Pseudo-Disks , 2009, Discrete & Computational Geometry.

[32]  Evaggelia Pitoura,et al.  Search result diversification , 2010, SGMD.

[33]  Sriram V. Pemmaraju,et al.  Approximation Algorithms for Domatic Partitions of Unit Disk Graphs , 2009, APPROX-RANDOM.

[34]  Michael T. Goodrich,et al.  Almost optimal set covers in finite VC-dimension , 1995, Discret. Comput. Geom..

[35]  Filip Radlinski,et al.  Learning diverse rankings with multi-armed bandits , 2008, ICML '08.

[36]  Tao Li,et al.  Addressing diverse user preferences in SQL-query-result navigation , 2007, SIGMOD '07.

[37]  Joseph Naor,et al.  Approximation Algorithms for Diversified Search Ranking , 2010, ICALP.

[38]  J. G. Pierce,et al.  Geometric Algorithms and Combinatorial Optimization , 2016 .