The Minimum Conductance Dissimilarity Cut ( MCDC ) Algorithm to Increase Novelty and Diversity of Recommendations

As established by Herlocker et al. and Ekstrand et al., recommender systems that consider diversity and novelty as well as expected ratings when selecting sets of items to recommend can increase user engagement and exposure to a broader base of inventory addressing the Pigeonhole and Blockbuster problems. Novelty, diversity, and expected rating define a multi-objective optimization problem that is difficult to solve. Existing approaches rely on iterative optimization techniques like evolutionary algorithms or simulated annealing. In this paper, we present MCDC, an algorithm for approximately solving this problem as an eigenvector problem, solving a graph partition problem on a weighted dissimilarity graph. MCDC efficiently exploits the local-global structure of the problem (novelty is a global property and diversity is a local property) to approximate a Pareto-optimal balance between novelty, coverage (which is related to diversity), and the recommendation score. Results with several datasets, including Jester and the latest Movielens data, show that MCDC provides recommendations sets with up to 21% higher combined geometric mean values and that MCDC is up to an order of magnitude faster than previous methods. MCDC can be easily applied to any recommender system based on a user-item ratings matrix. Code is provided at: https://github.com/sjyk/mcdc

[1]  Mi Zhang,et al.  Avoiding monotony: improving the diversity of recommendation lists , 2008, RecSys '08.

[2]  J. Borwein The geometry of Pareto efficiency over cones , 1980 .

[3]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[4]  Saul Vargas,et al.  Novelty and diversity enhancement and evaluation in recommender systems and information retrieval , 2014, SIGIR.

[5]  Charles L. A. Clarke,et al.  Novelty and diversity in information retrieval evaluation , 2008, SIGIR '08.

[6]  Santosh S. Vempala,et al.  On clusterings-good, bad and spectral , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[7]  L. Thompson,et al.  Improving the creativity of organizational work groups , 2003 .

[8]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[9]  F. Maxwell Harper,et al.  User perception of differences in recommender algorithms , 2014, RecSys '14.

[10]  R. Armstrong The Long Tail: Why the Future of Business Is Selling Less of More , 2008 .

[11]  Andrew Topper,et al.  Facilitating Student Interactions through Discursive Moves: An Instructor's Experience Teaching Online Graduate Courses in Educational Technology. , 2005 .

[12]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  S. Krishnan,et al.  Comparing Three Online Civic Engagement Platforms using the “ Spectrum of Public Participation ” Framework , 2014 .

[14]  Reynold Xin,et al.  GraphX: a resilient distributed graph system on Spark , 2013, GRADES.

[15]  Kenneth Y. Goldberg,et al.  Eigentaste 5.0: constant-time adaptability in a recommender system using item clustering , 2007, RecSys '07.

[16]  Sanjay Krishnan,et al.  A methodology for learning, analyzing, and mitigating social influence bias in recommender systems , 2014, RecSys '14.

[17]  Mark W. Krentel The Complexity of Optimization Problems , 1986, Computational Complexity Conference.

[18]  Yi Zhang,et al.  Novelty and redundancy detection in adaptive filtering , 2002, SIGIR '02.

[19]  Yisong Yue,et al.  Linear Submodular Bandits and their Application to Diversified Retrieval , 2011, NIPS.

[20]  Dan Cosley,et al.  Think different: increasing online community participation using uniqueness and group dissimilarity , 2004, CHI.

[21]  Adriano Veloso,et al.  Pareto-efficient hybridization for multi-objective recommender systems , 2012, RecSys.

[22]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[23]  Alison Williams,et al.  Collaborative creativity: a complex systems model with distributed affect , 2011, CHI.

[24]  Y. Censor Pareto optimality in multiobjective problems , 1977 .

[25]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.