Evaluating content novelty in recommender systems

Recommender systems are frequently evaluated using performance indexes based on variants and extensions of precision-like measures. As these measures are biased toward popular items, a list of recommendations simply must include a few popular items to perform well. To address the popularity bias challenge, new approaches for novelty and diversity evaluation have been proposed. On the one hand, novelty-based approaches model the quality of being new as apposed to that which is already known. Novelty approaches are commonly based on item views or user rates. On the other hand, diversity approaches model the quality of an item that is composed of different content elements. Diversity measures are commonly rooted in content-based features that characterize the diversity of the content of an item in terms of the presence/absence of a number of predefined nuggets of information. As item contents are also biased to popular contents (e.g., drama in movies or pop in music), diversity-based measures are also popularity biased. To alleviate the effect of popularity bias on diversity measures, we used an evaluation approach based on the degree of novelty of the elements that make up each item. We named this approach content novelty, as it mixes content and diversity approaches in a single and coherent evaluation framework. Experimental results show that our proposal is feasible and useful. Our findings demonstrate that the proposed measures yield consistent and interpretable results, producing insights that reduce the impact of popularity bias in the evaluation of recommender systems.

[1]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.

[2]  Ricardo A. Baeza-Yates,et al.  Modeling user search behavior , 2005, Third Latin American Web Congress (LA-WEB'2005).

[3]  Thomas Hofmann,et al.  Latent semantic models for collaborative filtering , 2004, TOIS.

[4]  Zhenhua Wang,et al.  An improved collaborative movie recommendation system using computational intelligence , 2014, J. Vis. Lang. Comput..

[5]  Ricardo Baeza-Yates,et al.  A Web Search Analysis Considering the Intention behind Queries , 2008, 2008 Latin American Web Conference.

[6]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[7]  David M. Pennock,et al.  Probabilistic Models for Unified Collaborative and Content-Based Recommendation in Sparse-Data Environments , 2001, UAI.

[8]  Charles L. A. Clarke,et al.  Novelty and diversity in information retrieval evaluation , 2008, SIGIR '08.

[9]  Diana Inkpen,et al.  A survey of book recommender systems , 2017, Journal of Intelligent Information Systems.

[10]  Domonkos Tikk,et al.  Major components of the gravity recommendation system , 2007, SKDD.

[11]  Pasquale Lops,et al.  Content-based Recommender Systems: State of the Art and Trends , 2011, Recommender Systems Handbook.

[12]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[13]  Arbee L. P. Chen,et al.  A Music Recommendation System Based on Music and User Grouping , 2005, Journal of Intelligent Information Systems.

[14]  Saul Vargas,et al.  Novelty and Diversity in Recommender Systems , 2015, Recommender Systems Handbook.

[15]  Tefko Saracevic,et al.  Evaluation of evaluation in information retrieval , 1995, SIGIR '95.

[16]  Michael J. Pazzani,et al.  Content-Based Recommendation Systems , 2007, The Adaptive Web.

[17]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[18]  CARLOS A. GOMEZ-URIBE,et al.  The Netflix Recommender System , 2015, ACM Trans. Manag. Inf. Syst..

[19]  Wei Chen,et al.  Opinion-Based Collaborative Filtering to Solve Popularity Bias in Recommender Systems , 2013, DEXA.

[20]  Ian Soboroff,et al.  Collaborative filtering and the generalized vector space model (poster session) , 2000, SIGIR '00.

[21]  Carlos A. Hurtado,et al.  A Statistical Model of Query Log Generation , 2006, SPIRE.

[22]  Alejandro Bellogín,et al.  Precision-oriented evaluation of recommender systems: an algorithmic comparison , 2011, RecSys '11.

[23]  Sergei Vassilvitskii,et al.  Getting recommender systems to think outside the box , 2009, RecSys '09.

[24]  Michael D. Ekstrand,et al.  Recommender Response to Diversity and Popularity Bias in User Profiles , 2017, FLAIRS.

[25]  John Riedl,et al.  The Tag Genome: Encoding Community Knowledge to Support Novel Interaction , 2012, TIIS.

[26]  Marko Tkalcic,et al.  Evaluating Group Recommender Systems , 2018 .

[27]  Maria Soledad Pera,et al.  All The Cool Kids, How Do They Fit In?: Popularity and Demographic Biases in Recommender Evaluation and Effectiveness , 2018, FAT.

[28]  Anita Elberse,et al.  The Power of Stars: Do Star Actors Drive the Success of Movies? , 2007 .

[29]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[30]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[31]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[32]  Giuseppe M. L. Sarnè A novel hybrid approach improving effectiveness of recommender systems , 2014, Journal of Intelligent Information Systems.

[33]  Michael Hahsler recommenderlab: An R Framework for Developing and Testing Recommendation Algorithms , 2022, ArXiv.

[34]  Matevz Kunaver,et al.  Diversity in recommender systems - A survey , 2017, Knowl. Based Syst..

[35]  Michael J. Pazzani,et al.  A Framework for Collaborative, Content-Based and Demographic Filtering , 1999, Artificial Intelligence Review.