The recommender problem revisited

In 2006, Netflix announced a $1M prize competition to advance recommendation algorithms. The recommendation problem was simplified as the accuracy in predicting a user rating measured by the Root Mean Squared Error (RMSE). While that formulation helped get the attention of the research community in the area, it may have put an excessive focus on what is simply one of the many possible approaches to recommendations. In this tutorial we will describe different components of modern recommender systems such as: personalized ranking, similarity, explanations, context-awareness, or search as recommendation. We will use the Netflix use case as a driving example of a prototypical industrial-scale recommender system that has evolved from focusing on rating prediction to full page optimization. We will also review the usage of modern algorithmic approaches that include algorithms such as Factorization Machines [9], Restricted Boltzmann Machines [10], SimRank [7], Deep Neural Networks, or Listwise Learning-to-rank [6, 12, 11]. The original recommendation problem was formulated around the existence of explicit user ratings. However, recommender systems can be built using different kinds of data including implicit behavioral data, social connections, or demographics. In this tutorial we will review the usage of different data types and discuss what the availability of Big Data has brought into the research area. Finally, and also related to the availability of large quantities of data, we will talk about how system and architectural decisions play a role in understanding the recommender problem. This tutorial is in part based on recent publications by the author [5, 1, 8, 2, 4, 3].

[1]  Martha Larson,et al.  TFMAP: optimizing MAP for top-n context-aware recommendation , 2012, SIGIR '12.

[2]  Nuria Oliver,et al.  Multiverse recommendation: n-dimensional tensor factorization for context-aware collaborative filtering , 2010, RecSys '10.

[3]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[4]  Shuang-Hong Yang,et al.  Collaborative competitive filtering: learning recommender using context of user choice , 2011, SIGIR.

[5]  Bamshad Mobasher,et al.  Recommendation with Differential Context Weighting , 2013, UMAP.

[6]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[7]  Bernd Ludwig,et al.  InCarMusic: Context-Aware Music Recommendations in a Car , 2011, EC-Web.

[8]  Tian Xia,et al.  Direct optimization of ranking measures for learning to rank models , 2013, KDD.

[9]  Xavier Amatriain,et al.  Data Mining Methods for Recommender Systems , 2011, Recommender Systems Handbook.

[10]  Francesco Ricci,et al.  Context-based splitting of item ratings in collaborative filtering , 2009, RecSys '09.

[11]  Chao Liu,et al.  Wisdom of the better few: cold start recommendation via representative based rating elicitation , 2011, RecSys '11.

[12]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[13]  Xavier Amatriain,et al.  Big & personal: data and models behind netflix recommendations , 2013, BigMine '13.

[14]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[15]  Philomena Jurey Fair and Balanced , 2005 .

[16]  Lars Schmidt-Thieme,et al.  Fast context-aware recommendations with factorization machines , 2011, SIGIR.

[17]  Bamshad Mobasher,et al.  Differential Context Relaxation for Context-Aware Travel Recommendation , 2012, EC-Web.

[18]  Linas Baltrunas,et al.  Towards Time-Dependant Recommendation based on Implicit Feedback , 2009 .

[19]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[20]  Gediminas Adomavicius,et al.  Context-aware recommender systems , 2008, RecSys '08.

[21]  Xavier Amatriain,et al.  Building industrial-scale real-world recommender systems , 2012, RecSys.

[22]  Bernd Ludwig,et al.  Matrix factorization techniques for context aware recommendation , 2011, RecSys '11.

[23]  Martha Larson,et al.  CLiMF: learning to maximize reciprocal rank with collaborative less-is-more filtering , 2012, RecSys.

[24]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[25]  Tie-Yan Liu,et al.  Learning to rank: from pairwise approach to listwise approach , 2007, ICML '07.

[26]  Sahin Albayrak,et al.  Inferring Contextual User Profiles - Improving Recommender Performance , 2011 .

[27]  Martin Ester,et al.  TrustWalker: a random walk model for combining trust-based and item-based recommendation , 2009, KDD.

[28]  Thore Graepel,et al.  Matchbox: large scale online bayesian recommendations , 2009, WWW '09.

[29]  Christopher J. C. Burges,et al.  From RankNet to LambdaRank to LambdaMART: An Overview , 2010 .

[30]  Xavier Amatriain,et al.  Mining large streams of user data for personalized recommendations , 2013, SKDD.