Missing Data Modeling with User Activity and Item Popularity in Recommendation

User feedback such as movie watching history, ratings and consumptions of products, is valuable for improving the performance of recommender systems. However, only a few interactions between users and items can be observed in implicit data. The missing of a user-item entry is caused by two reasons: the user didn’t see the item (in most cases); or the user saw but disliked it. Separating these two cases leads to modeling missing interactions at a finer granularity, which is helpful in understanding users’ preferences more accurately. However, the former case has not been well-studied in previous work. Most existing studies resort to assign a uniform weight to the missing data, while such a uniform assumption is invalid in real-world settings. In this paper, we propose a novel approach to weight the missing data based on user activity and item popularity, which is more effective and flexible than the uniform-weight assumption. Experimental results based on 2 real-world datasets (Movielens, Flixster) show that our approach outperforms 3 state-of-the-art models including BPR, WMF, and ExpoMF.

[1]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[2]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[3]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[4]  Nicolas Kourtellis,et al.  Dynamic Matrix Factorization with Priors on Unknown Values , 2015, KDD.

[5]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[6]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[7]  David M. Blei,et al.  Modeling User Exposure in Recommendation , 2015, WWW.

[8]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[9]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[10]  George Karypis,et al.  SLIM: Sparse Linear Methods for Top-N Recommender Systems , 2011, 2011 IEEE 11th International Conference on Data Mining.

[11]  Domonkos Tikk,et al.  Fast als-based matrix factorization for explicit and implicit feedback datasets , 2010, RecSys '10.

[12]  Yiqun Liu,et al.  Predicting the popularity of web 2.0 items based on user comments , 2014, SIGIR.

[13]  Harald Steck,et al.  Training and testing of recommender systems on data missing not at random , 2010, KDD.

[14]  Tat-Seng Chua,et al.  Fast Matrix Factorization for Online Recommendation with Implicit Feedback , 2016, SIGIR.

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  Maksims Volkovs,et al.  Effective Latent Models for Binary Feedback in Recommender Systems , 2015, SIGIR.

[17]  Qiang Yang,et al.  One-Class Collaborative Filtering , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[18]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.