Towards Optimal Active Learning for Matrix Factorization in Recommender Systems

Recommender systems help web users to address information overload. However their performance depends on the number of provided ratings by users. This problem is amplified for a new user because he/she has not provided any rating. To address this problem, active learning methods have been proposed to acquire those ratings from users, that will help most in determining their interests. The optimal active learning selects a query that directly optimizes the expected error for the test data. This approach is applicable for prediction models in which this question can be answered in closed-form given the distribution of test data is known. Unfortunately, there are many tasks and models for which the optimal selection cannot efficiently be found in closed-form. Therefore, most of the active learning methods optimize different, non-optimal criteria, such as uncertainty. Nevertheless, in this paper we exploit the characteristics of matrix factorization, which leads to a closed-form solution and by being inspired from existing optimal active learning for the regression task, develop a method that approximates the optimal solution for recommender systems. Our results demonstrate that the proposed method improves the prediction accuracy of MF.

[1]  Arnd Kohrs,et al.  Improving collaborative filtering for new-users by smart object selection , 2001 .

[2]  Arnold W. M. Smeulders,et al.  Active learning using pre-clustering , 2004, ICML.

[3]  Andrew McCallum,et al.  Toward Optimal Active Learning through Monte Carlo Estimation of Error Reduction , 2001, ICML 2001.

[4]  W. Newey,et al.  Large sample estimation and hypothesis testing , 1986 .

[5]  Sean M. McNee,et al.  Getting to know you: learning new user preferences in recommender systems , 2002, IUI '02.

[6]  DIMITRIOS PIERRAKOS,et al.  User Modeling and User-Adapted Interaction , 1994, User Modeling and User-Adapted Interaction.

[7]  Kun Deng,et al.  Balancing exploration and exploitation: a new algorithm for active machine learning , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[8]  Lars Schmidt-Thieme,et al.  Online-updating regularized kernel matrix factorization models for large-scale recommender systems , 2008, RecSys '08.

[9]  Daphne Koller,et al.  Active Learning for Parameter Estimation in Bayesian Networks , 2000, NIPS.

[10]  Greg Schohn,et al.  Less is More: Active Learning with Support Vector Machines , 2000, ICML.

[11]  Lars Schmidt-Thieme,et al.  Non-myopic active learning for recommender systems based on Matrix Factorization , 2011, 2011 IEEE International Conference on Information Reuse & Integration.

[12]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[13]  Robin D. Burke,et al.  Hybrid Recommender Systems: Survey and Experiments , 2002, User Modeling and User-Adapted Interaction.

[14]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[15]  Luo Si,et al.  A Bayesian Approach toward Active Learning for Collaborative Filtering , 2004, UAI.

[16]  Ran El-Yaniv,et al.  Online Choice of Active Learning Algorithms , 2003, J. Mach. Learn. Res..

[17]  BurkeRobin Hybrid Recommender Systems , 2002 .

[18]  Lars Schmidt-Thieme,et al.  Active learning for aspect model in recommender systems , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[19]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[20]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[21]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[22]  Thomas Hofmann,et al.  Latent Class Models for Collaborative Filtering , 1999, IJCAI.

[23]  Thomas Hofmann,et al.  Collaborative filtering via gaussian probabilistic latent semantic analysis , 2003, SIGIR.

[24]  Craig Boutilier,et al.  Active Collaborative Filtering , 2002, UAI.