Predicting Preference Tags to Improve Item Recommendation

Collaborative filtering (CF) based recommender systems identify and recommend interesting items to a given user based on the user’s past rating activity. These systems improve their recommendations by identifying user preferences and item related information from external sources, like reviews written by users, or concept tags shared by users about these items. These preferences are often reflected through a multi-criterion rating. In this study, we seek to improve recommender systems by integrating user preferences as side information within standard neighborhoodbased and matrix factorization based methods. We assume that a user’s choice of tags for an item provides additional information about the user’s personal preference and additional features about the item. Since, querying users to provide tags and multi-criteria rating imposes an additional burden on the user base, we propose using collective classification to predict tags for both the users and items. We also investigate the use of active learning approaches integrated within the collective classification framework when tag information (users or items) is limited. Our experimental results on several real world datasets show the advantages of using tag-based information within the recommender systems. We are also able to show the effectiveness of collective classification algorithms in estimating user preferences and item features. Keywords— Tag-based Recommendation System, Active Learning, Collective Classification

[1]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[2]  Pasquale Lops,et al.  Content-based Recommender Systems: State of the Art and Trends , 2011, Recommender Systems Handbook.

[3]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[4]  Wu-Jun Li,et al.  TagiCoFi: tag informed collaborative filtering , 2009, RecSys '09.

[5]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.

[6]  Huzefa Rangwala,et al.  FLIP: Active Learning for Relational Network Classification , 2014, ECML/PKDD.

[7]  Lise Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[8]  Yiming Yang,et al.  Personalized active learning for collaborative filtering , 2008, SIGIR '08.

[9]  Francesco Ricci,et al.  Cold-Start Management with Cross-Domain Collaborative Filtering and Tags , 2013, EC-Web.

[10]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[11]  Luo Si,et al.  A Bayesian Approach toward Active Learning for Collaborative Filtering , 2004, UAI.

[12]  Martha Larson,et al.  Tags as bridges between domains: improving recommendation with tag-induced cross-domain collaborative filtering , 2011, UMAP'11.

[13]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[14]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[15]  Steffen Rendle Scaling Factorization Machines to Relational Data , 2013, Proc. VLDB Endow..

[16]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[17]  Jennifer Neville,et al.  Iterative Classification in Relational Data , 2000 .

[18]  Lars Schmidt-Thieme,et al.  Semi-supervised Tag Recommendation - Using Untagged Resources to Mitigate Cold-Start Problems , 2010, PAKDD.

[19]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[20]  Nan Du,et al.  Improved recommendation based on collaborative tagging behaviors , 2008, IUI '08.

[21]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[22]  Hui Xiong,et al.  User Preference Learning with Multiple Information Fusion for Restaurant Recommendation , 2014, SDM.

[23]  Sean M. McNee,et al.  Getting to know you: learning new user preferences in recommender systems , 2002, IUI '02.

[24]  Foster J. Provost,et al.  Classification in Networked Data: a Toolkit and a Univariate Case Study , 2007, J. Mach. Learn. Res..

[25]  Thomas Hofmann,et al.  Collaborative filtering via gaussian probabilistic latent semantic analysis , 2003, SIGIR.

[26]  David M. Pennock,et al.  Categories and Subject Descriptors , 2001 .

[27]  Yi-Cheng Zhang,et al.  Tag-Aware Recommender Systems: A State-of-the-Art Survey , 2011, Journal of Computer Science and Technology.

[28]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[29]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[30]  Philip S. Yu,et al.  Multi-Label Collective Classification , 2011, SDM.