Exploiting Linked Open Data in Cold-start Recommendations with Positive-only Feedback

In recommender systems, user preferences can be acquired either explicitly by means of ratings, or implicitly --e.g., by processing text reviews, and by mining item browsing and purchasing records. Most existing collaborative filtering approaches have been designed to deal with numerical ratings, such as the 5-star ratings in Amazon and Netflix, for both rating prediction and item ranking (a.k.a. top-N recommendation) tasks. In many e-commerce and social network sites, however, user preferences are usually expressed in the form of binary and unary (positive-only) ratings, such as the thumbs up/down in YouTube and the likes in Facebook, respectively. Moreover, in these cases, the well-known problem of cold-start --i.e., the scarcity of user preferences-- is highly remarkable. To address this situation, we explore a number of graph-based and matrix factorization recommendation models that jointly exploit user ratings and item metadata. In this work, such metadata are automatically obtained from DBpedia --the queriable and structured version of Wikipedia which is considered as the core knowledge repository of the Linked Open Data initiative--, and the models are evaluated with a Facebook dataset covering three distinct domains, namely books, movies and music. The results achieved in our experiments show that the proposed hybrid recommendation models, which exploit rating and semantic data, outperform content-based and collaborative filtering baselines.

[1]  Alejandro Bellogín,et al.  Precision-oriented evaluation of recommender systems: an algorithmic comparison , 2011, RecSys '11.

[2]  Philip S. Yu,et al.  HeteSim: A General Framework for Relevance Measure in Heterogeneous Networks , 2013, IEEE Transactions on Knowledge and Data Engineering.

[3]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[4]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[5]  Iván Cantador,et al.  Alleviating the new user problem in collaborative filtering by exploiting personality information , 2016, User Modeling and User-Adapted Interaction.

[6]  Roberto Turrin,et al.  Cross-Domain Recommender Systems , 2015, Recommender Systems Handbook.

[7]  Sangkeun Lee,et al.  PathRank: a novel node ranking measure on a heterogeneous graph for recommender systems , 2012, CIKM.

[8]  Gediminas Adomavicius,et al.  Improving Aggregate Recommendation Diversity Using Ranking-Based Techniques , 2012, IEEE Transactions on Knowledge and Data Engineering.

[9]  Martha Larson,et al.  CLiMF: learning to maximize reciprocal rank with collaborative less-is-more filtering , 2012, RecSys.

[10]  Paolo Tomeo,et al.  An analysis of users' propensity toward diversity in recommendations , 2014, RecSys '14.

[11]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[12]  Paolo Tomeo,et al.  SPrank: Semantic Path-Based Ranking for Top-N Recommendations Using Linked Open Data , 2016, ACM Trans. Intell. Syst. Technol..

[13]  Joseph A. Konstan,et al.  Evaluating recommender behavior for new users , 2014, RecSys '14.

[14]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[15]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[16]  Yizhou Sun,et al.  Personalized entity recommendation: a heterogeneous information network approach , 2014, WSDM.

[17]  George Karypis,et al.  Sparse linear methods with side information for top-n recommendations , 2012, RecSys.

[18]  Geoffrey J. Gordon,et al.  Relational learning via collective matrix factorization , 2008, KDD.

[19]  George Karypis,et al.  SLIM: Sparse Linear Methods for Top-N Recommender Systems , 2011, 2011 IEEE 11th International Conference on Data Mining.

[20]  Francesco Ricci,et al.  Cold-Start Management with Cross-Domain Collaborative Filtering and Tags , 2013, EC-Web.

[21]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[22]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[23]  Martha Larson,et al.  Collaborative Filtering beyond the User-Item Matrix , 2014, ACM Comput. Surv..