Factorization Machines Leveraging Lightweight Linked Open Data-Enabled Features for Top-N Recommendations

With the popularity of Linked Open Data (LOD) and the associated rise in freely accessible knowledge that can be accessed via LOD, exploiting LOD for recommender systems has been widely studied based on various approaches such as graph-based or using different machine learning models with LOD-enabled features. Many of the previous approaches require construction of an additional graph to run graph-based algorithms or to extract path-based features by combining user- item interactions (e.g., likes, dislikes) and background knowledge from LOD. In this paper, we investigate Factorization Machines (FMs) based on particularly lightweight LOD-enabled features which can be directly obtained via a public SPARQL Endpoint without any additional effort to construct a graph. Firstly, we aim to study whether using FM with these lightweight LOD-enabled features can provide competitive performance compared to a learning-to-rank approach leveraging LOD as well as other well-established approaches such as kNN-item and BPRMF. Secondly, we are interested in finding out to what extent each set of LOD-enabled features contributes to the recommendation performance. Experimental evaluation on a standard dataset shows that our proposed approach using FM with lightweight LOD-enabled features provides the best performance compared to other approaches in terms of five evaluation metrics. In addition, the study of the recommendation performance based on different sets of LOD-enabled features indicate that property-object lists and PageRank scores of items are useful for improving the performance, and can provide the best performance through using them together for FM. We observe that subject-property lists of items does not contribute to the recommendation performance but rather decreases the performance.

[1]  Amit P. Sheth,et al.  Harnessing relationships for domain-specific subgraph extraction: A recommendation use case , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[2]  Lars Schmidt-Thieme,et al.  MyMediaLite: a free recommender system library , 2011, RecSys '11.

[3]  Conor Hayes,et al.  SemStim at the LOD-RecSys 2014 Challenge , 2014, SemWebEval@ESWC.

[4]  Pasquale Lops,et al.  Semantics-aware Graph-based Recommender Systems Exploiting Linked Open Data , 2016, UMAP.

[5]  Alexandre Passant,et al.  dbrec - Music Recommendations Using DBpedia , 2010, SEMWEB.

[6]  Achim Rettinger,et al.  PageRank on Wikipedia: Towards General Importance Scores for Entities , 2016, @ESWC.

[7]  Qiang Wu,et al.  Adapting boosting for information retrieval measures , 2010, Information Retrieval.

[8]  José Paulo Leal,et al.  Using proximity to compute semantic relatedness in RDF graphs , 2013, Comput. Sci. Inf. Syst..

[9]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[10]  Matthew Rowe Transferring Semantic Categories with Vertex Kernels: Recommendations with SemanticSVD++ , 2014, International Semantic Web Conference.

[11]  John G. Breslin,et al.  Computing the Semantic Similarity of Resources in DBpedia for Recommendation Purposes , 2015, JIST.

[12]  Alexandre Passant,et al.  Measuring Semantic Distance on Linking Data and Using it for Resources Recommendations , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[13]  Tommaso Di Noia,et al.  Linked Open Data-Enabled Recommender Systems: ESWC 2014 Challenge on Book Recommendation , 2014, SemWebEval@ESWC.

[14]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[15]  Tommaso Di Noia,et al.  Top-N recommendations from implicit feedback leveraging linked open data , 2013, IIR.

[16]  Pasquale Lops,et al.  ExpLOD: A Framework for Explaining Recommendations based on the Linked Open Data Cloud , 2016, RecSys.

[17]  Tommaso Di Noia,et al.  Exploiting the web of data in model-based recommender systems , 2012, RecSys.

[18]  Jennifer Widom,et al.  SimRank: a measure of structural-context similarity , 2002, KDD.

[19]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[20]  Benjamin Heitmann,et al.  An open framework for multi-source, cross-domain personalisation with semantic interest graphs , 2012, RecSys.

[21]  Conor Hayes,et al.  Using Linked Data to Build Open, Collaborative Recommender Systems , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[22]  Phuong Nguyen,et al.  An evaluation of SimRank and Personalized PageRank to build a recommender system for the Web of Data , 2015, WWW.

[23]  Markus Zanker,et al.  Linked open data to support content-based recommender systems , 2012, I-SEMANTICS '12.

[24]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[25]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[26]  Paolo Tomeo,et al.  A SPRank : Semantic Path-based Ranking for Top-N Recommendations using Linked Open Data , 2016 .

[27]  Taher H. Haveliwala Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..

[28]  Pasquale Lops,et al.  Semantics-Aware Content-Based Recommender Systems , 2014, Recommender Systems Handbook.

[29]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[30]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[31]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[32]  Jonice Oliveira,et al.  A recommendation approach for consuming linked open data , 2017, Expert Syst. Appl..

[33]  John G. Breslin,et al.  Measuring semantic distance for linked open data-enabled recommender systems , 2016, SAC.