Ontology-based Linked Data Summarization in Semantics-aware Recommender Systems

In the current information-centric era, recommender systems are gaining momentum as tools that assist users in daily decision-making tasks. They may exploit users’ past behavior, combined with side or contextual information, to suggest new items or pieces of knowledge the users might be interested in. Within the recommendation process, Linked Data has already been proposed as a valuable source of information to enhance the predictive power of recommender systems, not only in terms of accuracy but also in terms of diversity and novelty of results. In this direction, one of the main open issues in using Linked Data to feed a recommendation engine is feature selection: how to select only the most relevant subset of the original Linked Data, thus avoiding both useless processing of data and the so-called “curse of dimensionality” problem. In this paper, we show how ontology-based (linked) data summarization can drive the selection of properties/features useful to a recommender system. In particular, we compare a fully automated feature selection method based on ontology-based data summaries with more classical ones, and we evaluate the performance of these methods in terms of accuracy and aggregate diversity of a recommender system exploiting the top-k selected features. We set up an experimental testbed relying on datasets from different knowledge domains. Results show the feasibility of a feature selection process driven by ontology-based data summaries for Linked Data-enabled recommender systems.
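As a rough illustration of the two ideas named above, the following Python sketch ranks properties by how often they occur in a data summary and keeps the top k, then computes aggregate diversity as the number of distinct items recommended across all users. It is only a sketch under stated assumptions, not the method evaluated in the paper: the summary is assumed to be a list of (subject type, property, object type, frequency) patterns, the frequency-based ranking is one plausible criterion, and the function names and toy values are hypothetical.

```python
from collections import Counter


def top_k_properties(patterns, domain_type, k):
    """Rank properties of `domain_type` by total pattern frequency; keep the top k.

    `patterns` is an assumed summary format: an iterable of
    (subject_type, property, object_type, frequency) tuples.
    """
    totals = Counter()
    for subject_type, prop, _object_type, frequency in patterns:
        if subject_type == domain_type:
            totals[prop] += frequency
    # Sort by descending frequency; break ties alphabetically for determinism.
    ranked = sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))
    return [prop for prop, _ in ranked[:k]]


def aggregate_diversity(recommendations):
    """Count the distinct items appearing in any user's recommendation list."""
    distinct = set()
    for items in recommendations.values():
        distinct.update(items)
    return len(distinct)


# Toy summary patterns, invented purely for illustration.
patterns = [
    ("Film", "director", "Person", 120),
    ("Film", "starring", "Person", 300),
    ("Film", "starring", "Actor", 50),
    ("Film", "runtime", "Literal", 90),
    ("Book", "author", "Person", 200),
]

selected = top_k_properties(patterns, "Film", 2)
diversity = aggregate_diversity({"u1": ["a", "b"], "u2": ["b", "c"]})
```

With a real summary, the selected properties would determine which Linked Data features are passed to the recommender, and aggregate diversity would be computed over its top-N lists.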
