Knowledge graph summarization impacts on movie recommendations

A classical problem that frequently compromises Recommender System (RS) accuracy is the sparsity of the data about the interactions of the users with the items to be recommended. The use of side information (e.g. movie domain information) from a Knowledge Graph (KG) has proven effective to circumvent this problem. However, KG growth in terms of size and complexity gives rise to many challenges, including the demand for high-cost algorithms to handle large amounts of partially irrelevant and noisy data. Meanwhile, though Graph Summarization (GS) has become popular to support tasks such as KG visualization and search, it is still relatively unexplored in the KG-based RS domain. In this work, we investigate the potential of GS as a preprocessing step to condense side information in a KG and consequently reduce computational costs of using this information. We propose a GS method that combines embedding based on latent semantics (ComplEx) with nodes clustering (K-Means) in single-view and multi-view approaches for KG summarization, i.e. which act on the whole KG at once or on a separated KG view at a time, respectively. Then, we evaluate the impacts of these alternative GS approaches on several state-of-the-art KG-based RSs, in experiments using the MovieLens 1M dataset and side information gathered from IMDb and DBpedia. Our experimental results show that KG summarization can speed up the recommendation process without significant changes in movie recommendation quality, which vary in accordance with the GS approach, the summarization ratio, and the recommendation method.

[1]  Danai Koutra,et al.  Graph Summarization Methods and Applications , 2016, ACM Comput. Surv..

[2]  Iulia Paun,et al.  Efficiency-Effectiveness Trade-offs in Recommendation Systems , 2020, RecSys.

[3]  Marco Fiorucci,et al.  Separating Structure from Noise in Large Graphs Using the Regularity Lemma , 2019, Pattern Recognit..

[4]  Lili Sahakyan,et al.  Remembering to Forget , 2010, Psychological science.

[5]  Alessandro Bozzon,et al.  Recurrent knowledge graph embedding for effective recommendation , 2018, RecSys.

[6]  Minyi Guo,et al.  RippleNet: Propagating User Preferences on the Knowledge Graph for Recommender Systems , 2018, CIKM.

[7]  Francisco Herrera,et al.  Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Thar Baker,et al.  Analysis of Dimensionality Reduction Techniques on Big Data , 2020, IEEE Access.

[9]  David C. Wilson,et al.  Categorizing Case-Base Maintenance: Dimensions and Directions , 1998, EWCBR.

[10]  Alberto D. Pascual-Montano,et al.  A survey of dimensionality reduction techniques , 2014, ArXiv.

[11]  Feras Al-Obeidat,et al.  Topic and sentiment aware microblog summarization for twitter , 2018, Journal of Intelligent Information Systems.

[12]  Chuan Qin,et al.  A Survey on Knowledge Graph-Based Recommender Systems , 2020 .

[13]  Yixin Cao,et al.  Unifying Knowledge Graph Learning and Recommendation: Towards a Better Understanding of User Preferences , 2019, WWW.

[14]  Hong Yu,et al.  Web Items Recommendation Based on Multi-View Clustering , 2018, 2018 IEEE 42nd Annual Computer Software and Applications Conference (COMPSAC).

[15]  Tony R. Martinez,et al.  Instance Pruning Techniques , 1997, ICML.

[16]  Eric O. Postma,et al.  Dimensionality Reduction: A Comparative Review , 2008 .

[17]  Igor Jurisica,et al.  Maintaining Case-Based Reasoning Systems: A Machine Learning Approach , 2004, ECCBR.

[18]  Neil Yorke-Smith,et al.  Leveraging multiviews of trust and similarity to enhance clustering-based recommender systems , 2015, Knowl. Based Syst..

[19]  Nicholas Jing Yuan,et al.  Collaborative Knowledge Base Embedding for Recommender Systems , 2016, KDD.

[20]  Qingming Huang,et al.  GOMES: A group-aware multi-view fusion approach towards real-world image clustering , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[21]  Yizhou Sun,et al.  Recommendation in heterogeneous information networks with implicit user feedback , 2013, RecSys.

[22]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[23]  Zhendong Mao,et al.  Knowledge Graph Embedding: A Survey of Approaches and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[24]  Padraig Cunningham,et al.  k-Nearest Neighbour Classifiers - A Tutorial , 2020, ACM Comput. Surv..

[25]  Jignesh M. Patel,et al.  Discovery-driven graph summarization , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[26]  Xuelong Li,et al.  Multi-View Clustering and Semi-Supervised Classification with Adaptive Neighbours , 2017, AAAI.

[27]  Renato Fileto,et al.  Hybrid Recommender System Based on Multi-Hierarchical Ontologies , 2018, WebMedia.

[28]  Ji Zhang,et al.  A tourism destination recommender system using users’ sentiment and temporal dynamics , 2018, Journal of Intelligent Information Systems.

[29]  Zhiyuan Liu,et al.  Learning Entity and Relation Embeddings for Knowledge Graph Completion , 2015, AAAI.

[30]  Gong Cheng,et al.  DeepLENS: Deep Learning for Entity Summarization , 2020, DL4KG@ESWC.

[31]  Hao Wang,et al.  Multi-view clustering: A survey , 2018, Big Data Min. Anal..

[32]  Marcelo G. Manzato,et al.  Case recommender: a flexible and extensible python framework for recommender systems , 2018, RecSys.

[33]  François Goasdoué,et al.  Summarizing semantic graphs: a survey , 2018, The VLDB Journal.

[34]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[35]  Hong Yu,et al.  Tag recommendation method in folksonomy based on user tagging status , 2017, Journal of Intelligent Information Systems.

[36]  Paolo Tomeo,et al.  Schema-summarization in linked-data-based feature selection for recommender systems , 2017, SAC.

[37]  Xu Chen,et al.  Learning over Knowledge-Base Embeddings for Recommendation , 2018, Algorithms.

[38]  Alejandro Bellogín,et al.  Exploiting recommendation confidence in decision-aware recommender systems , 2018, Journal of Intelligent Information Systems.

[39]  Dimitrios Gunopulos,et al.  Non-linear dimensionality reduction techniques for classification and visualization , 2002, KDD.

[40]  John G. Breslin,et al.  Transfer Learning for Item Recommendations and Knowledge Graph Completion in Item Related Domains via a Co-Factorization Model , 2018, ESWC.

[41]  Muhammad Kashif Hanif,et al.  Overview and comparative study of dimensionality reduction techniques for high dimensional data , 2020, Inf. Fusion.

[42]  Charu C. Aggarwal,et al.  Recommender Systems , 2016, Springer International Publishing.

[43]  Jianfeng Gao,et al.  Embedding Entities and Relations for Learning and Inference in Knowledge Bases , 2014, ICLR.

[44]  Barry Smyth,et al.  Case-Base Maintenance , 1998, IEA/AIE.

[45]  Mariano P. Consens,et al.  Linked Movie Data Base , 2009, LDOW.

[46]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[47]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[48]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[49]  Steffen Bickel,et al.  Multi-view clustering , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[50]  Zhen Wang,et al.  Knowledge Graph Embedding by Translating on Hyperplanes , 2014, AAAI.

[51]  Chhavi Rana,et al.  Social recommender systems: techniques, domains, metrics, datasets and future scope , 2019, Journal of Intelligent Information Systems.

[52]  Yixin Cao,et al.  KGAT: Knowledge Graph Attention Network for Recommendation , 2019, KDD.

[53]  Guillaume Bouchard,et al.  Complex Embeddings for Simple Link Prediction , 2016, ICML.

[54]  Dennis L. Wilson,et al.  Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern..

[55]  Miquel Sànchez-Marrè,et al.  Reputation-Based Maintenance in Case-Based Reasoning , 2020, Knowl. Based Syst..

[56]  Marcin Sydow,et al.  The notion of diversity in graphical entity summarisation on semantic knowledge graphs , 2013, Journal of Intelligent Information Systems.

[57]  Michel Verleysen,et al.  Recent methods for dimensionality reduction: A brief comparative analysis , 2014, ESANN.

[58]  Miguel Ángel Rodríguez-García,et al.  BlindDate recommender: A context-aware ontology-based dating recommendation platform , 2018, J. Inf. Sci..

[59]  Liang Chen,et al.  Trust-aware media recommendation in heterogeneous social networks , 2013, World Wide Web.

[60]  Roberto Willrich,et al.  Using Implicit Feedback for Neighbors Selection: Alleviating the Sparsity Problem in Collaborative Recommendation Systems , 2017, WebMedia.

[61]  Zahid Halim,et al.  Multi-view document clustering via ensemble method , 2014, Journal of Intelligent Information Systems.