Learning with Heterogeneous Side Information Fusion for Recommender Systems

Collaborative filtering (CF) based methods have become the most popular technique for recommender systems (RSs). In recent years, various types of side information such as social connections among users and metadata of items have been introduced into CF and shown to be effective for improving recommendation performance. Moreover, side information can alleviate data sparsity and cold start problems facing conventional CF based methods. However, previous works process different types of information separately, thus losing information that might exist across different types of side information. In this work, we study the application of Heterogeneous Information Network (HIN), which offers flexible representation of different types of side information, to enhance CF based recommendation methods. Since HIN could be a complex graph representing multiple types of relations between entity types, we need to tackle two challenging issues facing HIN-based RSs: How to capture the complex semantics that determines the similarities between users and items in a HIN, and how to fuse the heterogeneous side information to support recommendation. To address these issues, we apply metagraph to HIN-based RSs and solve the information fusion problem with a "matrix factorization (MF) + factorization machine (FM)" framework. For the MF part, we obtain the user-item similarity matrix from each metagraph and then apply low-rank matrix approximation to obtain latent features for both users and items. For the FM part, we apply FM with Group lasso (FMG) on the features obtained from the MF part to train the recommending model and at the same time identify the usefulness of the metagraphs. Experimental results on two large real-world datasets, i.e., Amazon and Yelp, show that our proposed approach is better than FM and other state-of-the-art HIN-based recommendation methods.

[1]  Yizhou Sun,et al.  Mining heterogeneous information networks: a structural analysis approach , 2013, SKDD.

[2]  Tie-Yan Liu,et al.  Large-Scale Low-Rank Matrix Learning with Nonconvex Regularizers , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Xiang Li,et al.  Meta Structure: Computing Relevance in Large Heterogeneous Information Networks , 2016, KDD.

[4]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[5]  Mao Ye,et al.  Exploiting geographical influence for collaborative point-of-interest recommendation , 2011, SIGIR.

[6]  Brian D. Davison,et al.  Co-factorization machines: modeling user interests and predicting individual decisions in Twitter , 2013, WSDM.

[7]  Huazheng Wang,et al.  Factorization Bandits for Interactive Recommendation , 2017, AAAI.

[8]  Qiang Yang,et al.  Transfer learning in heterogeneous collaborative filtering domains , 2013, Artif. Intell..

[9]  Philip S. Yu,et al.  HeteSim: A General Framework for Relevance Measure in Heterogeneous Networks , 2013, IEEE Transactions on Knowledge and Data Engineering.

[10]  James T. Kwok,et al.  Collaborative Filtering with Social Local Models , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[11]  Majid Sarrafzadeh,et al.  HeteroMed: Heterogeneous Information Network for Medical Diagnosis , 2018, CIKM.

[12]  Julian J. McAuley,et al.  Ups and Downs: Modeling the Visual Evolution of Fashion Trends with One-Class Collaborative Filtering , 2016, WWW.

[13]  Quanquan Gu,et al.  Collaborative Filtering with Entity Similarity Regularization in Heterogeneous Information Networks , 2013 .

[14]  Mengting Wan,et al.  Recommendation Through Mixtures of Heterogeneous Item Relationships , 2018, CIKM.

[15]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[16]  Philip S. Yu,et al.  Integrating meta-path selection with user-guided object clustering in heterogeneous information networks , 2012, KDD.

[17]  Alexander J. Smola,et al.  Stochastic Variance Reduction for Nonconvex Optimization , 2016, ICML.

[18]  Steffen Rendle,et al.  Factorization Machines with libFM , 2012, TIST.

[19]  Dan Roth,et al.  Incorporating World Knowledge to Document Clustering via Heterogeneous Information Networks , 2015, KDD.

[20]  Jean-Philippe Vert,et al.  Group lasso with overlap and graph lasso , 2009, ICML '09.

[21]  Stephen P. Boyd,et al.  Enhancing Sparsity by Reweighted ℓ1 Minimization , 2007, 0711.1612.

[22]  Yanfang Ye,et al.  Gotcha - Sly Malware!: Scorpion A Metagraph2vec Based Malware Detection System , 2018, KDD.

[23]  Wu-Jun Li,et al.  Coupled Group Lasso for Web-Scale CTR Prediction in Display Advertising , 2014, ICML.

[24]  Heng-Tze Cheng,et al.  Wide & Deep Learning for Recommender Systems , 2016, DLRS@RecSys.

[25]  Jiawei Han,et al.  KnowSim: A Document Similarity Measure on Structured Heterogeneous Information Networks , 2015, 2015 IEEE International Conference on Data Mining.

[26]  Arkadiusz Paterek,et al.  Improving regularized singular value decomposition for collaborative filtering , 2007 .

[27]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[28]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[29]  Lin Xiao,et al.  A Proximal Stochastic Gradient Method with Progressive Variance Reduction , 2014, SIAM J. Optim..

[30]  Philip S. Yu,et al.  PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks , 2013, TKDD.

[31]  Minyi Guo,et al.  Exploring High-Order User Preference on the Knowledge Graph for Recommender Systems , 2019, ACM Trans. Inf. Syst..

[32]  Ni Lao,et al.  Relational retrieval using a combination of path-constrained random walks , 2010, Machine Learning.

[33]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2008, Found. Comput. Math..

[34]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[35]  James T. Kwok,et al.  Efficient Learning with a Family of Nonconvex Regularizers by Redistributing Nonconvexity , 2016, ICML.

[36]  Yizhou Sun,et al.  Semi-supervised Learning over Heterogeneous Information Networks by Ensemble of Meta-graph Guided Random Walks , 2017, IJCAI.

[37]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[38]  Joshua C. Denny,et al.  Chapter 13: Mining Electronic Health Records in the Genomics Era , 2012, PLoS Comput. Biol..

[39]  Yanfang Ye,et al.  HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network , 2017, KDD.

[40]  Yizhou Sun,et al.  Personalized entity recommendation: a heterogeneous information network approach , 2014, WSDM.

[41]  Xin Li,et al.  Automatic Opioid User Detection from Twitter: Transductive Ensemble Built on Different Meta-graph Based Similarities over Heterogeneous Information Network , 2018, IJCAI.

[42]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2009, Found. Comput. Math..

[43]  Philip S. Yu,et al.  Meta-path based multi-network collective link prediction , 2014, KDD.

[44]  Lars Schmidt-Thieme,et al.  Pairwise interaction tensor factorization for personalized tag recommendation , 2010, WSDM '10.

[45]  Jieping Ye,et al.  Efficient Methods for Overlapping Group Lasso , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Dik Lun Lee,et al.  Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba , 2018, KDD.

[47]  Philip S. Yu,et al.  Semantic Path based Personalized Recommendation on Weighted Heterogeneous Information Networks , 2015, CIKM.

[48]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[49]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[50]  Yizhou Sun,et al.  Distant Meta-Path Similarities for Text-Based Heterogeneous Information Networks , 2017, CIKM.

[51]  Leon Wenliang Zhong,et al.  Fast Low-Rank Matrix Learning with Nonconvex Regularization , 2015, 2015 IEEE International Conference on Data Mining.

[52]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[53]  Heterogeneous Information Networks , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[54]  Weike Pan,et al.  A survey of transfer learning for collaborative recommendation with auxiliary data , 2016, Neurocomputing.

[55]  Qiang Yang,et al.  Transfer learning for collaborative filtering via a rating-matrix generative model , 2009, ICML '09.

[56]  Kevin Chen-Chuan Chang,et al.  Semantic proximity search on graphs with metagraph-based learning , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[57]  Qiang Yang,et al.  A unified framework of active transfer learning for cross-system recommendation , 2017, Artif. Intell..

[58]  Philip S. Yu,et al.  Inferring anchor links across multiple heterogeneous social networks , 2013, CIKM.

[59]  Isabelle Guyon,et al.  Taking Human out of Learning Applications: A Survey on Automated Machine Learning , 2018, 1810.13306.

[60]  John Riedl,et al.  An Algorithmic Framework for Performing Collaborative Filtering , 1999, SIGIR Forum.

[61]  Philip S. Yu,et al.  Heterogeneous Information Network Embedding for Recommendation , 2017, IEEE Transactions on Knowledge and Data Engineering.

[62]  Mohamed Abdel-Mottaleb,et al.  Discriminant correlation analysis for feature level fusion with application to multimodal biometrics , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[63]  Neil Yorke-Smith,et al.  LibRec: A Java Library for Recommender Systems , 2015, UMAP Workshops.

[64]  Qiang Yang,et al.  Beyond Personalization: Social Content Recommendation for Creator Equality and Consumer Satisfaction , 2019, KDD.

[65]  Philip S. Yu,et al.  Multi-label classification by mining label and instance correlations from heterogeneous information networks , 2013, KDD.

[66]  Michael R. Lyu,et al.  Ratings meet reviews, a combined approach to recommend , 2014, RecSys '14.

[67]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..

[68]  Nitesh V. Chawla,et al.  Heterogeneous Graph Neural Network , 2019, KDD.

[69]  Stephen P. Boyd,et al.  Proximal Algorithms , 2013, Found. Trends Optim..

[70]  Huan Li,et al.  Accelerated Proximal Gradient Methods for Nonconvex Programming , 2015, NIPS.

[71]  Philip S. Yu,et al.  Aspect-Level Deep Collaborative Filtering via Heterogeneous Information Networks , 2018, IJCAI.

[72]  Tong Zhang,et al.  Analysis of Multi-stage Convex Relaxation for Sparse Regularization , 2010, J. Mach. Learn. Res..

[73]  Saeed Shiry Ghidary,et al.  Usage-based web recommendations: a reinforcement learning approach , 2007, RecSys '07.

[74]  Philip S. Yu,et al.  Leveraging Meta-path based Context for Top- N Recommendation with A Neural Co-Attention Model , 2018, KDD.

[75]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[76]  Nitesh V. Chawla,et al.  metapath2vec: Scalable Representation Learning for Heterogeneous Networks , 2017, KDD.

[77]  James T. Kwok,et al.  Accelerated Inexact Soft-Impute for Fast Large-Scale Matrix Completion , 2015, IJCAI.

[78]  Yuan Qi,et al.  Cash-Out User Detection Based on Attributed Heterogeneous Information Network with a Hierarchical Attention Mechanism , 2019, AAAI.

[79]  Charu C. Aggarwal,et al.  When will it happen?: relationship prediction in heterogeneous information networks , 2012, WSDM '12.

[80]  Tie-Yan Liu,et al.  Efficient Inexact Proximal Gradient Algorithm for Nonconvex Problems , 2016, IJCAI.

[81]  Zeyuan Allen Zhu,et al.  Variance Reduction for Faster Non-Convex Optimization , 2016, ICML.

[82]  Xing Xie,et al.  Towards mobile intelligence: Learning from GPS history data for collaborative recommendation , 2012, Artif. Intell..

[83]  Dik Lun Lee,et al.  Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks , 2017, KDD.

[84]  Philip S. Yu,et al.  A Survey of Heterogeneous Information Network Analysis , 2015, IEEE Transactions on Knowledge and Data Engineering.

[85]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.