MC2G: An Efficient Algorithm for Matrix Completion with Social and Item Similarity Graphs

We consider a discrete-valued matrix completion problem for recommender systems in which both the social and item similarity graphs are available as side information. We develop and analyze MC2G (Matrix Completion with 2 Graphs), a quasilinear-time algorithm based on spectral clustering and local refinement steps. We show that the sample complexity of MC2G meets an information-theoretic limit that is derived using maximum likelihood estimation and is also order-optimal. We demonstrate that having both graphs as side information outperforms having just a single graph; that is, the availability of two graphs results in a synergistic effect. Experiments on synthetic datasets corroborate our theoretical results. Finally, experiments on a sub-sampled version of the Netflix dataset show that MC2G significantly outperforms other state-of-the-art matrix completion algorithms that leverage graph side information.
