Tumblr Blog Recommendation with Boosted Inductive Matrix Completion

Popular microblogging sites such as Tumblr have attracted hundreds of millions of users as a content sharing platform, where users can create rich content in the form of posts that are shared with other users who follow them. Due to the sheer amount of posts created on such services, an important task is to make quality recommendations of blogs for users to follow. Apart from traditional recommender system settings where the follower graph is the main data source, additional side-information of users and blogs such as user activity (e.g., like and reblog) and rich content (e.g., text and images) are also available to be exploited for enhanced recommendation performance. In this paper, we propose a novel boosted inductive matrix completion method (BIMC) for blog recommendation. BIMC is an additive low-rank model for user-blog preferences consisting of two components; one component captures the low-rank structure of follow relationships and the other captures the latent structure using side-information. Our model formulation combines the power of the recently proposed inductive matrix completion (IMC) model (for side-information) together with a standard matrix completion (MC) model (for low-rank structure). Furthermore, we utilize recently developed deep learning techniques to obtain semantically rich feature representations of text and images that are incorporated in BIMC. Experiments on a large-scale real-world dataset from Tumblr illustrate the effectiveness of the proposed BIMC method.

[1]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Martha Larson,et al.  Collaborative Filtering beyond the User-Item Matrix , 2014, ACM Comput. Surv..

[3]  Jimmy J. Lin,et al.  WTF: the who to follow service at Twitter , 2013, WWW.

[4]  Yan Liu,et al.  What is Tumblr: a statistical overview and comparison , 2014, SKDD.

[5]  Analía Amandi,et al.  Followee recommendation based on text analysis of micro-blogging activity , 2013, Inf. Syst..

[6]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2008, Found. Comput. Math..

[7]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[8]  Geoffrey J. Gordon,et al.  Relational learning via collective matrix factorization , 2008, KDD.

[9]  Benjamin Schrauwen,et al.  Deep content-based music recommendation , 2013, NIPS.

[10]  Inderjit S. Dhillon,et al.  Large-scale Multi-label Learning with Missing Labels , 2013, ICML.

[11]  Yudong Chen,et al.  Coherent Matrix Completion , 2013, ICML.

[12]  Christophe Diot,et al.  Finding a needle in a haystack of reviews: cold start context-based hotel recommender system , 2012, RecSys.

[13]  Guillaume Bouchard,et al.  Convex Collective Matrix Factorization , 2013, AISTATS.

[14]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[15]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[16]  Brian D. Davison,et al.  Structural link analysis and prediction in microblogs , 2011, CIKM '11.

[17]  Younghoon Kim,et al.  TWITOBI: A Recommendation System for Twitter Using Probabilistic Modeling , 2011, 2011 IEEE 11th International Conference on Data Mining.

[18]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[19]  Nagarajan Natarajan,et al.  Which app will you use next?: collaborative filtering with interactional context , 2013, RecSys.

[20]  Yehuda Koren,et al.  Lessons from the Netflix prize challenge , 2007, SKDD.

[21]  Omar U. Florez Deep Learning of Semantic Word Representations to Implement a Content-Based Recommender for the RecSys Challenge'14 , 2014, SemWebEval@ESWC.

[22]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[23]  Joemon M. Jose,et al.  Handling data sparsity in collaborative filtering using emotion and semantic based features , 2011, SIGIR.

[24]  Inderjit S. Dhillon,et al.  Multi-scale link prediction , 2012, CIKM '12.

[25]  Martin Ester,et al.  A matrix factorization technique with trust propagation for recommendation in social networks , 2010, RecSys '10.

[26]  Wu-Jun Li,et al.  TagiCoFi: tag informed collaborative filtering , 2009, RecSys '09.

[27]  Mong-Li Lee,et al.  Community-based user recommendation in uni-directional social networks , 2013, CIKM.

[28]  Jason Weston,et al.  #TagSpace: Semantic Embeddings from Hashtags , 2014, EMNLP.

[29]  Makoto Yamada,et al.  Consistent Collective Matrix Completion under Joint Low Rank Structure , 2014, AISTATS.

[30]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[31]  Brian D. Davison,et al.  Co-factorization machines: modeling user interests and predicting individual decisions in Twitter , 2013, WSDM.

[32]  John Hannon,et al.  Recommending twitter users to follow using content and collaborative filtering approaches , 2010, RecSys '10.

[33]  Inderjit S. Dhillon,et al.  Provable Inductive Matrix Completion , 2013, ArXiv.

[34]  Andrea Montanari,et al.  Matrix Completion from Noisy Entries , 2009, J. Mach. Learn. Res..

[35]  Nagarajan Natarajan,et al.  Scalable Affiliation Recommendation using Auxiliary Networks , 2011, TIST.

[36]  Deepak Agarwal,et al.  Regression-based latent factor models , 2009, KDD.

[37]  Prateek Jain,et al.  Low-rank matrix completion using alternating minimization , 2012, STOC '13.

[38]  Ioannis Konstas,et al.  On social networks and collaborative recommendation , 2009, SIGIR.