Scalable Affiliation Recommendation using Auxiliary Networks

Social network analysis has attracted increasing attention in recent years. In many social networks, besides friendship links among users, the phenomenon of users associating themselves with groups or communities is common. Thus, two networks exist simultaneously: the friendship network among users, and the affiliation network between users and groups. In this article, we tackle the affiliation recommendation problem, where the task is to predict or suggest new affiliations between users and communities, given the current state of the friendship and affiliation networks. More generally, affiliations need not be community affiliations---they can be a user’s taste, so affiliation recommendation algorithms have applications beyond community recommendation. In this article, we show that information from the friendship network can indeed be fruitfully exploited in making affiliation recommendations. Using a simple way of combining these networks, we suggest two models of user-community affinity for the purpose of making affiliation recommendations: one based on graph proximity, and another using latent factors to model users and communities. We explore the affiliation recommendation algorithms suggested by these models and evaluate these algorithms on two real-world networks, Orkut and Youtube. In doing so, we motivate and propose a way of evaluating recommenders, by measuring how good the top 50 recommendations are for the average user, and demonstrate the importance of choosing the right evaluation strategy. The algorithms suggested by the graph proximity model turn out to be the most effective. We also introduce scalable versions of these algorithms, and demonstrate their effectiveness. This use of link prediction techniques for the purpose of affiliation recommendation is, to our knowledge, novel.

[1]  Geoffrey J. Gordon,et al.  Relational learning via collective matrix factorization , 2008, KDD.

[2]  Nagarajan Natarajan,et al.  Affiliation recommendation using auxiliary networks , 2010, RecSys '10.

[3]  Berkant Savas,et al.  Fast and accurate low rank approximation of massive graphs , 2010 .

[4]  Munmun De Choudhury,et al.  Connecting content to community in social media via image content, user tags and user communication , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[5]  R. Larsen Lanczos Bidiagonalization With Partial Reorthogonalization , 1998 .

[6]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[7]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[8]  Edward Y. Chang,et al.  Combinational collaborative filtering for personalized community recommendation , 2008, KDD.

[9]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[10]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[11]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[12]  Brett W. Bader,et al.  Enhancing Multilingual Latent Semantic Analysis with Term Alignment Information , 2008, COLING.

[13]  Edward Y. Chang,et al.  Collaborative filtering for orkut communities: discovery of user latent behavior , 2009, WWW '09.

[14]  Lise Getoor,et al.  Co-evolution of social and affiliation networks , 2009, KDD.

[15]  Chao Yang,et al.  ARPACK users' guide - solution of large-scale eigenvalue problems with implicitly restarted Arnoldi methods , 1998, Software, environments, tools.

[16]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[17]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[18]  Inderjit S. Dhillon,et al.  Clustered low rank approximation of graphs in information science applications , 2011, SDM.

[19]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[20]  Wei Tang,et al.  Clustering with Multiple Graphs , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[21]  Inderjit S. Dhillon,et al.  Weighted Graph Cuts without Eigenvectors A Multilevel Approach , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.