Predicting Links and Inferring Attributes using a Social-Attribute Network (SAN)

The effects of social influence and homophily suggest that both network structure and node attribute information should inform the tasks of link prediction and node attribute inference. Recently, Yin et al. [28, 29] proposed the Social-Attribute Network (SAN), an attribute-augmented social network that integrates network structure and node attributes to perform both link prediction and attribute inference. They focused on generalizing the random walk with restart algorithm to the SAN framework and showed improved performance. In this paper, we extend the SAN framework with several leading supervised and unsupervised link prediction algorithms and demonstrate performance improvement for each algorithm on both link prediction and attribute inference. Moreover, we make the novel observation that attribute inference can help inform link prediction, i.e., link prediction accuracy is further improved by first inferring missing attributes. We comprehensively evaluate these algorithms and compare them with other existing algorithms using a novel, large-scale Google+ dataset, which we make publicly available.
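As a rough illustration of the idea, the sketch below builds a small attribute-augmented graph and scores candidate links and candidate attributes with a common-neighbors count. It is only a sketch under stated assumptions: the abstract does not specify how the SAN is constructed, so the representation (attribute values added as extra nodes linked to the users who have them), the function names `build_san` and `common_neighbors`, and the toy data are all hypothetical, and common neighbors stands in for the unsupervised algorithms the paper extends.

```python
from collections import defaultdict


def build_san(social_edges, node_attributes):
    """Build an undirected adjacency map over user nodes and attribute nodes.

    Assumption (not taken from the abstract): each attribute value becomes
    its own node, linked to every user who has that attribute.
    """
    adj = defaultdict(set)
    for u, v in social_edges:
        adj[("user", u)].add(("user", v))
        adj[("user", v)].add(("user", u))
    for user, attrs in node_attributes.items():
        for attr in attrs:
            adj[("user", user)].add(("attr", attr))
            adj[("attr", attr)].add(("user", user))
    return adj


def common_neighbors(adj, x, y):
    """Count the nodes adjacent to both x and y (users and attributes alike)."""
    return len(adj.get(x, set()) & adj.get(y, set()))


# Toy data, invented for illustration only.
social_edges = [("alice", "bob"), ("bob", "carol")]
node_attributes = {
    "alice": {"employer:Google"},
    "bob": {"school:Berkeley"},
    "carol": {"employer:Google"},
}

san = build_san(social_edges, node_attributes)
social_only = build_san(social_edges, {})

# Link prediction: score the missing alice-carol link with and without attributes.
link_score_san = common_neighbors(san, ("user", "alice"), ("user", "carol"))
link_score_social = common_neighbors(social_only, ("user", "alice"), ("user", "carol"))

# Attribute inference: score a user-attribute pair as if it were a candidate link.
attr_score = common_neighbors(san, ("user", "bob"), ("attr", "employer:Google"))

print(link_score_san, link_score_social, attr_score)
```

In this toy graph the shared attribute node raises the alice-carol score above the social-only score, and the same scoring function applied to a user-attribute pair gives an attribute inference score. This mirrors the abstract's point that one framework can serve both tasks, although the actual algorithms and weighting used in the paper may differ.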

[1] Pang-Ning Tan, et al. A Matrix Alignment Approach for Collective Classification, 2009, 2009 International Conference on Advances in Social Network Analysis and Mining.

[2] M. Newman, et al. Hierarchical structure and the prediction of missing links in networks, 2008, Nature.

[3] Jun Yu, et al. Learning Algorithms for Link Prediction Based on Chance Constraints, 2010, ECML/PKDD.

[4] Jure Leskovec, et al. Supervised random walks: predicting and recommending links in social networks, 2010, WSDM '11.

[5] Thomas Hofmann, et al. Stochastic Relational Models for Discriminative Link Prediction, 2007.

[6] David Yarowsky, et al. Hierarchical Bayesian Models for Latent Attribute Detection in Social Media, 2011, ICWSM.

[7] Nitesh V. Chawla, et al. New perspectives and methods in link prediction, 2010, KDD.

[8] Lise Getoor, et al. To join or not to join: the illusion of privacy in social networks with mixed public and private user profiles, 2009, WWW '09.

[9] Albert, et al. Emergence of scaling in random networks, 1999, Science.

[10] Jianquan Liu, et al. Link prediction: the power of maximal entropy random walk, 2011, CIKM '11.

[11] Alexander J. Smola, et al. Like like alike: joint friendship and interest propagation in social networks, 2011, WWW.

[12] Jure Leskovec, et al. Modeling Social Networks with Node Attributes using the Multiplicative Attribute Graph Model, 2011, UAI.

[13] Jiawei Han, et al. A Unified Framework for Link Recommendation Using Random Walks, 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[14] Charles Elkan, et al. Link Prediction via Matrix Factorization, 2011, ECML/PKDD.

[15] David Yarowsky, et al. Typed graph models for semi-supervised learning of name ethnicity, 2011, ACL 2011.

[16] Lise Getoor, et al. Combining Collective Classification and Link Prediction, 2007.

[17] Mohammad Al Hasan, et al. Link prediction using supervised learning, 2006.

[18] Christos Faloutsos, et al. Realistic, Mathematically Tractable Graph Generation and Evolution, Using Kronecker Multiplication, 2005, PKDD.

[19] M. McPherson, et al. Birds of a Feather: Homophily in Social Networks, 2001.

[20] Lada A. Adamic, et al. Friends and neighbors on the Web, 2003, Soc. Networks.

[21] Ben Taskar, et al. Link Prediction in Relational Data, 2003, NIPS.

[22] Gueorgi Kossinets, et al. Empirical Analysis of an Evolving Social Network, 2006, Science.

[23] Sergey Brin, et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine, 1998, Comput. Networks.

[24] Christos Faloutsos, et al. Automatic multimedia cross-modal correlation discovery, 2004, KDD.

[25] Paul Resnick, et al. Recommender systems, 1997, CACM.

[26] Christos Faloutsos, et al. Graphs over time: densification laws, shrinking diameters and possible explanations, 2005, KDD '05.

[27] Jiawei Han, et al. LINKREC: a unified framework for link recommendation with user attributes and graph structure, 2010, WWW '10.

[28] Marc Najork, et al. Computing Information Retrieval Performance Measures Efficiently in the Presence of Tied Scores, 2008, ECIR.

[29] Panagiotis Symeonidis, et al. Transitive node similarity for link prediction in social networks with positive and negative links, 2010, RecSys '10.

[30] Ravi Kumar, et al. Structure and evolution of blogspace, 2004, CACM.

[31] Gueorgi Kossinets. Effects of missing data in social networks, 2006, Soc. Networks.

[32] Jennifer Neville, et al. Randomization tests for distinguishing social influence and homophily effects, 2010, WWW '10.

[33] Thomas L. Griffiths, et al. Nonparametric Latent Feature Models for Link Prediction, 2009, NIPS.

[34] David Yarowsky, et al. Classifying latent user attributes in twitter, 2010, SMUC '10.

[35] Jure Leskovec, et al. Microscopic evolution of social networks, 2008, KDD.

[36] David Liben-Nowell, et al. The link-prediction problem for social networks, 2007.

[37] Jure Leskovec, et al. The Network Completion Problem: Inferring Missing Nodes and Edges in Networks, 2011, SDM.

[38] David J. Hand, et al. A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems, 2001, Machine Learning.