Using proximity to predict activity in social networks

The structure of a social network contains information useful for predicting its evolution. We show that structural information also helps predict activity. People who are "close" in some sense in a social network are more likely to perform similar actions than more distant people. We use network proximity to capture the degree to which people are "close" to each other. In addition to standard proximity metrics used in the link prediction task, such as neighborhood overlap, we introduce new metrics that model different types of interactions that take place between people. We study this claim empirically using data about URL forwarding activity on the social media sites Digg and Twitter. We show that structural proximity of two users in the follower graph is related to similarity of their activity, i.e., how many URLs they both forward. We also show that given friends' activity, knowing their proximity to the user can help better predict which URLs the user will forward. We compare the performance of different proximity metrics on the activity prediction task and find that metrics that take into account the attention-limited nature of interactions in social media lead to substantially better predictions.

[1]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[2]  Stephen Chadwick,et al.  The Deep South , 2012 .

[3]  Kristina Lerman,et al.  Entropy-based Classification of 'Retweeting' Activity on Twitter , 2011, ArXiv.

[4]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.

[5]  Fang Wu,et al.  Novelty and collective attention , 2007, Proceedings of the National Academy of Sciences.

[6]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[7]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[8]  Kristina Lerman,et al.  What Stops Social Epidemics? , 2011, ICWSM.

[9]  Munmun De Choudhury,et al.  "Birds of a Feather": Does User Homophily Impact Information Diffusion in Social Media? , 2010, ArXiv.

[10]  Ravi Kumar,et al.  Influence and correlation in social networks , 2008, KDD.

[11]  Kristina Lerman,et al.  Social Networks and Social Information Filtering on Digg , 2006, ICWSM.

[12]  Cosma Rohilla Shalizi,et al.  Homophily and Contagion Are Generically Confounded in Observational Social Network Studies , 2010, Sociological methods & research.

[13]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[14]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[15]  Philip S. Yu,et al.  Proximity Tracking on Time-Evolving Bipartite Graphs , 2008, SDM.

[16]  Christos Faloutsos,et al.  Fast direction-aware proximity for graph mining , 2007, KDD '07.

[17]  C. Steglich,et al.  DYNAMIC NETWORKS AND BEHAVIOR: SEPARATING SELECTION FROM INFLUENCE: separating selection from influence , 2010 .

[18]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[19]  Tad Hogg,et al.  Stochastic Models of User-Contributory Web Sites , 2009, ICWSM.

[20]  Kristina Lerman,et al.  Non-Conservative Diffusion and its Application to Social Network Analysis , 2011, ArXiv.

[21]  Yehuda Koren,et al.  Measuring and extracting proximity graphs in networks , 2007, TKDD.

[22]  Kristina Lerman,et al.  Social Information Processing in Social News Aggregation , 2007, ArXiv.

[23]  Kristina Lerman,et al.  Social Browsing on Flickr , 2006, ICWSM.

[24]  Kristina Lerman,et al.  A probabilistic approach for learning folksonomies from structured data , 2011, WSDM '11.

[25]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[26]  Dylan Walker,et al.  Creating Social Contagion Through Viral Product Design: A Randomized Trial of Peer Influence in Networks , 2010, ICIS.

[27]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[28]  D. Boyd,et al.  The Arab Spring| The Revolutions Were Tweeted: Information Flows during the 2011 Tunisian and Egyptian Revolutions , 2011 .

[29]  Kristina Lerman,et al.  Social Information Processing in News Aggregation , 2007, IEEE Internet Computing.

[30]  Kristina Lerman,et al.  Information Contagion: An Empirical Study of the Spread of News on Digg and Twitter Social Networks , 2010, ICWSM.

[31]  L. Freeman Finding Social Groups: A Meta-Analysis of the Southern Women Data , 2003 .

[32]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.

[33]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.