On the Interplay between Social and Topical Structure

People's interests and people's social relationships are intuitively connected, but understanding their interplay and whether they can help predict each other has remained an open question. We examine the interface of two decisive structures forming the backbone of online social media: the graph structure of social networks (who connects with whom) and the set structure of topical affiliations (who is interested in what). In studying this interface, we identify key relationships whereby each of these structures can be understood in terms of the other. The context for our analysis is Twitter, a complex social network of both follower relationships and communication relationships. On Twitter, "hashtags" are used to label conversation topics, and we examine hashtag usage alongside these social structures. We find that the hashtags that users adopt can predict their social relationships, and also that the social relationships between the initial adopters of a hashtag can predict the future popularity of that hashtag. By studying weighted social relationships, we observe that while strong reciprocated ties are the easiest to predict from hashtag structure, they are also much less useful than weak directed ties for predicting hashtag popularity. Importantly, we show that computationally simple structural determinants can provide remarkable performance in both tasks. While our analyses focus on Twitter, we view our findings as broadly applicable to topical affiliations and social relationships in a host of diverse contexts, including the movies people watch, the brands people like, or the locations people frequent.
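
The abstract does not say which structural features the authors use, so the following is only a minimal sketch of the general idea that shared hashtags can signal a social tie. It assumes Jaccard overlap of users' hashtag sets as the score, which is an illustrative choice and not necessarily the paper's method; the function names and the toy data are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard overlap of two sets; defined as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_candidate_ties(user_hashtags):
    """Score every unordered user pair by hashtag overlap, highest first.

    Assumption: a higher overlap is treated as a stronger hint of a tie.
    Ties in score are broken alphabetically so the output is deterministic.
    """
    scored = [
        (jaccard(user_hashtags[u], user_hashtags[v]), u, v)
        for u, v in combinations(sorted(user_hashtags), 2)
    ]
    return sorted(scored, key=lambda t: (-t[0], t[1], t[2]))


# Toy data invented for illustration; not taken from the paper.
user_hashtags = {
    "alice": {"#nlp", "#icwsm", "#python"},
    "bob": {"#nlp", "#python"},
    "carol": {"#cooking"},
}
ranking = rank_candidate_ties(user_hashtags)
for score, u, v in ranking:
    print(f"{u} - {v}: {score:.2f}")
```

In a real evaluation, such scores would be compared against observed follower or mention edges; the sketch stops at ranking pairs and makes no claim about predictive accuracy.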
