Dynamic inference of social roles in information cascades

Nodes in complex networks inherently represent different kinds of functional or organizational roles. In the dynamic process of an information cascade, users play different roles in spreading the information: some act as seeds to initiate the process, some limit the propagation and others are in-between. Understanding the roles of users is crucial in modeling the cascades. Previous research mainly focuses on modeling users behavior based upon the dynamic exchange of information with neighbors. We argue however that the structural patterns in the neighborhood of nodes may already contain enough information to infer users’ roles, independently from the information flow in itself. To approach this possibility, we examine how network characteristics of users affect their actions in the cascade. We also advocate that temporal information is very important. With this in mind, we propose an unsupervised methodology based on ensemble clustering to classify users into their social roles in a network, using not only their current topological positions, but also considering their history over time. Our experiments on two social networks, Flickr and Digg, show that topological metrics indeed possess discriminatory power and that different structural patterns correspond to different parts in the process. We observe that user commitment in the neighborhood affects considerably the influence score of users. In addition, we discover that the cohesion of neighborhood is important in the blocking behavior of users. With this we can construct topological fingerprints that can help us in identifying social roles, based solely on structural social ties, and independently from nodes activity and how information flows.

[1]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[2]  Philip S. Yu,et al.  Identifying the influential bloggers in a community , 2008, WSDM '08.

[3]  Ryan A. Rossi,et al.  Role-dynamics: fast mining of large dynamic networks , 2012, WWW.

[4]  Fernando M. A. Silva,et al.  Event detection in evolving networks , 2012, 2012 Fourth International Conference on Computational Aspects of Social Networks (CASoN).

[5]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.

[6]  Santo Fortunato,et al.  Consensus clustering in complex networks , 2012, Scientific Reports.

[7]  Divesh Srivastava,et al.  Forward Decay: A Practical Time Decay Model for Streaming Systems , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[8]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[9]  Philip S. Yu,et al.  Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.

[10]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[11]  Daniel M. Romero,et al.  Influence and passivity in social media , 2010, ECML/PKDD.

[12]  Shashi Shekhar,et al.  Multilevel hypergraph partitioning: application in VLSI domain , 1997, DAC.

[13]  Kristina Lerman,et al.  What Stops Social Epidemics? , 2011, ICWSM.

[14]  Jiawei Han,et al.  Entity Role Discovery in Hierarchical Topical Communities , 2013 .

[15]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[16]  Ben Taskar,et al.  Discriminative Probabilistic Models for Relational Data , 2002, UAI.

[17]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[18]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[19]  Tina Eliassi-Rad,et al.  Leveraging Label-Independent Features for Classification in Sparsely Labeled Networks: An Empirical Study , 2008, SNAKDD.

[20]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[21]  Mark S. Granovetter Economic Action and Social Structure: The Problem of Embeddedness , 1985, American Journal of Sociology.

[22]  Kristina Lerman,et al.  Predicting Influential Users in Online Social Networks , 2010, ArXiv.

[23]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[24]  Aristides Gionis,et al.  Clustering Aggregation , 2005, ICDE.

[25]  Mudhakar Srivatsa,et al.  Microscopic Social Influence , 2012, SDM.

[26]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[27]  Jure Leskovec,et al.  Information diffusion and external influence in networks , 2012, KDD.

[28]  J. Kleinberg,et al.  Networks, Crowds, and Markets , 2010 .

[29]  Lada A. Adamic,et al.  How to search a social network , 2005, Soc. Networks.

[30]  Philip S. Yu,et al.  Inferring social roles and statuses in social networks , 2013, KDD.

[31]  Krishna P. Gummadi,et al.  Delayed information cascades in Flickr: Measurement, analysis, and modeling , 2012, Comput. Networks.

[32]  Danai Koutra,et al.  RolX: structural role extraction & mining in large graphs , 2012, KDD.

[33]  Masahiro Kimura,et al.  Prediction of Information Diffusion Probabilities for Independent Cascade Model , 2008, KES.

[34]  Jon M. Kleinberg,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World [Book Review] , 2013, IEEE Technol. Soc. Mag..

[35]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[36]  Kristina Lerman,et al.  Social Contagion: An Empirical Study of Information Spread on Digg and Twitter Follower Graphs , 2012, ArXiv.

[37]  Phillip Bonacich,et al.  Some unique properties of eigenvector centrality , 2007, Soc. Networks.

[38]  Esteban Moro,et al.  Impact of human activity patterns on the dynamics of information diffusion. , 2009, Physical review letters.

[39]  Jure Leskovec,et al.  The role of social networks in online shopping: information passing, price of trust, and consumer choice , 2011, EC '11.

[40]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[41]  Gueorgi Kossinets,et al.  Empirical Analysis of an Evolving Social Network , 2006, Science.

[42]  Luciano da Fontoura Costa,et al.  Beyond the average: Detecting global singular nodes from local features in complex networks , 2006, 1003.3084.

[43]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[44]  Ana L. N. Fred,et al.  Analysis of consensus partition in cluster ensemble , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[45]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[46]  Haewoon Kwak,et al.  Finding influentials based on the temporal order of information adoption in twitter , 2010, WWW '10.

[47]  S. C. Johnson Hierarchical clustering schemes , 1967, Psychometrika.

[48]  Kristina Lerman,et al.  Rethinking Centrality: The Role of Dynamical Processes in Social Network Analysis , 2012, ArXiv.

[49]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[50]  Ling Liu,et al.  Social influence based clustering of heterogeneous information networks , 2013, KDD.

[51]  Fernando M. A. Silva,et al.  Network Node Label Acquisition and Tracking , 2011, EPIA.

[52]  Ana L. N. Fred,et al.  Data clustering using evidence accumulation , 2002, Object recognition supported by user interaction for service robots.