Modeling Information Diffusion in Implicit Networks

Social media forms a central domain for the production and dissemination of real-time information. Even though such flows of information have traditionally been thought of as diffusion processes over social networks, the underlying phenomena are the result of a complex web of interactions among numerous participants. Here we develop the Linear Influence Model where rather than requiring the knowledge of the social network and then modeling the diffusion by predicting which node will influence which other nodes in the network, we focus on modeling the global influence of a node on the rate of diffusion through the (implicit) network. We model the number of newly infected nodes as a function of which other nodes got infected in the past. For each node we estimate an influence function that quantifies how many subsequent infections can be attributed to the influence of that node over time. A nonparametric formulation of the model leads to a simple least squares problem that can be solved on large datasets. We validate our model on a set of 500 million tweets and a set of 170 million news articles and blog posts. We show that the Linear Influence Model accurately models influences of nodes and reliably predicts the temporal dynamics of information diffusion. We find that patterns of influence of individual participants differ significantly depending on the type of the node and the topic of the information.

[1]  Christos Faloutsos,et al.  Modeling Blog Dynamics , 2009, ICWSM.

[2]  Yun Chi,et al.  Information flow modeling based on diffusion rate for prediction and ranking , 2007, WWW '07.

[3]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[4]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.

[5]  Christos Faloutsos,et al.  Cascading Behavior in Large Blog Graphs , 2007 .

[6]  Peter H. Reingen,et al.  Social Ties and Word-of-Mouth Referral Behavior , 1987 .

[7]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[8]  Thomas J. Johnson,et al.  Wag the Blog: How Reliance on Traditional Media and the Internet Influence Credibility Perceptions of Weblogs Among Blog Users , 2004 .

[9]  Noah E. Friedkin,et al.  A Structural Theory of Social Influence: List of Tables and Figures , 1998 .

[10]  P. Lazarsfeld,et al.  Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[11]  P. Lazarsfeld,et al.  6. Katz, E. Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[12]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[13]  Jon M. Kleinberg,et al.  Tracing information flow on a global scale using Internet chain-letter data , 2008, Proceedings of the National Academy of Sciences.

[14]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[15]  D. Watts,et al.  Influentials, Networks, and Public Opinion Formation , 2007 .

[16]  P. Lazarsfeld,et al.  Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[17]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[18]  Noah E. Friedkin,et al.  A Structural Theory of Social Influence: List of Tables and Figures , 1998 .

[19]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[20]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[21]  P. Lazarsfeld,et al.  Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[22]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[23]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[24]  Jayson Harsin,et al.  The Rumour Bomb: Theorising the Convergence of New and Old Trends in Mediated US Politics , 2006 .

[25]  Chris Volinsky,et al.  Network-Based Marketing: Identifying Likely Adopters Via Consumer Networks , 2006, math/0606278.

[26]  Alexander Grey,et al.  The Mathematical Theory of Infectious Diseases and Its Applications , 1977 .

[27]  Lada A. Adamic,et al.  Tracking information epidemics in blogspace , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[28]  B. Golub,et al.  Naive Learning in Social Networks: Convergence, Influence and Wisdom of Crowds , 2007 .

[29]  Kathy E. Gill How can we measure the influence of the blogosphere? , 2004 .

[30]  Noah E. Friedkin A Structural Theory of Social Influence: Measures of the Theoretical Constructs , 1998 .

[31]  Thomas F. Coleman,et al.  A Reflective Newton Method for Minimizing a Quadratic Function Subject to Bounds on Some of the Variables , 1992, SIAM J. Optim..

[32]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[33]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[34]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[35]  Fang Wu,et al.  Novelty and collective attention , 2007, Proceedings of the National Academy of Sciences.

[36]  Tom Rosenstiel,et al.  Warp Speed: America in the Age of Mixed Media , 1999 .

[37]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[38]  Duncan J Watts,et al.  A simple model of global cascades on random networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Keri K. Stephens,et al.  Rogers' diffusion of innovations , 2008 .

[40]  N. Ling The Mathematical Theory of Infectious Diseases and its applications , 1978 .