Learning Asynchronous-Time Information Diffusion Models and its Application to Behavioral Data Analysis over Social Networks

One of the interesting and important problems of information diffusion over a large social network is to identify an appropriate model from a limited amount of diffusion information. There are two contrasting approaches to model information diffusion: a push type model known as Independent Cascade (IC) model and a pull type model known as Linear Threshold (LT) model. We extend these two models (called AsIC and AsLT in this paper) to incorporate asynchronous time delay and investigate 1) how they differ from or similar to each other in terms of information diffusion, 2) whether the model itself is learnable or not from the observed information diffusion data, and 3) which model is more appropriate to explain for a particular topic (information) to diffuse/propagate. We first show there can be variations with respect to how the time delay is modeled, and derive the likelihood of the observed data being generated for each model. Using one particular time delay model, we show the model parameters are learnable from a limited amount of observation. We then propose a method based on predictive accuracy by which to select a model which better explains the observed data. Extensive evaluations were performed. We first show using synthetic data with the network structures taken from real networks that there are considerable behavioral differences between the AsIC and the AsLT models, the proposed methods accurately and stably learn the model parameters, and identify the correct diffusion model from a limited amount of observation data. We next apply these methods to behavioral analysis of topic propagation using the real blog propagation data, and show there is a clear indication as to which topic better follows which model although the results are rather insensitive to the model selected at the level of discussing how far and fast each topic propagates from the learned parameter values.

[1]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[2]  Pedro M. Domingos Mining Social Networks for Viral Marketing , 2022 .

[3]  Masahiro Kimura,et al.  Extracting influential nodes on a social network for information diffusion , 2009, Data Mining and Knowledge Discovery.

[4]  Lada A. Adamic,et al.  Social influence and the diffusion of user-created content , 2009, EC '09.

[5]  Lada A. Adamic,et al.  Tracking information epidemics in blogspace , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[6]  Pawan Kumar,et al.  Notice of Violation of IEEE Publication Principles The Anatomy of a Large-Scale Hyper Textual Web Search Engine , 2009 .

[7]  Albert-László Barabási,et al.  Error and attack tolerance of complex networks , 2000, Nature.

[8]  Masahiro Kimura,et al.  Learning information diffusion model in a social network for predicting influence of nodes , 2011, Intell. Data Anal..

[9]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[10]  Masahiro Kimura,et al.  Prediction of Information Diffusion Probabilities for Independent Cascade Model , 2008, KES.

[11]  John Scott What is social network analysis , 2010 .

[12]  SaitoKazumi,et al.  Blocking links to minimize contamination spread in a social network , 2009 .

[13]  Aristides Gionis,et al.  Sparsification of influence networks , 2011, KDD.

[14]  Stephanie Forrest,et al.  Email networks and the spread of computer viruses. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Yiming Yang,et al.  The Enron Corpus: A New Dataset for Email Classi(cid:12)cation Research , 2004 .

[16]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[17]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[18]  Jure Leskovec,et al.  Inferring networks of diffusion and influence , 2010, KDD.

[19]  Duncan J Watts,et al.  A simple model of global cascades on random networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Masahiro Kimura,et al.  Learning Continuous-Time Information Diffusion Model for Social Behavioral Data Analysis , 2009, ACML.

[21]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[22]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[23]  Masahiro Kimura,et al.  Generative Models of Information Diffusion with Asynchronous Timedelay , 2010, ACML.

[24]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[25]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[26]  Michael I. Jordan,et al.  Link Analysis, Eigenvectors and Stability , 2001, IJCAI.

[27]  H. Akaike A new look at the Bayes procedure , 1978 .

[28]  Masahiro Kimura,et al.  Extracting Influential Nodes for Information Diffusion on a Social Network , 2007, AAAI.

[29]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[30]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[31]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[32]  Masahiro Kimura,et al.  Behavioral Analyses of Information Diffusion Models by Observed Data of Social Network , 2010, SBP.

[33]  Masahiro Kimura,et al.  Finding Influential Nodes in a Social Network from Information Diffusion Data , 2009 .

[34]  Masahiro Kimura,et al.  Selecting Information Diffusion Models over Social Networks for Behavioral Analysis , 2010, ECML/PKDD.

[35]  Masahiro Kimura,et al.  Learning Diffusion Probability Based on Node Attributes in Social Networks , 2011, ISMIS.

[36]  Jure Leskovec,et al.  On the Convexity of Latent Social Network Inference , 2010, NIPS.

[37]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[38]  D. Watts,et al.  Influentials, Networks, and Public Opinion Formation , 2007 .