Applicability of Multilayer Diffusion Network Inference to Social Media Data

Information on social media spreads through an underlying diffusion network that connects people of common interests and opinions. This diffusion network often comprises multiple layers, each capturing the spreading dynamics of a certain type of information characterized by, for example, topic, attitude, or language. Researchers have previously proposed methods to infer these underlying multilayer diffusion networks from observed spreading patterns, but little is known about how well these methods perform across the range of realistic spreading data. In this paper, we first introduce an effective implementation of the inference method that can achieve higher accuracy than existing implementations in comparable runtime. Then, we conduct an extensive series of synthetic data experiments to systematically analyze the performance of the method, under varied network structure (e.g. density, number of layers) and information diffusion settings (e.g. cascade size, layer mixing) that are designed to mimic real-world spreading on social media. Our findings include that the inference accuracy varies extremely with network density, and that the method fails to decompose the diffusion network correctly when most cascades in the data reach a limited audience. In demonstrating the conditions under which the inference accuracy is extremely low, our paper highlights the need to carefully evaluate the applicability of the method before running the inference on real data. Practically, our results serve as a reference for this evaluation, and our publicly available implementation supports further testing under personalized settings.

[1]  Jennifer M. Larson,et al.  #polisci Twitter: A Descriptive Analysis of how Political Scientists Use Twitter in 2019 , 2020, Perspectives on Politics.

[2]  Zhoujun Li,et al.  Diabetes-Associated Factors as Predictors of Nursing Home Admission and Costs in the Elderly Across Europe. , 2017, Journal of the American Medical Directors Association.

[3]  Thomas Josef Liniger,et al.  Multivariate Hawkes processes , 2009 .

[4]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[5]  Leysia Palen,et al.  (How) will the revolution be retweeted?: information diffusion and the 2011 Egyptian uprising , 2012, CSCW.

[6]  Bernhard Schölkopf,et al.  Uncovering the Temporal Dynamics of Diffusion Networks , 2011, ICML.

[7]  Yan Liu,et al.  Not Enough Data?: Joint Inferring Multiple Diffusion Networks via Network Generation Priors , 2017, WSDM.

[8]  Yan Xia,et al.  Spread of tweets in climate discussions: A case study of the 2019 Nobel Peace Prize announcement , 2020, Nordic Journal of Media Studies.

[9]  Mason A. Porter,et al.  Multilayer networks , 2013, J. Complex Networks.

[10]  C. Spearman The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[11]  Hong Cheng,et al.  A Model-Free Approach to Infer the Diffusion Network from Event Cascade , 2016, CIKM.

[12]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[13]  Ming-Syan Chen,et al.  Uncovering Multiple Diffusion Networks Using the First-Hand Sharing Pattern , 2016, SDM.

[14]  Cécile Favre,et al.  Information diffusion in online social networks: a survey , 2013, SGMD.

[15]  James R. Foulds,et al.  HawkesTopic: A Joint Model for Network Inference and Topic Modeling from Text-Based Cascades , 2015, ICML.

[16]  Jure Leskovec,et al.  Inferring networks of diffusion and influence , 2010, KDD.

[17]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[18]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[19]  M. Newman Spread of epidemic disease on networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Jiejun Xu,et al.  Rolling through tumblr: characterizing behavioral patterns of the microblogging platform , 2014, WebSci '14.

[21]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[22]  Sinan Aral,et al.  The spread of true and false news online , 2018, Science.

[23]  Jianxin Li,et al.  Inferring Multiplex Diffusion Network via Multivariate Marked Hawkes Process , 2018, ArXiv.

[24]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Le Song,et al.  Uncover Topic-Sensitive Information Diffusion Networks , 2013, AISTATS.

[26]  Bernhard Schölkopf,et al.  Structure and dynamics of information pathways in online media , 2012, WSDM.

[27]  Ming-Syan Chen,et al.  Cluster cascades: Infer multiple underlying networks using diffusion data , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[28]  Jure Leskovec,et al.  On the Convexity of Latent Social Network Inference , 2010, NIPS.

[29]  Le Song,et al.  Learning Networks of Heterogeneous Influence , 2012, NIPS.

[30]  Justin Cheng,et al.  Rumor Cascades , 2014, ICWSM.