Early warning analysis for social diffusion events

There is considerable interest in developing predictive capabilities for social diffusion processes, for instance to permit early identification of emerging contentious situations, rapid detection of disease outbreaks, or accurate forecasting of the ultimate reach of potentially “viral” ideas or behaviors. This paper proposes a new approach to this predictive analytics problem, in which analysis of meso-scale network dynamics is leveraged to generate useful predictions for complex social phenomena. We begin by deriving a stochastic hybrid dynamical systems (S-HDS) model for diffusion processes taking place over social networks with realistic topologies; this modeling approach is inspired by recent work in biology demonstrating that S-HDS offer a useful mathematical formalism with which to represent complex, multi-scale biological network dynamics. We then perform formal stochastic reachability analysis with this S-HDS model and conclude that the outcomes of social diffusion processes may depend crucially upon the way the early dynamics of the process interacts with the underlying network’s community structure and core-periphery structure. This theoretical finding provides the foundations for developing a machine learning algorithm that enables accurate early warning analysis for social diffusion events. The utility of the warning algorithm, and the power of network-based predictive metrics, are demonstrated through an empirical investigation of the propagation of political “memes” over social media networks. Additionally, we illustrate the potential of the approach for security informatics applications through case studies involving early warning analysis of large-scale protests events and politically-motivated cyber attacks.

[1]  A. J. Hall Infectious diseases of humans: R. M. Anderson & R. M. May. Oxford etc.: Oxford University Press, 1991. viii + 757 pp. Price £50. ISBN 0-19-854599-1 , 1992 .

[2]  E. Rogers,et al.  Diffusion of Innovations , 1964 .

[3]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[4]  Max Planck,et al.  Automatically identifying the sources of large Internet events , 2010, 2010 IEEE International Conference on Intelligence and Security Informatics.

[5]  N. Christakis,et al.  Social Network Sensors for Early Detection of Contagious Outbreaks , 2010, PloS one.

[6]  Richard Colbaugh,et al.  Early warning analysis for social diffusion events , 2010, ISI.

[7]  John Lygeros,et al.  Toward a General Theory of Stochastic Hybrid Systems , 2006 .

[8]  Richard Colbaugh,et al.  Predictability of 'Unpredictable' Cultural Markets , 2010 .

[9]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[10]  Richard Colbaugh,et al.  Predictive Analysis for Social Processes , 2009, AAAI Spring Symposium: Technosocial Predictive Analytics.

[11]  Matthew J. Salganik,et al.  Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market , 2006, Science.

[12]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[13]  David M. Pennock,et al.  Predicting consumer behavior with Web search , 2010, Proceedings of the National Academy of Sciences.

[14]  Bernard P. Zeigler,et al.  Discrete Event Multi-level Models for Systems Biology , 2005, Trans. Comp. Sys. Biology.

[15]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[16]  Pamela Oliver,et al.  Working Paper and Technical Report Series the Opposing Forces Diffusion Model: the Initiation and Repression of Collective Violence , 2022 .

[17]  L. Bettencourt,et al.  The power of a good idea: Quantitative modeling of the spread of ideas from epidemiological models , 2005, physics/0502067.

[18]  John Lygeros,et al.  Stochastic hybrid delay population dynamics: Well-posed models and extinction , 2009, Journal of biological dynamics.

[19]  A. Krueger,et al.  Attitudes and Action: Public Opinion and the Occurrence of International Terrorism , 2009, Science.

[20]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[21]  E. Rogers,et al.  Diffusion of Innovations, 5th Edition , 2003 .

[22]  A. Moghadam,et al.  The Globalization of Martyrdom: Al Qaeda, Salafi Jihad, and the Diffusion of Suicide Attacks , 2008 .

[23]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Jon M. Kleinberg,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World [Book Review] , 2013, IEEE Technol. Soc. Mag..

[25]  W. D. Walls,et al.  Modeling Movie Success When ‘Nobody Knows Anything’: Conditional Stable-Distribution Analysis Of Film Returns , 2005 .

[26]  John C Doyle,et al.  Architecture, constraints, and behavior , 2011, Proceedings of the National Academy of Sciences.

[27]  Juli'an Candia,et al.  Mass media influence spreading in social networks with community structure , 2008, 0806.1762.

[28]  M. Bradley,et al.  Affective Normsfor English Words (ANEW): Stimuli, instruction manual and affective ratings (Tech Report C-1) , 1999 .

[29]  Tad Hogg,et al.  Using Stochastic Models to Describe and Predict Social Dynamics of Web Users , 2010, TIST.

[30]  Ganesh Ramakrishnan,et al.  Passage Scoring for Question Answering via Bayesian Inference on Lexical Relations , 2003, TREC.

[31]  E. David,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World , 2010 .

[32]  Grigoriy E. Pinchuk,et al.  Stochastic hybrid modeling of DNA replication across a complete genome , 2009 .

[33]  Richard Colbaugh,et al.  Predictive analysis for social processes I: Multi-scale hybrid system modeling , 2009, 2009 IEEE Control Applications, (CCA) & Intelligent Control, (ISIC).

[34]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[35]  Richard Colbaugh,et al.  Proactive defense for evolving cyber threats , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[36]  Sean P. O'Brien,et al.  Crisis Early Warning and Decision Support: Contemporary Approaches and Thoughts on Future Research , 2010 .

[37]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[38]  Kristin M. Bakke,et al.  The perils of policy by p-value: Predicting civil conflicts , 2010 .

[39]  Yuval Shavitt,et al.  A model of Internet topology using k-shell decomposition , 2007, Proceedings of the National Academy of Sciences.

[40]  Peter Hedström,et al.  Mesolevel Networks and the Diffusion of Social Movements: The Case of the Swedish Social Democratic Party1 , 2000, American Journal of Sociology.

[41]  Antonis Papachristodoulou,et al.  Advanced Methods and Algorithms for Biological Networks Analysis , 2006, Proceedings of the IEEE.

[42]  P. Parrilo Structured semidefinite programs and semialgebraic geometry methods in robustness and optimization , 2000 .

[43]  Ádám M. Halász,et al.  Stochastic Modeling and Control of Biological Systems: The Lactose Regulation System of Escherichia Coli , 2008, IEEE Transactions on Automatic Control.

[44]  H. Kushner Stochastic Stability and Control , 2012 .

[45]  Richard Colbaugh,et al.  Detecting emerging topics and trends via predictive analysis of ‘meme’ dynamics , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[46]  M. Bradley,et al.  Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings , 1999 .

[47]  Richard Colbaugh,et al.  Web Analytics for Security Informatics , 2011, 2011 European Intelligence and Security Informatics Conference.