Challenges to Estimating Contagion Effects from Observational Data

A growing body of literature attempts to learn about contagion using observational (i.e., non-experimental) data collected from a single social network. While the conclusions of these studies may be correct, the methods rely on assumptions that are likely—and sometimes guaranteed to be—false, and therefore the evidence for the conclusions is often weaker than it seems. Developing methods that do not need to rely on implausible assumptions is an incredibly challenging and important open problem in statistics. Appropriate methods don’t (yet!) exist, so researchers hoping to learn about contagion from observational social network data are sometimes faced with a dilemma: they can abandon their research program, or they can use inappropriate methods. This chapter will focus on the challenges and the open problems and will not weigh in on that dilemma, except to mention here that the most responsible way to use any statistical method, especially when it is well-known that the assumptions on which it rests do not hold, is with a healthy dose of skepticism, with honest acknowledgment and deep understanding of the limitations, and with copious caveats about how to interpret the results.

[1]  Michael G Hudgens,et al.  Causal Inference for Vaccine Effects on Infectiousness , 2012, The international journal of biostatistics.

[2]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.

[3]  N. Christakis,et al.  Social Network Sensors for Early Detection of Contagious Outbreaks , 2010, PloS one.

[4]  S. Lauritzen,et al.  Chain graph models and their causal interpretations , 2002 .

[5]  M. Halloran,et al.  Causal Inference in Infectious Diseases , 1995, Epidemiology.

[6]  Frank Goetzke,et al.  Network Effects in Public Transit Use: Evidence from a Spatially Autoregressive Mode Choice Model for New York , 2008 .

[7]  Zengo Furukawa,et al.  A General Framework for , 1991 .

[8]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[9]  Tom A. B. Snijders,et al.  Introduction to stochastic actor-based models for network dynamics , 2010, Soc. Networks.

[10]  J. Murabito,et al.  The Spread of Alcohol Consumption Behavior in a Large Social Network , 2010, Annals of Internal Medicine.

[11]  Jake Bowers,et al.  Reasoning about Interference Between Units: A General Framework , 2013, Political Analysis.

[12]  Brendan Nyhan,et al.  The "unfriending" problem: The consequences of homophily in friendship retention for causal estimates of social influence , 2010, Soc. Networks.

[13]  Cosma Rohilla Shalizi,et al.  Homophily and Contagion Are Generically Confounded in Observational Social Network Studies , 2010, Sociological methods & research.

[14]  N. Christakis,et al.  The Spread of Obesity in a Large Social Network Over 32 Years , 2007, The New England journal of medicine.

[15]  C. Manski Identification of Endogenous Social Effects: The Reflection Problem , 1993 .

[16]  Jason M. Fletcher,et al.  Is Obesity Contagious? Social Networks vs. Environmental Factors in the Obesity Epidemic , 2008, Journal of Health Economics.

[17]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[18]  A. Thomas The social contagion hypothesis: comment on ‘Social contagion theory: examining dynamic social networks and human behavior’ , 2012, Statistics in medicine.

[19]  Changhui Kang Classroom peer effects and academic achievement: Quasi-randomization evidence from South Korea , 2007 .

[20]  N. Christakis,et al.  Alone in the Crowd: The Structure and Spread of Loneliness in a Large Social Network , 2009 .

[21]  Russell Lyons,et al.  The Spread of Evidence-Poor Medicine via Flawed Social-Network Analysis , 2010, 1007.2876.

[22]  T. VanderWeele Sensitivity Analysis for Contagion Effects in Social Networks , 2011, Sociological methods & research.

[23]  Elizabeth L. Ogburn,et al.  Causal Inference for Social Network Data , 2017, Journal of the American Statistical Association.

[24]  P. Rosenbaum Interference Between Units in Randomized Experiments , 2007 .

[25]  M. Hernán A definition of causal effect for epidemiological research , 2004, Journal of Epidemiology and Community Health.

[26]  M. Hudgens,et al.  Toward Causal Inference With Interference , 2008, Journal of the American Statistical Association.

[27]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[28]  D. Lazer,et al.  The Coevolution of Networks and Political Attitudes , 2010 .

[29]  Tyler J VanderWeele,et al.  On causal inference in the presence of interference , 2012, Statistical methods in medical research.

[30]  Elizabeth L. Ogburn,et al.  Causal diagrams for interference , 2014, 1403.1239.

[31]  Steven F. Railsback,et al.  Agent-Based and Individual-Based Modeling: A Practical Introduction , 2011 .

[32]  T. Snijders,et al.  Modeling the Coevolution of Networks and Behavior , 2007 .

[33]  J Mark,et al.  Causal Inference for Networks , 2012 .

[34]  Greg Ver Steeg,et al.  Ruling out latent homophily in social networks , 2010 .

[35]  Christian Steglich,et al.  Beyond dyadic interdependence: Actor-oriented models for co-evolving social networks and individual behaviors , 2007 .

[36]  Sander Greenland,et al.  An introduction to instrumental variables for epidemiologists. , 2018, International journal of epidemiology.

[37]  Joshua D. Angrist,et al.  Mostly Harmless Econometrics: An Empiricist's Companion , 2008 .

[38]  Cosma Rohilla Shalizi Comment on "Why and When 'Flawed' Social Network Analyses Still Yield Valid Tests of no Contagion" , 2012, Statistics, politics, and policy.

[39]  J. Garland THE NEW ENGLAND JOURNAL OF MEDICINE , 1977, The Lancet.

[40]  R. Fisher,et al.  On the Mathematical Foundations of Theoretical Statistics , 1922 .

[41]  Peter M. Aronow,et al.  Estimating Average Causal Effects Under General Interference , 2012 .

[42]  S. Raudenbush,et al.  Evaluating Kindergarten Retention Policy , 2006 .

[43]  Michael E. Sobel,et al.  What Do Randomized Studies of Housing Mobility Demonstrate? , 2006 .

[44]  Lung-fei Lee,et al.  Asymptotic Distributions of Quasi-Maximum Likelihood Estimators for Spatial Autoregressive Models , 2004 .

[45]  Christopher Winship,et al.  Endogenous Selection Bias: The Problem of Conditioning on a Collider Variable. , 2014, Annual review of sociology.

[46]  A. James O’Malley,et al.  The analysis of social networks , 2008, Health Services and Outcomes Research Methodology.

[47]  Michael G. Hudgens,et al.  Large Sample Randomization Inference of Causal Effects in the Presence of Interference , 2014, Journal of the American Statistical Association.

[48]  N. Christakis,et al.  SUPPLEMENTARY ONLINE MATERIAL FOR: The Collective Dynamics of Smoking in a Large Social Network , 2022 .

[49]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[50]  J. Besag On Spatial-Temporal Models and Markov Fields , 1977 .

[51]  Eric J Tchetgen Tchetgen,et al.  Why and When "Flawed" Social Network Analyses Still Yield Valid Tests of no Contagion , 2012, Statistics, politics, and policy.

[52]  S. Greenland An introduction To instrumental variables for epidemiologists , 2000, International journal of epidemiology.

[53]  Xu Lin,et al.  Peer Effects and Student Academic Achievement: An Application of Spatial Autoregressive Model with Group Unobservables , 2007 .

[54]  A. Zaslavsky,et al.  Estimating Peer Effects in Longitudinal Dyadic Data Using Instrumental Variables , 2014, Biometrics.

[55]  Mir M. Ali,et al.  Estimating peer effects in adolescent smoking behavior: a longitudinal analysis. , 2009, The Journal of adolescent health : official publication of the Society for Adolescent Medicine.

[56]  Tyler J. VanderWeele,et al.  Vaccines, Contagion, and Social Networks , 2014, ArXiv.