Model-assisted design of experiments in the presence of network-correlated outcomes

We consider the problem of how to assign treatment in a randomized experiment, in which the correlation among the outcomes is informed by a network available pre-intervention. Working within the potential outcome causal framework, we develop a class of models that posit such a correlation structure among the outcomes. Then we leverage these models to develop restricted randomization strategies for allocating treatment optimally, by minimizing the mean square error of the estimated average treatment effect. Analytical decompositions of the mean square error, due both to the model and to the randomization distribution, provide insights into aspects of the optimal designs. In particular, the analysis suggests new notions of balance based on specific network quantities, in addition to classical covariate balance. The resulting balanced, optimal restricted randomization strategies are still design unbiased, in situations where the model used to derive them does not hold. We illustrate how the proposed treatment allocation strategies improve on allocations that ignore the network structure, with extensive simulations.

[1]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[2]  Student,et al.  Comparison Between Balanced and Random Arrangements of Field Plots , 1938 .

[3]  H. L. Lucas,et al.  DESIGN OF EXPERIMENTS IN NON-LINEAR SITUATIONS , 1959 .

[4]  W. Youden Randomization and Experimentation , 1972 .

[5]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[6]  R. Simon,et al.  Restricted randomization designs in clinical trials. , 1979, Biometrics.

[7]  P. Holland Statistics and Causal Inference , 1985 .

[8]  W. Näther Optimum experimental designs , 1994 .

[9]  S. Wasserman,et al.  Social Network Analysis: List of Illustrations , 1994 .

[10]  H. Thorisson Coupling Methods in Probability Theory , 1995 .

[11]  Carl-Erik Särndal,et al.  Model Assisted Survey Sampling , 1997 .

[12]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[13]  P. Rosenbaum Covariance Adjustment in Randomized Experiments and Observational Studies , 2002 .

[14]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[15]  P. Rosenbaum Interference Between Units in Randomized Experiments , 2007 .

[16]  N. Christakis,et al.  The Spread of Obesity in a Large Social Network Over 32 Years , 2007, The New England journal of medicine.

[17]  M. Hudgens,et al.  Toward Causal Inference With Interference , 2008, Journal of the American Statistical Association.

[18]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.

[19]  Eric D. Kolaczyk,et al.  Statistical Analysis of Network Data: Methods and Models , 2009 .

[20]  Edoardo M. Airoldi,et al.  A Survey of Statistical Network Models , 2009, Found. Trends Mach. Learn..

[21]  P. Bickel,et al.  A nonparametric view of network models and Newman–Girvan and other modularities , 2009, Proceedings of the National Academy of Sciences.

[22]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[23]  Cosma Rohilla Shalizi,et al.  Homophily and Contagion Are Generically Confounded in Observational Social Network Studies , 2010, Sociological methods & research.

[24]  Cameron Marlow,et al.  A 61-million-person experiment in social influence and political mobilization , 2012, Nature.

[25]  Charles F. Manski,et al.  Identification of Treatment Response with Social Interactions , 2013 .

[26]  Lada A. Adamic,et al.  The role of social networks in information diffusion , 2012, WWW.

[27]  Kari Lock Morgan,et al.  Rerandomization to improve covariate balance in experiments , 2012, 1207.5625.

[28]  Edward K. Kao,et al.  Estimation of Causal Peer Influence Effects , 2013, ICML.

[29]  Jon M. Kleinberg,et al.  Graph cluster randomization: network exposure to multiple universes , 2013, KDD.

[30]  Tyler J. VanderWeele,et al.  Vaccines, Contagion, and Social Networks , 2014, ArXiv.

[31]  Dean Eckles,et al.  Design and Analysis of Experiments in Networks: Reducing Bias from Interference , 2014, ArXiv.

[32]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[33]  P. Tharmanathan,et al.  Use and reporting of restricted randomization: a review. , 2015, Journal of evaluation in clinical practice.

[34]  Anmol Bhasin,et al.  Network A/B Testing: From Sampling to Estimation , 2015, WWW.

[35]  E. Airoldi,et al.  A natural experiment of social network formation and dynamics , 2015, Proceedings of the National Academy of Sciences.

[36]  N. Christakis,et al.  Social network targeting to maximise population behaviour change: a cluster randomised controlled trial , 2015, The Lancet.

[37]  P. Aronow,et al.  Estimating Average Causal Effects Under Interference Between Units , 2015 .

[38]  Edoardo M. Airoldi,et al.  Assessing the Impact of Granular Privacy Controls on Content Sharing and Disclosure on Facebook , 2016, Inf. Syst. Res..

[39]  Edoardo M. Airoldi,et al.  Identification and Estimation of Treatment and Interference Effects in Observational Studies on Networks , 2016, Journal of the American Statistical Association.

[40]  Cosma Rohilla Shalizi,et al.  Controlling for Latent Homophily in Social Networks through Inferring Latent Locations , 2016, ArXiv.

[41]  Peter M. Aronow,et al.  Estimating Average Causal Effects Under Interference Between Units , 2013, 1305.6156.

[42]  D. Sussman,et al.  Elements of estimation theory for causal effects in the presence of network interference , 2017, 1702.03578.

[43]  Holly B. Shakya,et al.  Exploiting social influence to magnify population-level behaviour change in maternal and child health: study protocol for a randomised controlled trial of network targeting algorithms in rural Honduras , 2017, BMJ Open.

[44]  Donald B. Rubin,et al.  Asymptotic theory of rerandomization in treatment–control experiments , 2016, Proceedings of the National Academy of Sciences.