The Econometrics of Randomized Experiments

Abstract In this chapter, we present econometric and statistical methods for analyzing randomized experiments. For basic experiments, we stress randomization-based inference as opposed to sampling-based inference. In randomization-based inference, uncertainty in estimates arises naturally from the random assignment of the treatments, rather than from hypothesized sampling from a large population. We show how this perspective relates to regression analyses for randomized experiments. We discuss the analyses of stratified, paired, and clustered randomized experiments, and we stress the general efficiency gains from stratification. We also discuss complications in randomized experiments such as noncompliance. In the presence of noncompliance, we contrast intention-to-treat analyses with instrumental variables analyses allowing for general treatment effect heterogeneity. We consider, in detail, estimation and inference for heterogenous treatment effects in settings with (possibly many) covariates. These methods allow researchers to explore heterogeneity by identifying subpopulations with different treatment effects while maintaining the ability to construct valid confidence intervals. We also discuss optimal assignment to treatment based on covariates in such settings. Finally, we discuss estimation and inference in experiments in settings with interactions between units, both in general network settings and in settings where the population is partitioned into groups with all interactions contained within these groups.

[1]  C. Manski Identification of Endogenous Social Effects: The Reflection Problem , 1993 .

[2]  Myoung‐jae Lee Micro-Econometrics for Policy, Program, and Treatment Effects , 2005 .

[3]  G. W. Snedecor STATISTICAL METHODS , 1967 .

[4]  Kjell A. Doksum,et al.  Empirical Probability Plots and Statistical Inference for Nonlinear Models in the Two-Sample Case , 1974 .

[5]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[6]  J. Kagel,et al.  Handbook of Experimental Economics , 1997 .

[7]  Stephen P. Ryan,et al.  Incentives Work: Getting Teachers to Come to School , 2012 .

[8]  M. Zelen A new design for randomized clinical trials. , 1979, The New England journal of medicine.

[9]  John A. List,et al.  Multiple hypothesis testing in experimental economics , 2016, Experimental Economics.

[10]  Guido W. Imbens,et al.  Complementarity and Aggregate Implications of Assortative Matching: A Nonparametric Analysis , 2009 .

[11]  Esther Duflo,et al.  Field Experiments on Discrimination , 2016 .

[12]  D. Romer,et al.  A New Measure of Monetary Shocks: Derivation and Implications , 2003 .

[13]  Avi Feller,et al.  Principal Stratification , 2015 .

[14]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[15]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[16]  W. Shadish,et al.  Experimental and Quasi-Experimental Designs for Generalized Causal Inference , 2001 .

[17]  Vasant Honavar,et al.  Transportability from Multiple Environments with Limited Experiments , 2013, NIPS.

[18]  N Segnan,et al.  Adjusting for non-compliance and contamination in randomized clinical trials. , 1997, Statistics in medicine.

[19]  Dennis L. Sun,et al.  Optimal Inference After Model Selection , 2014, 1410.2597.

[20]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[21]  Lu Tian,et al.  A Simple Method for Detecting Interactions between a Treatment and a Large Number of Covariates , 2012, 1212.2995.

[22]  David A. Freedman,et al.  On regression adjustments to experimental data , 2008, Adv. Appl. Math..

[23]  K. Hirano,et al.  Asymptotics for Statistical Treatment Rules , 2009 .

[24]  A Donner,et al.  Statistical methodology for paired cluster designs. , 1987, American journal of epidemiology.

[25]  F. J. Anscombe,et al.  The Validity of Comparative Experiments , 1948 .

[26]  Jon M. Kleinberg,et al.  Graph cluster randomization: network exposure to multiple universes , 2013, KDD.

[27]  J. Angrist,et al.  The Review of Economics and Statistics , 2008 .

[28]  Miriam Bruhn,et al.  In Pursuit of Balance: Randomization in Practice in Development Field Experiments , 2008 .

[29]  Xiaogang Su,et al.  Subgroup Analysis via Recursive Partitioning , 2009 .

[30]  D. Rubin,et al.  Estimating Outcome Distributions for Compliers in Instrumental Variables Models , 1997 .

[31]  Karl E. Peace,et al.  Intention to treat in clinical trials , 1989 .

[32]  J. M. Taylor,et al.  Subgroup identification from randomized clinical trial data , 2011, Statistics in medicine.

[33]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[34]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[35]  Charles F. Manski,et al.  Learning about Treatment Effects from Experiments with Random Assignment of Treatments , 1996 .

[36]  Philip K. Robins,et al.  A Comparison of the Labor Supply Findings from the Four Negative Income Tax Experiments , 1985 .

[37]  Jerry A. Hausman,et al.  Specification and estimation of simultaneous equation models , 1983 .

[38]  Rachel Glennerster,et al.  Running Randomized Evaluations: A Practical Guide , 2013 .

[39]  D. Cox,et al.  The Theory of the Design of Experiments , 2000 .

[40]  Mirkin Boris,et al.  Clustering: A Data Recovery Approach , 2012 .

[41]  Abhijit Banerjee,et al.  The Experimental Approach to Development Economics , 2008 .

[42]  Thomas D. Cook,et al.  Introduction to Statistical Methods for Clinical Trials , 2007 .

[43]  D. Green,et al.  Modeling Heterogeneous Treatment Effects in Survey Experiments with Bayesian Additive Regression Trees , 2012 .

[44]  Oscar Kempthorne,et al.  THE RANDOMIZATION THEORY OF' EXPERIMENTAL INFERENCE* , 1955 .

[45]  O. Kempthorne The Design and Analysis of Experiments , 1952 .

[46]  D. Freedman Statistical Models for Causation , 2006, Evaluation review.

[47]  Edward Miguel,et al.  Worms: Identifying Impacts on Education and Health in the Presence of Treatment Externalities, Data User's Guide , 2014 .

[48]  Roseanne McNamee,et al.  Intention to treat, per protocol, as treated and instrumental variable estimators given non‐compliance and effect heterogeneity , 2009, Statistics in medicine.

[49]  Stuart G Baker,et al.  Latent class instrumental variables: a clinical and biostatistical perspective , 2016, Statistics in medicine.

[50]  H. Chipman,et al.  BART: Bayesian Additive Regression Trees , 2008, 0806.3286.

[51]  V. Chernozhukov,et al.  An IV Model of Quantile Treatment Effects , 2002 .

[52]  Stuart G. Baker,et al.  Analyzing a Randomized Cancer Prevention Trial with a Missing Binary Outcome, an Auxiliary Variable, and All-or-None Compliance , 2000 .

[53]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[54]  Brett Myors,et al.  Statistical Power Analysis , 2014 .

[55]  Sendhil Mullainathan,et al.  Site Selection Bias in Program Evaluation , 2012 .

[56]  Matt Taddy,et al.  Heterogeneous Treatment Effects in Digital Experimentation , 2014, 1412.8563.

[57]  C. Manski Nonparametric Bounds on Treatment Effects , 1989 .

[58]  J. S. Hunter,et al.  Statistics for Experimenters: Design, Innovation, and Discovery , 2006 .

[59]  Esther Duflo,et al.  Do Labor Market Policies Have Displacement Effects? Evidence from a Clustered Randomized Experiment , 2012 .

[60]  Abhijit Banerjee,et al.  Decision Theoretic Approaches to Experiment Design and External Validity , 2016 .

[61]  S S Ellenberg,et al.  Randomized consent designs for clinical trials: an update. , 1992, Statistics in Medicine.

[62]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1994 .

[63]  P. Rosenbaum Design of Observational Studies , 2009, Springer Series in Statistics.

[64]  V. J. Hotz,et al.  Predicting the efficacy of future training programs using past experiences at other locations , 2005 .

[65]  D. Rubin,et al.  Assessing the effect of an influenza vaccine in an encouragement design. , 2000, Biostatistics.

[66]  P. Rosenbaum Covariance Adjustment in Randomized Experiments and Observational Studies , 2002 .

[67]  Matt Taddy,et al.  Heterogeneous Treatment Effects in Digital Experimentation , 2014 .

[68]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1995 .

[69]  Marie Schmidt,et al.  Nonparametrics Statistical Methods Based On Ranks , 2016 .

[70]  S. Senn Testing for baseline balance in clinical trials. , 1994, Statistics in medicine.

[71]  Jennifer Hill,et al.  A Broader Template for Analyzing Broken Randomized Experiments , 1998 .

[72]  D. Cox Causality : some statistical aspects , 1992 .

[73]  Thomas Wensing,et al.  Analysis and Optimization , 2011 .

[74]  C. Manski Statistical treatment rules for heterogeneous populations , 2003 .

[75]  John Langford,et al.  The offset tree for learning with partial labels , 2008, KDD.

[76]  Peter M. Aronow,et al.  Estimating Average Causal Effects Under Interference Between Units , 2013, 1305.6156.

[77]  Jesse Rothstein,et al.  Social Experiments in the Labor Market , 2016 .

[78]  Sergio Firpo Efficient Semiparametric Estimation of Quantile Treatment Effects , 2004 .

[79]  William G. Cochran,et al.  Experimental Designs, 2nd Edition , 1950 .

[80]  P Diehr,et al.  Breaking the matches in a paired t-test for community interventions when the number of pairs is small. , 1995, Statistics in medicine.

[81]  Rajeev Dehejia,et al.  Program Evaluation as a Decision Problem , 1999 .

[82]  D. Rubin,et al.  Bayesian inference for causal effects in randomized experiments with noncompliance , 1997 .

[83]  Douglas G. Altman,et al.  Practical statistics for medical research , 1990 .

[84]  D. Green,et al.  Modeling heterogeneous treatment effects in large-scale experiments using Bayesian Additive Regression Trees , 2010 .

[85]  C. F. Jeff Wu,et al.  Experiments , 2021, Wiley Series in Probability and Statistics.

[86]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[87]  Richard K. Crump,et al.  Nonparametric Tests for Treatment Effect Heterogeneity , 2006, The Review of Economics and Statistics.

[88]  G. Imbens,et al.  Better Late than Nothing: Some Comments on Deaton (2009) and Heckman and Urzua (2009) , 2009 .

[89]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[90]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[91]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[92]  J. Angrist Mostly Harmless Econometrics , 2008 .

[93]  Peter Haan Manski, Charles: Public policy in an uncertain world , 2014 .

[94]  J. Pearl,et al.  Bounds on Treatment Effects from Studies with Imperfect Compliance , 1997 .

[95]  R. Lalonde Evaluating the Econometric Evaluations of Training Programs with Experimental Data , 1984 .

[96]  H. Weisberg,et al.  Post hoc subgroups in clinical trials: Anathema or analytics? , 2015, Clinical trials.

[97]  J. Angrist,et al.  Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings , 1999 .

[98]  Paul R. Rosenbaum,et al.  Robust, accurate confidence intervals with a weak instrument: quarter of birth and education , 2005 .

[99]  Edward Miguel,et al.  Reshaping Institutions: Evidence on Aid Impacts Using a Pre-Analysis Plan , 2011 .

[100]  E. L. Lehmann,et al.  Basic Concepts of Probability and Statistics , 1964 .

[101]  D. Rubin,et al.  Principal Stratification in Causal Inference , 2002, Biometrics.

[102]  J. Angrist,et al.  Causal Effects of Monetary Shocks: Semiparametric Conditional Independence Tests with a Multinomial Propensity Score , 2011, Review of Economics and Statistics.

[103]  Peter M. Aronow,et al.  On equivalencies between design-based and regression-based variance estimators for randomized experiments , 2012 .

[104]  P. Aronow,et al.  Estimating Average Causal Effects Under Interference Between Units , 2015 .

[105]  Rachael Meager,et al.  Understanding the Impact of Microcredit Expansions: A Bayesian Hierarchical Analysis of 7 Randomised Experiments , 2015, 1506.06669.

[106]  Stefan Wager,et al.  High-dimensional regression adjustments in randomized experiments , 2016, Proceedings of the National Academy of Sciences.

[107]  R. Glennerster,et al.  The Practicalities of Running Randomized Evaluations: Partnerships, Measurement, Ethics, and Transparency , 2017 .

[108]  Stefan Wager,et al.  Adaptive Concentration of Regression Trees, with Application to Random Forests , 2015 .

[109]  F. Eicker Limit Theorems for Regressions with Unequal and Dependent Errors , 1967 .

[110]  E. Keeler,et al.  Health insurance and the demand for medical care: evidence from a randomized experiment. , 1987, The American economic review.

[111]  P. Holland Statistics and Causal Inference , 1985 .

[112]  G. Imbens,et al.  Exact p-Values for Network Interference , 2015, 1506.02084.

[113]  Joshua D. Angrist,et al.  Treatment Effect Heterogeneity in Theory and Practice , 2003 .

[114]  Marc Ratkovic,et al.  Estimating treatment effect heterogeneity in randomized program evaluation , 2013, 1305.5682.

[115]  Guido W. Imbens,et al.  External Validity in Fuzzy Regression Discontinuity Designs , 2014, Journal of Business & Economic Statistics.

[116]  Stefan Wager,et al.  Uniform Convergence of Random Forests via Adaptive Concentration , 2015 .

[117]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[118]  Benjamin A. Olken,et al.  Promises and Perils of Pre-analysis Plans , 2015 .

[119]  Boris Mirkin Clustering: A Data Recovery Approach, Second Edition , 2012 .

[120]  C. Manski Partial Identification of Probability Distributions , 2003 .

[121]  Debopam Bhattacharya,et al.  Inferring Welfare Maximizing Treatment Assignment Under Budget Constraints , 2008 .

[122]  P. Aronow A General Method for Detecting Interference Between Units in Randomized Experiments , 2010 .

[123]  M. Hudgens,et al.  Toward Causal Inference With Interference , 2008, Journal of the American Statistical Association.

[124]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[125]  S G Baker,et al.  The paired availability design: a proposal for evaluating epidural analgesia during labor. , 1994, Statistics in medicine.

[126]  J. Neyman,et al.  Statistical Problems in Agricultural Experimentation , 1935 .

[127]  W. Lin,et al.  Agnostic notes on regression adjustments to experimental data: Reexamining Freedman's critique , 2012, 1208.2301.

[128]  Angus Deaton Instruments, Randomization, and Learning about Development , 2010 .

[129]  T. Shakespeare,et al.  Observational Studies , 2003 .

[130]  James E. West,et al.  From Natural Variation to Optimal Policy? the Importance of Endogenous Peer Group Formation * 1 Data 1.1 the Dataset 2 Experimental Design and Sorting Methodology , 2022 .

[131]  Susan Athey,et al.  Finite Population Causal Standard Errors , 2014 .

[132]  J. Putter,et al.  Basic Concepts of Probability and Statistics , 1965 .

[133]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[134]  Rebecca B. Morton,et al.  Experimental Political Science and the Study of Causality: References , 2010 .

[135]  Howard S. Bloom,et al.  Accounting for No-Shows in Experimental Evaluation Designs , 1984 .

[136]  Azeem M. Shaikh,et al.  Hypothesis Testing in Econometrics , 2009 .

[137]  R J Carroll,et al.  On design considerations and randomization-based inference for community intervention trials. , 1996, Statistics in medicine.

[138]  D. Rubin Matched Sampling for Causal Effects , 2006 .

[139]  Peter Z. Schochet Is regression adjustment supported by the Neyman model for causal inference , 2007 .

[140]  Christopher Winship,et al.  Counterfactuals and Causal Inference by Stephen L. Morgan , 2014 .

[141]  David R. Cox,et al.  A Note on Weighted Randomization , 1956 .

[142]  John Langford,et al.  Doubly Robust Policy Evaluation and Learning , 2011, ICML.

[143]  D. Rubin The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials , 2007, Statistics in medicine.

[144]  K. Hornik,et al.  Model-Based Recursive Partitioning , 2008 .

[145]  E. Duflo,et al.  The Role of Information and Social Interactions in Retirement Plan Decisions: Evidence from a Randomized Experiment , 2002 .

[146]  O. L. Davies,et al.  Design and analysis of industrial experiments , 1954 .

[147]  Kung-Jong Lui Binary Data Analysis of Randomized Clinical Trials with Noncompliance , 2011 .

[148]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[149]  M. H. Gail,et al.  Tests for no treatment e?ect in randomized clinical trials , 1988 .

[150]  Christian Hansen,et al.  Post-Selection and Post-Regularization Inference in Linear Models with Many Controls and Instruments , 2015, 1501.03185.

[151]  E. Lehmann,et al.  Nonparametrics: Statistical Methods Based on Ranks , 1976 .

[152]  Michal Kolesár,et al.  Robust Standard Errors in Small Samples: Some Practical Advice , 2012, Review of Economics and Statistics.

[153]  Paul R. Rosenbaum,et al.  Randomization Inference with an Instrumental Variable , 2005 .

[154]  Michael G. Hudgens,et al.  Large Sample Randomization Inference of Causal Effects in the Presence of Interference , 2014, Journal of the American Statistical Association.

[155]  D. Cox A note on data-splitting for the evaluation of significance levels , 1975 .

[156]  Cun-Hui Zhang,et al.  Lasso adjustments of treatment effect estimates in randomized experiments , 2015, Proceedings of the National Academy of Sciences.

[157]  Morteza Haghiri Applied Nonparametric Regression Analysis: The Choice of Generalized Additive Models , 2013 .

[158]  Dean Eckles,et al.  Design and Analysis of Experiments in Networks: Reducing Bias from Interference , 2014, ArXiv.

[159]  Kari Lock Morgan,et al.  Rerandomization to improve covariate balance in experiments , 2012, 1207.5625.

[160]  R Fisher,et al.  Design of Experiments , 1936 .

[161]  C. McCulloch,et al.  When does it pay to break the matches for analysis of a matched-pairs design? , 1992, Biometrics.

[162]  Xun Chen A note on non-parametric ANCOVA for covariate adjustment in randomized clinical trials. , 2005, Statistics in medicine.