论文信息 - The Perils of Randomization Checks in the Analysis of Experiments

The Perils of Randomization Checks in the Analysis of Experiments

In the analysis of experimental data, randomization checks, also known as balance tests, are used to indicate whether a randomization has produced balance on various characteristics across experimental conditions. Randomization checks are popular in many fields although their merits have yet to be established. The grounds on which balance tests are generally justified include either 1) the credibility of experimental findings, and/or 2) the efficiency of the statistical model. We show that balance tests cannot improve either credibility or efficiency. The most common “remedy” resulting from a failed balance test is the inclusion as a covariate of a variable failing the test; this practice cannot improve the choice of statistical model. Other commonly suggested responses to failed balance tests such as post-stratification or re-randomization also fail to improve on methods that do not require balance tests. We advocate resisting reviewer requests for randomization checks in all but some narrowly defined circumstances.

Diana C. Mutz | Robin Pemantle | Diana Mutz | R. Pemantle | Robin Pemantle

[1] R. Abelson. Statistics As Principled Argument , 1995 .

[2] Diana C. Mutz. Population-Based Survey Experiments , 2011 .

[3] R. Little. Post-Stratification: A Modeler's Perspective , 1993 .

[4] Gary King,et al. Misunderstandings between experimentalists and observationalists about causal inference , 2008 .

[5] Jake Bowers. Cambridge Handbook of Experimental Political Science: Making Effects Manifest in Randomized Experiments , 2011 .

[6] Kosuke Imai,et al. Do Get-Out-the-Vote Calls Reduce Turnout? The Importance of Statistical Methods for Field Experiments , 2005, American Political Science Review.

[7] Jake Bowers,et al. Covariate balance in simple stratified and clustered comparative studies , 2008, 0808.3857.

[8] D. Altman. Comparability of Randomised Groups , 1985 .

[9] S. Pocock,et al. Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: current practiceand problems , 2002, Statistics in medicine.

[10] D. Freedman. Statistical Models and Causal Inference: On Regression Adjustments in Experiments with Several Treatments , 2008, 0803.3757.

[11] M. Feldstein. Multicollinearity and the Mean Square Error of Alternative Estimators , 1973 .

[12] S. Senn. Testing for baseline balance in clinical trials. , 1994, Statistics in medicine.

[13] Richard A. Berk,et al. Statistical Inference After Model Selection , 2010 .

[14] C B Begg,et al. Suspended judgment. Significance tests of covariate imbalance in clinical trials. , 1990, Controlled clinical trials.

[15] T. Permutt. Testing for imbalance of covariates in controlled experiments. , 1990, Statistics in medicine.

[16] M. Davidian,et al. Covariate adjustment for two‐sample treatment comparisons in randomized clinical trials: A principled yet flexible approach , 2008, Statistics in medicine.

[17] Kari Lock Morgan. Rerandomization to Improve Covariate Balance in Randomized Experiments , 2011 .

[18] D. Green,et al. The Effects of Canvassing, Telephone Calls, and Direct Mail on Voter Turnout: A Field Experiment , 2000, American Political Science Review.