An Optimal Method for Covariate Balancing and Its Properties

This article introduces a new randomization procedure to improve the covariate balance across treatment groups. Covariate balance is one of the most important concerns for successful comparative studies, such as causal inference and clinical trials, because it reduces bias and improves the accuracy of inference. However, chance imbalance may still exist in traditional randomized experiments, in which units are randomly allocated without using their covariate information. To address this issue, the proposed method allocates the units sequentially and adaptively, using information on the current level of imbalance and the incoming unit's covariate. With a large number of covariates or a large number of units, the proposed method shows substantial advantages over the traditional methods in terms of the covariate balance and computational time, making it an ideal technique in the era of big data. Furthermore, the proposed method attains the optimal covariate balance, in the sense that the estimated average treatment effect under the proposed method attains its minimum variance asymptotically. Numerical studies and real data analysis provide further evidence of the advantages of the proposed method.

[1]  Donald B Rubin,et al.  Rerandomization to Balance Tiers of Covariates , 2015, Journal of the American Statistical Association.

[2]  Feifang Hu,et al.  Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials , 2015 .

[3]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: A General Method for Estimating Sampling Variances for Standard Estimators for Average Causal Effects , 2015 .

[4]  Zhenjun Ma,et al.  Adaptive randomization for balancing over covariates , 2014 .

[5]  Feifang Hu,et al.  Balancing continuous covariates based on Kernel densities. , 2013, Contemporary clinical trials.

[6]  Dana Chandler,et al.  Breaking Monotony with Meaning: Motivation in Crowdsourcing Markets , 2012, ArXiv.

[7]  Yunzhi Lin,et al.  Balancing continuous and categorical baseline covariates in sequential clinical trials using the area between empirical cumulative distribution functions , 2012, Statistics in medicine.

[8]  F. Hu,et al.  Asymptotic properties of covariate-adaptive randomization , 2012, 1210.4666.

[9]  Kari Lock Morgan,et al.  Rerandomization to improve covariate balance in experiments , 2012, 1207.5625.

[10]  Maroussa Zagoraiou,et al.  The covariate-adaptive biased coin design for balancing clinical trials in the presence of prognostic factors , 2011 .

[11]  Kari Lock Morgan Rerandomization to Improve Covariate Balance in Randomized Experiments , 2011 .

[12]  Wenle Zhao,et al.  Quantifying the cost in power of ignoring continuous covariate imbalances in clinical trial randomization. , 2011, Contemporary clinical trials.

[13]  Kari Frazer Lock,et al.  Rerandomization to Improve Covariate Balance in Randomized Experiments , 2011 .

[14]  David G. Rand,et al.  The online laboratory: conducting experiments in a real labor market , 2010, ArXiv.

[15]  D. Taves,et al.  Rank-Minimization for balanced assignment of subjects in clinical trials. , 2010, Contemporary clinical trials.

[16]  D. Rubin For objective causal inference, design trumps analysis , 2008, 0811.1640.

[17]  David A. Freedman,et al.  On regression adjustments to experimental data , 2008, Adv. Appl. Math..

[18]  Stephen Senn,et al.  Controversies concerning randomization and additivity in clinical trials , 2004, Statistics in medicine.

[19]  Marion K Campbell,et al.  The method of minimization for allocation to clinical trials. a review. , 2002, Controlled clinical trials.

[20]  James Frane,et al.  A Method of Biased Coin Randomization, Its Implementation, and Its Validation , 1998 .

[21]  Richard L. Tweedie,et al.  Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.

[22]  Donald B. Rubin,et al.  Affinely Invariant Matching Methods with Ellipsoidal Distributions , 1992 .

[23]  F K Hoehler,et al.  Balancing allocation of subjects in biomedical research: a minimization strategy based on ranks. , 1987, Computers and biomedical research, an international journal.

[24]  Richard L. Smith Properties of Biased Coin Designs in Sequential Clinical Trials , 1984 .

[25]  Richard L. Smith Sequential Treatment Allocation Using Biased Coin Designs , 1984 .

[26]  A. Atkinson Optimum biased coin designs for sequential clinical trials with prognostic factors , 1982 .

[27]  C B Begg,et al.  A treatment allocation procedure for sequential clinical trials. , 1980, Biometrics.

[28]  L. J. Wei,et al.  An Application of an Urn Model to the Design of Sequential Controlled Clinical Trials , 1978 .

[29]  S. Pocock,et al.  Sequential treatment assignment with balancing for prognostic factors in the controlled clinical trial. , 1975, Biometrics.

[30]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[31]  W. G. Cochran,et al.  Controlling Bias in Observational Studies: A Review. , 1974 .

[32]  D R Taves,et al.  Minimization: A new method of assigning patients to treatment and control groups , 1974, Clinical pharmacology and therapeutics.

[33]  W. G. Cochran The Planning of Observational Studies of Human Populations , 1965 .

[34]  Student Comparison Between Balanced and Random Arrangements of Field Plots , 1938 .