Adaptive pre‐specification in randomized trials with and without pair‐matching

In randomized trials, adjustment for measured covariates during the analysis can reduce variance and increase power. To avoid misleading inference, the analysis plan must be pre-specified. However, it is often unclear a priori which baseline covariates (if any) should be adjusted for in the analysis. Consider, for example, the Sustainable East Africa Research in Community Health (SEARCH) trial for HIV prevention and treatment. There are 16 matched pairs of communities and many potential adjustment variables, including region, HIV prevalence, male circumcision coverage, and measures of community-level viral load. In this paper, we propose a rigorous procedure to data-adaptively select the adjustment set, which maximizes the efficiency of the analysis. Specifically, we use cross-validation to select from a pre-specified library the candidate targeted maximum likelihood estimator (TMLE) that minimizes the estimated variance. For further gains in precision, we also propose a collaborative procedure for estimating the known exposure mechanism. Our small sample simulations demonstrate the promise of the methodology to maximize study power, while maintaining nominal confidence interval coverage. We show how our procedure can be tailored to the scientific question (intervention effect for the study sample vs. for the target population) and study design (pair-matched or not). Copyright © 2016 John Wiley & Sons, Ltd.

[1]  Robert M Califf,et al.  Characteristics of clinical trials registered in ClinicalTrials.gov, 2007-2010. , 2012, JAMA.

[2]  S. Pocock,et al.  Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: current practiceand problems , 2002, Statistics in medicine.

[3]  R. Fisher Statistical methods for research workers , 1927, Protoplasma.

[4]  Dylan S. Small,et al.  Randomization Inference in a Group–Randomized Trial of Treatments for Depression , 2008 .

[5]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[6]  M. J. van der Laan,et al.  The International Journal of Biostatistics Targeted Maximum Likelihood Learning , 2011 .

[7]  Laura B. Balzer,et al.  Targeted estimation and inference for the sample average treatment effect in trials with and without pair‐matching , 2016, Statistics in medicine.

[8]  W. G. Cochran Analysis of covariance: Its nature and uses. , 1957 .

[9]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[10]  J. Robins A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect , 1986 .

[11]  M. J. Laan,et al.  Adaptive pair‐matching in randomized trials with unbiased and efficient effect estimation , 2015, Statistics in medicine.

[12]  J. Lewis,et al.  Statistical principles for clinical trials (ICH E9): an introductory note on an international guideline. , 1999, Statistics in medicine.

[13]  Tim P Morris,et al.  The risks and rewards of covariate adjustment in randomized trials: an assessment of 12 outcomes from 8 studies , 2014, Trials.

[14]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.

[15]  Xinyi Xu,et al.  Optimal Nonbipartite Matching and Its Statistical Applications , 2011, The American statistician.

[16]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[17]  Romain Neugebauer,et al.  Robust extraction of covariate information to improve estimation efficiency in randomized trials , 2011, Statistics in medicine.

[18]  Michael Rosenblum,et al.  The International Journal of Biostatistics Simple , Efficient Estimators of Treatment Effects in Randomized Trials Using Generalized Linear Models to Leverage Baseline Variables , 2011 .

[19]  Mark J van der Laan,et al.  Empirical Efficiency Maximization: Improved Locally Efficient Covariate Adjustment in Randomized Experiments and Survival Analysis , 2008, The international journal of biostatistics.

[20]  M. J. Laan,et al.  Targeted Learning: Causal Inference for Observational and Experimental Data , 2011 .

[21]  Jeffrey H Silber,et al.  Optimal multivariate matching before randomization. , 2004, Biostatistics.

[22]  Laura B. Balzer,et al.  ADAPTIVE MATCHING IN RANDOMIZED TRIALS AND OBSERVATIONAL STUDIES. , 2012, Journal of statistical research.

[23]  Lingling Li,et al.  Inverse probability weighting for covariate adjustment in randomized studies , 2014, Statistics in medicine.

[24]  Gary King,et al.  The essential role of pair matching in cluster-randomized experiments, with application to the Mexican Universal Health Insurance Evaluation , 2009, 0910.3752.

[25]  G. Imbens,et al.  Bias-Corrected Matching Estimators for Average Treatment Effects , 2011 .

[26]  Hao Helen Zhang,et al.  Variable selection for covariate‐adjusted semiparametric inference in randomized clinical trials , 2012, Statistics in medicine.

[27]  Dylan S. Small,et al.  Comment: The Essential Role of Pair Matching in Cluster-Randomized Experiments, with Application to the Mexican Universal Health Insurance Evaluation , 2009 .

[28]  Michael Rosenblum,et al.  Targeted Maximum Likelihood Estimation of the Parameter of a Marginal Structural Model , 2010, The international journal of biostatistics.

[29]  M. Traskin,et al.  A Powerful and Robust Test Statistic for Randomization Inference in Group‐Randomized Trials with Matched Pairs of Groups , 2012, Biometrics.

[30]  P. McCullagh,et al.  Some aspects of analysis of covariance. , 1982, Biometrics.

[31]  James M. Robins,et al.  Unified Methods for Censored Longitudinal Data and Causality , 2003 .

[32]  P M Fayers,et al.  Cluster-randomized trials , 2002, Palliative medicine.

[33]  Andrea Manca,et al.  A substantial and confusing variation exists in handling of baseline covariates in randomized controlled trials: a review of trials published in leading medical journals. , 2010, Journal of clinical epidemiology.

[34]  Mark J van der Laan,et al.  The International Journal of Biostatistics A Targeted Maximum Likelihood Estimator of a Causal Effect on a Bounded Continuous Outcome , 2011 .

[35]  M. J. van der Laan,et al.  The International Journal of Biostatistics Collaborative Double Robust Targeted Maximum Likelihood Estimation , 2011 .

[36]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[37]  F. Fuchs,et al.  The effect of antihypertensive agents on sleep apnea: protocol for a randomized controlled trial , 2014, Trials.

[38]  Marie Davidian,et al.  Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates , 2008, Biometrics.

[39]  A Donner,et al.  The merits of matching in community intervention trials: a cautionary tale. , 1997, Statistics in medicine.

[40]  M J van der Laan,et al.  Covariate adjustment in randomized trials with binary outcomes: Targeted maximum likelihood estimation , 2009, Statistics in medicine.

[41]  M H Gail,et al.  The efficiency of the matched-pairs design of the Community Intervention Trial for Smoking Cessation (COMMIT). , 1997, Controlled clinical trials.

[42]  Vinay Prasad,et al.  Characteristics of cluster randomized trials: are they living up to the randomized trial? , 2013, JAMA internal medicine.

[43]  A Donner,et al.  Developments in cluster randomized trials and Statistics in Medicine , 2007, Statistics in medicine.

[44]  Donald B. Rubin,et al.  Comment : Neyman ( 1923 ) and Causal Inference in Experiments and Observational Studies , 2007 .

[45]  M. Davidian,et al.  Covariate adjustment for two‐sample treatment comparisons in randomized clinical trials: A principled yet flexible approach , 2008, Statistics in medicine.

[46]  Michael Rosenblum,et al.  Leveraging prognostic baseline variables to gain precision in randomized trials , 2015, Statistics in medicine.

[47]  T. Speed,et al.  On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9 , 1990 .

[48]  Judea Pearl,et al.  Direct and Indirect Effects , 2001, UAI.

[49]  M. J. Laan,et al.  C-TMLE of an Additive Point Treatment Effect , 2011 .