Why Match in Individually and Cluster Randomized Trials

The decision to match individuals or clusters in randomized trials is motivated by both practical and statistical concerns. Matching protects against chance imbalances in baseline covariate distributions and is thought to improve study credibility. Matching is also implemented to increase study power. This article compares the asymptotic efficiency of the pair-matched design, where units are matched on baseline covariates and the treatment randomized within pairs, to the independent design, where units are randomly paired and the treatment randomized within pairs. We focus on estimating the average treatment effect and use the efficient influence curve to understand the information provided by each design for estimation of this causal parameter. Our theoretical results indicate that the pairmatched design is asymptotically less efficient than its non-matched counterpart. Our simulations confirm these results asymptotically and in finite samples. Our approach is estimator-independent, avoids all parametric modeling assumptions, and applies equally to individually randomized and cluster randomized trials.

[1]  W G COCHRAN,et al.  Matching in analytical studies. , 1953, American journal of public health and the nation's health.

[2]  L. H. Youkeles 184 Note: Loss of Power through Ineffective Pairing of Observations in Small Two-Treatment All-or-None Experiments , 1963 .

[3]  Mathen Kk MATCHING IN COMPARATIVE STUDIES IN PUBLIC HEALTH. , 1963 .

[4]  Matched Samples in Medical Investigations , 1964, British journal of preventive & social medicine.

[5]  Billewicz Wz,et al.  The efficiency of matched samples: an empirical investigation. , 1965 .

[6]  G. Chase,et al.  On the efficiency of matched pairs in Bernoulli trials , 1968 .

[7]  O S Miettinen,et al.  The matched pairs design in the case of all-or-none responses. , 1968, Biometrics.

[8]  O S Miettinen,et al.  Matching and design efficiency in retrospective studies. , 1970, American journal of epidemiology.

[9]  K J Rothman,et al.  Epidemiologic methods in clinical trials , 1977, Cancer.

[10]  S M McKinlay,et al.  Pair-matching-a reappraisal of a popular technique. , 1977, Biometrics.

[11]  H Morgenstern,et al.  Matching in epidemiologic studies: validity and efficiency considerations. , 1981, Biometrics.

[12]  C. Weinberg,et al.  Paired versus two-sample design for a clinical trial of treatments with dichotomous outcome: power considerations. , 1982, Biometrics.

[13]  L. L. Kupper,et al.  In defense of matching. , 1982, American journal of epidemiology.

[14]  J. Salonen,et al.  Effects of antioxidant supplementation on platelet function: A randomized pair-matched placebo-controlled double-blind trial in men with low antioxidant status , 1990 .

[15]  The Glaucoma Laser Trial (GLT): 3. Design and methods. Glaucoma Laser Trial Research Group. , 1991, Controlled clinical trials.

[16]  D. Rubin,et al.  Ignorability and Coarse Data , 1991 .

[17]  J. Robins,et al.  Recovery of Information and Adjustment for Dependent Censoring Using Surrogate Markers , 1992 .

[18]  T D Koepsell,et al.  The effect of matching on the power of randomized community intervention studies. , 1993, Statistics in medicine.

[19]  B. Kirkwood,et al.  Community-based hygiene education to reduce diarrhoeal disease in rural Zaire: impact of the intervention on diarrhoeal morbidity. , 1994, International journal of epidemiology.

[20]  K. Do,et al.  Efficient and Adaptive Estimation for Semiparametric Models. , 1994 .

[21]  P Diehr,et al.  Breaking the matches in a paired t-test for community interventions when the number of pairs is small. , 1995, Statistics in medicine.

[22]  J. Todd,et al.  Impact of improved treatment of sexually transmitted diseases on HIV infection in rural Tanzania: ra , 1995 .

[23]  Niels Keiding,et al.  Coarsening at random in general sample spaces and random censoring in continuous time , 1995 .

[24]  M H Gail,et al.  The efficiency of the matched-pairs design of the Community Intervention Trial for Smoking Cessation (COMMIT). , 1997, Controlled clinical trials.

[25]  James M. Robins,et al.  Coarsening at Random: Characterizations, Conjectures, Counter-Examples , 1997 .

[26]  A Donner,et al.  The merits of matching in community intervention trials: a cautionary tale. , 1997, Statistics in medicine.

[27]  David M. Murray,et al.  Design and Analysis of Group- Randomized Trials , 1998 .

[28]  J. Robins,et al.  Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models , 1999 .

[29]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[30]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[31]  T. Spector,et al.  A Randomized Controlled Trial of Vitamin D Supplementation on Preventing Postmenopausal Bone Loss and Modifying Bone Metabolism Using Identical Twin Pairs , 2000, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[32]  S. Pocock,et al.  Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: current practiceand problems , 2002, Statistics in medicine.

[33]  James M. Robins,et al.  Unified Methods for Censored Longitudinal Data and Causality , 2003 .

[34]  Howard S. Bloom,et al.  The Core Analytics of Randomized Experiments for Social Research. MDRC Working Papers on Research Methodology. , 2006 .

[35]  M. J. van der Laan,et al.  The International Journal of Biostatistics Targeted Maximum Likelihood Learning , 2011 .

[36]  A Donner,et al.  Developments in cluster randomized trials and Statistics in Medicine , 2007, Statistics in medicine.

[37]  Kosuke Imai,et al.  (www.interscience.wiley.com) DOI: 10.1002/sim.3337 Variance identification and efficiency analysis in randomized experiments under the matched-pair design , 2022 .

[38]  Mark J. van der Laan,et al.  Estimation based on case-control designs with known prevalence probability. , 2008 .

[39]  Marie Davidian,et al.  Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates , 2008, Biometrics.

[40]  M. Ruel,et al.  Age-based preventive targeting of food assistance and behaviour change and communication for reduction of childhood undernutrition in Haiti: a cluster randomised trial , 2008, The Lancet.

[41]  M. Davidian,et al.  Covariate adjustment for two‐sample treatment comparisons in randomized clinical trials: A principled yet flexible approach , 2008, Statistics in medicine.

[42]  Gary King,et al.  The essential role of pair matching in cluster-randomized experiments, with application to the Mexican Universal Health Insurance Evaluation , 2009, 0910.3752.

[43]  Richard J. Hayes,et al.  Cluster randomised trials , 2009 .

[44]  M J van der Laan,et al.  Covariate adjustment in randomized trials with binary outcomes: Targeted maximum likelihood estimation , 2009, Statistics in medicine.

[45]  D. Rubin,et al.  Targeted ANCOVA Estimator in RCTs , 2011 .

[46]  M. Rosenblum Robust Analysis of RCTs Using Generalized Linear Models , 2011 .

[47]  M. J. Laan,et al.  Targeted Learning: Causal Inference for Observational and Experimental Data , 2011 .

[48]  Timothy L. Lash,et al.  Comprar Modern Epidemiology | Timothy L. Lash | 9781451190052 | Lippincott Williams & Wilkins , 2012 .

[49]  Kristin E. Porter,et al.  Diagnosing and responding to violations in the positivity assumption , 2012, Statistical methods in medical research.