Adaptive pair‐matching in randomized trials with unbiased and efficient effect estimation

In randomized trials, pair-matching is an intuitive design strategy to protect study validity and to potentially increase study power. In a common design, candidate units are identified, and their baseline characteristics used to create the best n/2 matched pairs. Within the resulting pairs, the intervention is randomized, and the outcomes measured at the end of follow-up. We consider this design to be adaptive, because the construction of the matched pairs depends on the baseline covariates of all candidate units. As a consequence, the observed data cannot be considered as n/2 independent, identically distributed pairs of units, as common practice assumes. Instead, the observed data consist of n dependent units. This paper explores the consequences of adaptive pair-matching in randomized trials for estimation of the average treatment effect, conditional the baseline covariates of the n study units. By avoiding estimation of the covariate distribution, estimators of this conditional effect will often be more precise than estimators of the marginal effect. We contrast the unadjusted estimator with targeted minimum loss based estimation and show substantial efficiency gains from matching and further gains with adjustment. This work is motivated by the Sustainable East Africa Research in Community Health study, an ongoing community randomized trial to evaluate the impact of immediate and streamlined antiretroviral therapy on HIV incidence in rural East Africa.

[1]  Mathen Kk MATCHING IN COMPARATIVE STUDIES IN PUBLIC HEALTH. , 1963 .

[2]  Comment: The Essential Role of Pair Matching , 2009, 0910.3754.

[3]  Billewicz Wz,et al.  The efficiency of matched samples: an empirical investigation. , 1965 .

[4]  M. Petersen,et al.  Assessment of population-based HIV RNA levels in a rural east African setting using a fingerprick-based blood collection method. , 2013, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[5]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[6]  Lei Wang,et al.  Prevention of HIV-1 infection with early antiretroviral therapy. , 2011, The New England journal of medicine.

[7]  AIDS Treatment and Intrahousehold Resource Allocations: Children's Nutrition and Schooling in Kenya , 2007, Journal of public economics.

[8]  A. Hung,et al.  Reweighted Mahalanobis distance matching for cluster‐randomized trials with missing data , 2012, Pharmacoepidemiology and drug safety.

[9]  Douglas E. Faries,et al.  A Case Study of an Adaptive Clinical Trial in the Treatment of Out-Patients with Depressive Disorder , 1994 .

[10]  M. J. van der Laan,et al.  The International Journal of Biostatistics Targeted Maximum Likelihood Learning , 2011 .

[11]  Mark J van der Laan,et al.  Empirical Efficiency Maximization: Improved Locally Efficient Covariate Adjustment in Randomized Experiments and Survival Analysis , 2008, The international journal of biostatistics.

[12]  M. J. Laan,et al.  Targeted Learning: Causal Inference for Observational and Experimental Data , 2011 .

[13]  M. Lampe,et al.  Perinatal HIV and its prevention: progress toward an HIV-free generation. , 2010, Clinics in perinatology.

[14]  Laura B. Balzer,et al.  ADAPTIVE MATCHING IN RANDOMIZED TRIALS AND OBSERVATIONAL STUDIES. , 2012, Journal of statistical research.

[15]  W G COCHRAN,et al.  Matching in analytical studies. , 1953, American journal of public health and the nation's health.

[16]  Mitchell H. Gail,et al.  Community Intervention Trial for Smoking Cessation (COMMIT): I. cohort results from a four-year community intervention. , 1995, American journal of public health.

[17]  G. Imbens,et al.  Bias-Corrected Matching Estimators for Average Treatment Effects , 2011 .

[18]  M H Gail,et al.  The efficiency of the matched-pairs design of the Community Intervention Trial for Smoking Cessation (COMMIT). , 1997, Controlled clinical trials.

[19]  C. Flateau,et al.  Consequences of HIV infection on malaria and therapeutic implications: a systematic review. , 2011, The Lancet. Infectious diseases.

[20]  Markus Goldstein,et al.  The Economic Impact of AIDS Treatment , 2005, The Journal of Human Resources.

[21]  Anthony D Harries,et al.  Reducing tuberculosis-associated early mortality in antiretroviral treatment programmes in sub-Saharan Africa. , 2011, AIDS.

[22]  Mark J van der Laan,et al.  The International Journal of Biostatistics A Targeted Maximum Likelihood Estimator of a Causal Effect on a Bounded Continuous Outcome , 2011 .

[23]  Guido W. Imbens,et al.  Estimation of the Conditional Variance in Paired Experiments , 2008 .

[24]  Michael Rosenblum,et al.  The International Journal of Biostatistics Simple , Efficient Estimators of Treatment Effects in Randomized Trials Using Generalized Linear Models to Leverage Baseline Variables , 2011 .

[25]  T D Koepsell,et al.  The effect of matching on the power of randomized community intervention studies. , 1993, Statistics in medicine.

[26]  L. Myer,et al.  Antiretroviral Therapy and TB Notification Rates in a High HIV Prevalence South African Community , 2011, Journal of acquired immune deficiency syndromes.

[27]  L. Watson,et al.  Mounting a community-randomized trial: sample size, matching, selection, and randomization issues in PRISM. , 2004, Controlled clinical trials.

[28]  Mark J. van der Laan,et al.  Why Match in Individually and Cluster Randomized Trials , 2012 .

[29]  Jeffrey H Silber,et al.  Optimal multivariate matching before randomization. , 2004, Biostatistics.

[30]  M. Traskin,et al.  A Powerful and Robust Test Statistic for Randomization Inference in Group‐Randomized Trials with Matched Pairs of Groups , 2012, Biometrics.

[31]  Dylan S. Small,et al.  Comment: The Essential Role of Pair Matching in Cluster-Randomized Experiments, with Application to the Mexican Universal Health Insurance Evaluation , 2009 .

[32]  Matched Samples in Medical Investigations , 1964, British journal of preventive & social medicine.

[33]  C. Weinberg,et al.  Paired versus two-sample design for a clinical trial of treatments with dichotomous outcome: power considerations. , 1982, Biometrics.

[34]  L. J. Wei,et al.  Statistical Inference with Data-Dependent Treatment Allocation Rules , 1990 .

[35]  Michael Rosenblum,et al.  Targeted Maximum Likelihood Estimation of the Parameter of a Marginal Structural Model , 2010, The international journal of biostatistics.

[36]  David M. Murray,et al.  Design and Analysis of Group- Randomized Trials , 1998 .

[37]  A Donner,et al.  Developments in cluster randomized trials and Statistics in Medicine , 2007, Statistics in medicine.

[38]  O S Miettinen,et al.  The matched pairs design in the case of all-or-none responses. , 1968, Biometrics.

[39]  Sander Greenland,et al.  Modern Epidemiology 3rd edition , 1986 .

[40]  Kosuke Imai,et al.  (www.interscience.wiley.com) DOI: 10.1002/sim.3337 Variance identification and efficiency analysis in randomized experiments under the matched-pair design , 2022 .

[41]  Timothy L. Lash,et al.  Comprar Modern Epidemiology | Timothy L. Lash | 9781451190052 | Lippincott Williams & Wilkins , 2012 .

[42]  Susan A. Murphy,et al.  A Conceptual Framework for Adaptive Preventive Interventions , 2004, Prevention Science.

[43]  Mark J. van der Laan,et al.  tmle : An R Package for Targeted Maximum Likelihood Estimation , 2012 .

[44]  J. Montaner,et al.  ART in Prevention of HIV and TB: Update on Current Research Efforts , 2011 .

[45]  Gary King,et al.  The essential role of pair matching in cluster-randomized experiments, with application to the Mexican Universal Health Insurance Evaluation , 2009, 0910.3752.

[46]  P Diehr,et al.  Breaking the matches in a paired t-test for community interventions when the number of pairs is small. , 1995, Statistics in medicine.

[47]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[48]  R. Atun,et al.  Prevention of mother-to-child transmission of HIV and the health-related Millennium Development Goals: time for a public health approach , 2011, The Lancet.

[49]  Dylan S. Small,et al.  Randomization Inference in a Group–Randomized Trial of Treatments for Depression , 2008 .

[50]  W. Billewicz The efficiency of matched samples: an empirical investigation. , 1965, Biometrics.

[51]  Xinyi Xu,et al.  Optimal Nonbipartite Matching and Its Statistical Applications , 2011, The American statistician.

[52]  Michael Rayment,et al.  Prevention of HIV-1 infection with early antiretroviral therapy , 2012, Journal of Family Planning and Reproductive Health Care.

[53]  M J van der Laan,et al.  Covariate adjustment in randomized trials with binary outcomes: Targeted maximum likelihood estimation , 2009, Statistics in medicine.

[54]  Gary King,et al.  A Politically Robust Experimental Design for Public Policy Evaluation, With Application to the Mexican Universal Health Insurance Program , 2007, Journal of policy analysis and management : [the journal of the Association for Public Policy Analysis and Management].

[55]  M. J. van der Laan,et al.  Targeting the Optimal Design in Randomized Clinical Trials with Binary Outcomes and No Covariate: Simulation Study , 2011 .

[56]  Mark J van der Laan,et al.  Super Learning: An Application to the Prediction of HIV-1 Drug Resistance , 2007, Statistical applications in genetics and molecular biology.

[57]  S. Thompson The merits of matching in community intervention trials: a cautionary tale. , 1998, Statistics in medicine.

[58]  L. H. Youkeles 184 Note: Loss of Power through Ineffective Pairing of Observations in Small Two-Treatment All-or-None Experiments , 1963 .

[59]  S M McKinlay,et al.  Pair-matching-a reappraisal of a popular technique. , 1977, Biometrics.

[60]  Aad van der Vaart,et al.  Estimating a survival distribution with current status data and high-dimensional covariates , 2006 .

[61]  J. Todd,et al.  Impact of improved treatment of sexually transmitted diseases on HIV infection in rural Tanzania: ra , 1995 .

[62]  Maya L. Petersen,et al.  Longitudinal Targeted Maximum Likelihood Estimation , 2015 .

[63]  Mark J. van der Laan,et al.  Causal Inference for a Population of Causally Connected Units , 2014 .

[64]  Richard J. Hayes,et al.  Cluster randomised trials , 2009 .