Methods for Population Adjustment with Limited Access to Individual Patient Data: A Simulation Study

Population-adjusted indirect comparisons are used to estimate treatment effects when access to individual patient data is limited and there are cross-trial differences in effect modifiers. Increasingly, health technology assessment agencies are accepting evaluations that use these methods across a diverse range of therapeutic areas. Popular methods include matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC). Despite this increasing popularity, there is limited formal evaluation of these methods and whether they can be used to accurately compare treatments. Thus, we undertake a comprehensive simulation study to compare standard unadjusted indirect comparisons, MAIC and STC across 162 scenarios. This simulation study assumes that the trials are investigating survival outcomes and measure continuous covariates, with the log hazard ratio as the measure of effect -- one of the most widely used setups in health technology assessment applications. The simulation scenarios vary the trial sample size, prognostic variable effects, interaction effects, covariate correlations and covariate overlap. Generally, MAIC yields unbiased treatment effect estimates, while STC is often biased because the measure of effect is non-collapsible. Standard indirect comparisons are systematically biased, particularly under stronger covariate imbalance and interaction effects. Standard errors and coverage rates are often valid in MAIC but underestimate variability in certain situations. Interval estimates for the standard indirect comparison are too narrow and STC suffers from bias-induced undercoverage. MAIC provides the most accurate estimates and, with lower degrees of covariate overlap, its bias reduction outweighs the loss in effective sample size and precision.

[1]  Andrea Manca,et al.  NICE DSU TECHNICAL SUPPORT DOCUMENT 17: THE USE OF OBSERVATIONAL DATA TO INFORM ESTIMATES OF TREATMENT EFFECTIVENESS IN TECHNOLOGY APPRAISAL: METHODS FOR COMPARATIVE INDIVIDUAL PATIENT DATA , 2015 .

[2]  Areti Angeliki Veroniki,et al.  A scoping review of indirect comparison methods and applications using individual patient data , 2016, BMC Medical Research Methodology.

[3]  Christine Fletcher,et al.  A review of methods for comparing treatments evaluated in studies that form disconnected networks of evidence , 2018, Research synthesis methods.

[4]  Gillian Bartlett,et al.  Bias due to aggregation of individual covariates in the Cox regression model. , 2003, American journal of epidemiology.

[5]  Ralf Bender,et al.  Generating survival times to simulate Cox proportional hazards models , 2005, Statistics in medicine.

[6]  B. Giraudeau,et al.  Propensity score methods for estimating relative risks in cluster randomized trials with low‐incidence binary outcomes and selection bias , 2014, Statistics in medicine.

[7]  J. Neyman On the Two Different Aspects of the Representative Method: the Method of Stratified Sampling and the Method of Purposive Selection , 1934 .

[8]  C. Walsh,et al.  Assessing the impact of a matching‐adjusted indirect comparison in a Bayesian network meta‐analysis , 2019, Research synthesis methods.

[9]  Jens Hainmueller,et al.  Entropy Balancing for Causal Effects: A Multivariate Reweighting Method to Produce Balanced Samples in Observational Studies , 2012, Political Analysis.

[10]  D G Altman,et al.  Indirect comparisons of competing interventions. , 2005, Health technology assessment.

[11]  Zhiwei Zhang,et al.  Covariate-Adjusted Putative Placebo Analysis in Active-Controlled Clinical Trials , 2009 .

[12]  Bill Ravens,et al.  An Introduction to Copulas , 2000, Technometrics.

[13]  David M. Phillippo,et al.  Population Adjustment Methods for Indirect Comparisons: A Review of National Institute for Health and Care Excellence Technology Appraisals , 2019, International Journal of Technology Assessment in Health Care.

[14]  Nicky J Welton,et al.  NICE DSU Technical Support Document 18: Methods for population-adjusted indirect comparisons in submissions to NICE , 2016 .

[15]  O. Miettinen,et al.  Confounding: essence and detection. , 1981, American journal of epidemiology.

[16]  Richard P. Brent,et al.  An Algorithm with Guaranteed Convergence for Finding a Zero of a Function , 1971, Comput. J..

[17]  Alan Brnabic,et al.  Inclusion Of Multiple Studies In Matching Adjusted Indirect Comparisons (Maic) , 2015 .

[19]  P. Rosenbaum Model-Based Direct Adjustment , 1987 .

[20]  Irina Proskorovsky,et al.  Simulation and Matching-Based Approaches for Indirect Comparison of Treatments , 2015, PharmacoEconomics.

[21]  J. Cappelleri,et al.  Indirect Treatment Comparison of Inotuzumab Ozogamicin Versus Blinatumomab for Relapsed or Refractory Acute Lymphoblastic Leukemia , 2019, Advances in Therapy.

[22]  D. Ayers,et al.  Comparative effectiveness of nivolumab versus clinical practice for advanced gastric or gastroesophageal junction cancer. , 2019, Journal of comparative effectiveness research.

[23]  N. Freemantle,et al.  The Effects of Model Misspecification in Unanchored Matching-Adjusted Indirect Comparison: Results of a Simulation Study. , 2020, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[24]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[25]  Mei Lu,et al.  Matching-adjusted indirect comparisons: a new tool for timely comparative effectiveness research. , 2012, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[26]  Jennifer Hill,et al.  Assessing Methods for Generalizing Experimental Impact Estimates to Target Populations , 2016, Journal of research on educational effectiveness.

[27]  J. Rahnenführer,et al.  Evaluation of Adjusted and Unadjusted Indirect Comparison Methods in Benefit Assessment , 2017, Methods of Information in Medicine.

[28]  Harold I Feldman,et al.  Model Selection, Confounder Control, and Marginal Structural Models , 2004 .

[29]  L. Hedges,et al.  Introduction to Meta‐Analysis , 2009, International Coaching Psychology Review.

[30]  M. Sculpher,et al.  Probabilistic sensitivity analysis for NICE technology assessment: not an optional extra. , 2005, Health economics.

[31]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[32]  Michael J Crowther,et al.  Using simulation studies to evaluate statistical methods , 2017, Statistics in medicine.

[33]  K. Jack Ishak,et al.  No Head-to-Head Trial? Simulate the Missing Arms , 2012, PharmacoEconomics.

[34]  Nicky J Welton,et al.  Network Meta-Analysis for Decision-Making , 2018 .

[35]  F. Dominici,et al.  On quantifying the magnitude of confounding. , 2010, Biostatistics.

[36]  L. Howie,et al.  A 25-Year Experience of US Food and Drug Administration Accelerated Approval of Malignant Hematology and Oncology Drugs and Biologics: A Review , 2018, JAMA oncology.

[37]  Gerta Rücker,et al.  Presenting simulation results in a nested loop plot , 2014, BMC Medical Research Methodology.

[38]  Mei Lu,et al.  Comparative effectiveness research using matching‐adjusted indirect comparison: an application to treatment with guanfacine extended release or atomoxetine in children with attention‐deficit/hyperactivity disorder and comorbid oppositional defiant disorder , 2012, Pharmacoepidemiology and drug safety.

[39]  Thomas Lumley,et al.  Analysis of Complex Survey Samples , 2004 .

[40]  Gianluca Baio,et al.  Dirichlet process mixture models for regression discontinuity designs , 2020, Statistical methods in medical research.

[41]  H. Thom,et al.  Matching Adjusted Indirect Comparisons To Assess Comparative Effectiveness Of Therapies: Usage In Scientific Literature And Health Technology Appraisals , 2016 .

[42]  Kurt Hornik,et al.  On the generation of correlated artificial binary data , 1998 .

[43]  A. Briggs,et al.  Overall survival of glasdegib in combination with low-dose cytarabine, azacitidine, and decitabine among adult patients with previously untreated AML: comparative effectiveness using simulated treatment comparisons , 2019, ClinicoEconomics and outcomes research : CEOR.

[44]  J F Tierney,et al.  A critical review of methods for the assessment of patient-level interactions in individual participant data meta-analysis of randomized trials, and guidance for practitioners. , 2011, Journal of clinical epidemiology.

[45]  G. Imbens Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review , 2004 .

[46]  L. Stewart,et al.  To IPD or not to IPD? , 2002, Evaluation & the health professions.

[47]  Keith Abrams,et al.  Use of Indirect and Mixed Treatment Comparisons for Technology Assessment , 2012, PharmacoEconomics.

[48]  Alex J. Sutton,et al.  Evidence Synthesis for Decision Making 2: A Generalized Linear Modeling Framework for Pairwise and Network Meta-analysis of Randomized Controlled Trials. , 2013 .

[49]  Mei Lu,et al.  Comparative Efficacy of Guanfacine Extended Release Versus Atomoxetine for the Treatment of Attention-Deficit/Hyperactivity Disorder in Children and Adolescents: Applying Matching-Adjusted Indirect Comparison Methodology , 2013, CNS Drugs.

[50]  David M. Phillippo,et al.  Equivalence of entropy balancing and the method of moments for matching‐adjusted indirect comparison , 2020, Research synthesis methods.

[51]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: A General Method for Estimating Sampling Variances for Standard Estimators for Average Causal Effects , 2015 .

[52]  Eric Q. Wu,et al.  Comparative Effectiveness Without Head-to-Head Trials , 2012, PharmacoEconomics.

[53]  Alan Brnabic,et al.  Alternative Weighting Approaches for Anchored Matching-Adjusted Indirect Comparisons via a Common Comparator. , 2019, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[54]  Gianluca Baio,et al.  Probabilistic sensitivity analysis in health economics , 2015, Statistical methods in medical research.

[55]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[56]  G. Imbens,et al.  Estimation of Causal Effects using Propensity Score Weighting: An Application to Data on Right Heart Catheterization , 2001, Health Services and Outcomes Research Methodology.

[57]  Mike Clarke,et al.  Individual Participant Data (IPD) Meta-analyses of Randomised Controlled Trials: Guidance on Their Use , 2015, PLoS medicine.

[58]  T. Lanitis,et al.  Simulated Treatment Comparison of Time-To-Event (And Other Non-Linear) Outcomes. , 2015, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[59]  Patrick Royston,et al.  The design of simulation studies in medical statistics , 2006, Statistics in medicine.

[60]  P. Austin An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies , 2011, Multivariate behavioral research.

[61]  Roland R. Ramsahai,et al.  Evaluating treatment effectiveness in patient subgroups: a comparison of propensity score methods with an automated matching approach , 2012, The international journal of biostatistics.

[62]  Sofia Dias,et al.  Methods for Population-Adjusted Indirect Comparisons in Health Technology Appraisal , 2017, Medical decision making : an international journal of the Society for Medical Decision Making.

[63]  D. Mark,et al.  Clinical prediction models: are we building better mousetraps? , 2003, Journal of the American College of Cardiology.

[64]  M. Gail,et al.  Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates , 1984 .

[65]  Gianluca Baio,et al.  Regulatory approval of pharmaceuticals without a randomised controlled study: analysis of EMA and FDA approvals 1999–2014 , 2016, BMJ Open.

[66]  Kenneth Stanley,et al.  Design of Randomized Controlled Trials , 2007, Circulation.

[67]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[68]  J. Carpenter,et al.  Meta-analytical methods to identify who benefits most from treatments: daft, deluded, or deft approach? , 2017, British Medical Journal.

[69]  S. Cole,et al.  Generalizing evidence from randomized clinical trials to target populations: The ACTG 320 trial. , 2010, American journal of epidemiology.

[70]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[71]  P. Grambsch,et al.  The Cox Model , 2000 .

[72]  J. Kalbfleisch,et al.  A Comparison of Cluster-Specific and Population-Averaged Approaches for Analyzing Correlated Binary Data , 1991 .

[73]  S. T. Buckland,et al.  An Introduction to the Bootstrap. , 1994 .

[74]  S D Walter,et al.  The results of direct and indirect treatment comparisons in meta-analysis of randomized controlled trials. , 1997, Journal of clinical epidemiology.

[75]  Brian K. Lee,et al.  Weight Trimming and Propensity Score Weighting , 2011, PloS one.

[76]  Douglas G Altman,et al.  Validity of indirect comparison for estimating efficacy of competing interventions: empirical evidence from published meta-analyses , 2003, BMJ : British Medical Journal.

[77]  Alex J. Sutton,et al.  Evidence Synthesis for Decision Making 2 , 2013, Medical decision making : an international journal of the Society for Medical Decision Making.