Combining Real-World and Randomized Control Trial Data Using Data-Adaptive Weighting via the On-Trial Score

Clinical trials with a hybrid control arm (a control arm constructed from a combination of randomized patients and real-world data on patients receiving usual care in standard clinical practice) have the potential to decrease the cost of randomized trials while increasing the proportion of trial patients given access to novel therapeutics. However, due to stringent trial inclusion criteria and differences in care and data quality between trials and community practice, trial patients may have systematically different outcomes compared to their real-world counterparts. We propose a new method for analyses of trials with a hybrid control arm that efficiently controls bias and type I error. Under our proposed approach, selected real-world patients are weighted by a function of the “on-trial score,” which reflects their similarity to trial patients. In contrast to previously developed hybrid control designs that assign the same weight to all real-world patients, our approach upweights of real-world patients who more closely resemble randomized control patients while dissimilar patients are discounted. Estimates of the treatment effect are obtained via Cox proportional hazards models. We compare our approach to existing approaches via simulations and apply these methods to a study using electronic health record data. Our proposed method is able to control type I error, minimize bias, and decrease variance when compared to using only trial data in nearly all scenarios examined. Therefore, our new approach can be used when conducting clinical trials by augmenting the standard-of-care arm with weighted patients from the EHR to increase power without inducing bias. Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA Indigo Ag, Boston, MA, USA Penn Medicine, Philadelphia, PA, USA ∗Co-senior authors Corresponding author: Joanna Harton, Department of Biostatistics, Epidemiology, and Informatics, 423 Guardian Drive, Philadelphia, PA 19104, USA. Email: jograce@pennmedicine.upenn.edu 1 ar X iv :2 10 8. 08 75 6v 1 [ st at .M E ] 1 9 A ug 2 02 1

[1]  Michael Branson,et al.  A note on the power prior , 2009, Statistics in medicine.

[2]  S. Shariat,et al.  Prognostic Value of Lactate Dehydrogenase in Metastatic Prostate Cancer: A Systematic Review and Meta-analysis. , 2019, Clinical genitourinary cancer.

[3]  L. Egevad,et al.  Prognostic value of the Gleason score in prostate cancer , 2002, BJU international.

[4]  Keying Ye,et al.  Normalized Power Prior Bayesian Analysis , 2022, 2204.05615.

[5]  Heng Li,et al.  Propensity score-integrated composite likelihood approach for incorporating real-world evidence in single-arm clinical studies , 2019, Journal of biopharmaceutical statistics.

[6]  Margaret Gamalo-Siebers,et al.  Propensity‐score‐based priors for Bayesian augmented control design , 2018, Pharmaceutical statistics.

[7]  Arturo Molina,et al.  Abiraterone and increased survival in metastatic prostate cancer. , 2011, The New England journal of medicine.

[8]  Lorenzo Trippa,et al.  Design and Evaluation of an External Control Arm Using Prior Clinical Trials and Real-World Data , 2019, Clinical Cancer Research.

[9]  J. Rassen,et al.  Effects of expanding the look‐back period to all available data in the assessment of covariates , 2017, Pharmacoepidemiology and drug safety.

[10]  N. Dreyer,et al.  Real‐world evidence to support regulatory decision‐making for medicines: Considerations for external control arms , 2020, Pharmacoepidemiology and drug safety.

[11]  Frank Bretz,et al.  Beyond Randomized Clinical Trials: Use of External Controls , 2019, Clinical pharmacology and therapeutics.

[12]  S. Griffith,et al.  An evaluation of the impact of missing deaths on overall survival analyses of advanced non–small cell lung cancer patients conducted in an electronic health records database , 2019, Pharmacoepidemiology and drug safety.

[13]  J. Ibrahim,et al.  Power prior distributions for regression models , 2000 .

[14]  S. Ramsey,et al.  Using electronic health record data to identify comparator populations for comparative effectiveness research , 2020, Journal of medical economics.

[15]  K. Tsui,et al.  Prognosis of prostate cancer with initial prostate-specific antigen >1,000 ng/mL at diagnosis , 2017, OncoTargets and therapy.

[16]  S J Pocock,et al.  The combination of randomized and historical controls in clinical trials. , 1976, Journal of chronic diseases.

[17]  Joseph G. Ibrahim,et al.  Power prior distributions for generalized linear models , 2000 .

[18]  G. Scagliotti,et al.  Metastatic castration-resistant prostate cancer: time for innovation. , 2015, Future oncology.

[19]  Rajendra Prasad,et al.  Alkaline Phosphatase: An Overview , 2013, Indian Journal of Clinical Biochemistry.

[20]  H. Hansen,et al.  Performance status assessment in cancer patients. An inter-observer variability study. , 1993, British Journal of Cancer.

[21]  Jerome P. Reiter,et al.  A comparison of two methods of estimating propensity scores after multiple imputation , 2016, Statistical methods in medical research.