A Calibration Approach to Transportability with Observational Data

An important consideration in clinical research studies is proper evaluation of internal and external validity. While randomized clinical trials can overcome several threats to internal validity, they may be prone to poor external validity. Conversely, large prospective observational studies sampled from a broadly generalizable population may be externally valid, yet susceptible to threats to internal validity, particularly confounding. Thus, methods that address confounding and enhance transportability of study results across populations are essential for internally and externally valid causal inference, respectively. We develop a weighting method which estimates the effect of an intervention on an outcome in an observational study which can then be transported to a second, possibly unrelated target population. The proposed methodology employs calibration estimators to generate complementary balancing and sampling weights to address confounding and transportability, respectively, enabling valid estimation of the target population average treatment effect. A simulation study is conducted to demonstrate the advantages and similarities of the calibration approach against alternative techniques. We also test the performance of the calibration estimator-based inference in a motivating real data example comparing whether the effect of biguanides versus sulfonylureas - the two most common oral diabetes medication classes for initial treatment - on all-cause mortality described in a historical cohort applies to a contemporary cohort of US Veterans with diabetes.

[1]  T. Brennan,et al.  Patterns of medication initiation in newly diagnosed diabetes mellitus: quality and cost implications. , 2012, The American journal of medicine.

[2]  Vilmundur Gudnason,et al.  Diabetes Mellitus, Fasting Glucose, and Risk of Cause-Specific Death , 2011 .

[3]  Elias Bareinboim,et al.  Causal inference and the data-fusion problem , 2016, Proceedings of the National Academy of Sciences.

[4]  Lawrence A Leiter,et al.  Semaglutide and Cardiovascular Outcomes in Patients with Type 2 Diabetes. , 2016, The New England journal of medicine.

[5]  S. Lai,et al.  Effects of Metformin Versus Glipizide on Cardiovascular Outcomes in Patients With Type 2 Diabetes and Coronary Artery Disease , 2013, Diabetes Care.

[6]  Robert A Greevy,et al.  Comparative Effectiveness of Sulfonylurea and Metformin Monotherapy on Cardiovascular Events in Type 2 Diabetes Mellitus , 2012, Annals of Internal Medicine.

[7]  Xiaofei Wang,et al.  Integrative analysis of randomized clinical trials with real world evidence studies , 2020 .

[8]  D. Wysowski,et al.  Use of Antidiabetic Drugs in the U.S., 2003–2012 , 2014, Diabetes Care.

[9]  K. C. G. Chan,et al.  Globally efficient non‐parametric inference of average treatment effects by empirical balancing calibration weighting , 2016, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[10]  Michael G Hudgens,et al.  Generalizing Study Results: A Potential Outcomes Perspective. , 2017, Epidemiology.

[11]  Andrzej Stachurski,et al.  Parallel Optimization: Theory, Algorithms and Applications , 2000, Parallel Distributed Comput. Pract..

[12]  L. Geiss,et al.  Trends in cause-specific mortality among adults with and without diagnosed diabetes in the USA: an epidemiological analysis of linked national survey and vital statistics data , 2018, The Lancet.

[13]  Sherri Rose,et al.  Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies , 2017, American journal of epidemiology.

[14]  Elizabeth Selvin,et al.  Trends in Prevalence and Control of Diabetes in the United States, 1988–1994 and 1999–2010Trends in Prevalence and Control of Diabetes in the United States , 2014 .

[15]  Mark J van der Laan,et al.  Robust estimation of encouragement design intervention effects transported across sites , 2017, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[16]  Daniel O. Scharfstein,et al.  Causal Inference for Comprehensive Cohort Studies. , 2019, 1910.03531.

[17]  Elias Bareinboim,et al.  External Validity: From Do-Calculus to Transportability Across Populations , 2014, Probabilistic and Causal Inference.

[18]  M. Fischereder,et al.  Empagliflozin, Cardiovascular Outcomes, and Mortality in Type 2 Diabetes. , 2016, The New England journal of medicine.

[19]  L. Geiss,et al.  Prevalence and incidence trends for diagnosed diabetes among adults aged 20 to 79 years, United States, 1980-2012. , 2014, JAMA.

[20]  T. Brennan,et al.  Initial choice of oral glucose-lowering medication for diabetes mellitus: a patient-centered comparative effectiveness study. , 2014, JAMA internal medicine.

[21]  S. Cole,et al.  Generalizing evidence from randomized clinical trials to target populations: The ACTG 320 trial. , 2010, American journal of epidemiology.

[22]  S. Geman,et al.  Nonparametric Maximum Likelihood Estimation by the Method of Sieves , 1982 .

[23]  A. Vaag,et al.  Mortality and cardiovascular risk associated with different insulin secretagogues compared with metformin in type 2 diabetes, with or without a previous myocardial infarction: a nationwide study. , 2011, European heart journal.

[24]  D. Rubin For objective causal inference, design trumps analysis , 2008, 0811.1640.

[25]  S. Majumdar,et al.  Decreased mortality associated with the use of metformin compared with sulfonylurea monotherapy in type 2 diabetes. , 2002, Diabetes care.

[26]  R. Guthrie Canagliflozin and cardiovascular and renal events in type 2 diabetes , 2018, Postgraduate medicine.

[27]  F. Porzsolt,et al.  Using machine learning to predict laboratory test results , 2016, Annals of clinical biochemistry.

[28]  John B Buse,et al.  Liraglutide and Cardiovascular Outcomes in Type 2 Diabetes. , 2016, The New England journal of medicine.

[29]  Stephen R Cole,et al.  Transportability of Trial Results Using Inverse Odds of Sampling Weights. , 2017, American journal of epidemiology.

[30]  Neha J. Pagidipati,et al.  Effects of Once‐Weekly Exenatide on Cardiovascular Outcomes in Type 2 Diabetes , 2017, The New England journal of medicine.

[31]  Joseph Kang,et al.  Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data , 2007, 0804.2958.

[32]  S. Saydah,et al.  Trends and Disparities in Cardiovascular Mortality Among U.S. Adults With and Without Self-Reported Diabetes, 1988–2015 , 2018, Diabetes Care.

[33]  Desmond E. Williams,et al.  Changes in diabetes-related complications in the United States, 1990-2010. , 2014, The New England journal of medicine.

[34]  L. Stefanski,et al.  The Calculus of M-Estimation , 2002 .

[35]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[36]  Kevin P. Josey,et al.  A framework for covariate balance using Bregman distances , 2019, Scandinavian Journal of Statistics.

[37]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[38]  Maria Mori Brooks,et al.  A randomized trial of therapies for type 2 diabetes and coronary artery disease. , 2009, The New England journal of medicine.

[39]  Jianqing Fan,et al.  Improving Covariate Balancing Propensity Score : A Doubly Robust and Efficient Approach ∗ , 2016 .

[40]  Desmond E. Williams,et al.  Changes in diabetes-related complications in the United States. , 2014, The New England journal of medicine.

[41]  E. Boyko,et al.  Mortality among veterans with type 2 diabetes initiating metformin, sulfonylurea or rosiglitazone monotherapy , 2013, Diabetologia.

[42]  J. Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study , 2004, Statistics in medicine.

[43]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[44]  J. Gross,et al.  The Association between Sulfonylurea Use and All-Cause and Cardiovascular Mortality: A Meta-Analysis with Trial Sequential Analysis of Randomized Clinical Trials , 2016, PLoS medicine.

[45]  J. Saaddine,et al.  Achievement of goals in U.S. diabetes care, 1999-2010. , 2013, The New England journal of medicine.

[47]  M. J. van der Laan,et al.  The International Journal of Biostatistics Targeted Maximum Likelihood Learning , 2011 .

[48]  C. Särndal,et al.  Calibration Estimators in Survey Sampling , 1992 .

[49]  P. Wilson,et al.  Diabetes Mellitus–Related All‐Cause and Cardiovascular Mortality in a National Cohort of Adults , 2019, Journal of the American Heart Association.

[50]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[51]  D. Wysowski,et al.  Use of Antidiabetic Drugs in the , 2014 .

[52]  S. Suissa,et al.  Sulfonylureas and the Risks of Cardiovascular Events and Death: A Methodological Meta-Regression Analysis of the Observational Studies , 2017, Diabetes Care.

[53]  A. Tsiatis Semiparametric Theory and Missing Data , 2006 .

[54]  Eric Q. Wu,et al.  Comparative Effectiveness Without Head-to-Head Trials , 2012, PharmacoEconomics.