Use of instrumental variables in electronic health record-driven models

Precision medicine presents various methodological challenges whose assessment requires the consideration of multiple factors. In particular, the data multitude in the Electronic Health Records poses interoperability issues and requires novel inference strategies. A problem, though apparently a paradox, is that highly specific treatments and a variety of outcomes may hardly match with consistent observations (i.e., large samples). Why is it the case? Owing to the heterogeneity of Electronic Health Records, models for the evaluation of treatment effects need to be selected, and in some cases, the use of instrumental variables might be necessary. We studied the recently defined person-centered treatment effects in cancer and C-section contexts from Electronic Health Record sources and identified as an instrument the distance of patients from hospitals. We present first the rationale for using such instrument and then its model implementation. While for cancer patients consideration of distance turns out to be a penalty, implying a negative effect on the probability of receiving surgery, a positive effect is instead found in C-section due to higher propensity of scheduling delivery. Overall, the estimated person-centered treatment effects reveal a high degree of heterogeneity, whose interpretation remains context-dependent. With regard to the use of instruments in light of our two case studies, our suggestion is that this process requires ad hoc variable selection for both covariates and instruments and additional testing to ensure validity.

[1]  Estimating Person-Centered Treatment (Pet) Effects Using Instrumental Variables , 2012, Journal of applied econometrics.

[2]  Andrew M. Jones,et al.  The Roles of Cognitive and Non-Cognitive Skills in Moderating the Effects of Mixed-Ability Schools on Long-Term Health , 2014 .

[3]  Edward Vytlacil,et al.  Local Instrumental Variables , 2000 .

[4]  A. Basu Welfare Implications of Learning Through Solicitation Versus Diversification in Health Care , 2014, Journal of health economics.

[5]  G. Imbens The Role of the Propensity Score in Estimating Dose-Response Functions , 1999 .

[6]  Pietro Lió,et al.  Advances in translational biomedicine from systems approaches , 2014, Front. Genet..

[7]  J J Heckman,et al.  Local instrumental variables and latent variable models for identifying and bounding treatment effects. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[8]  M Alan Brookhart,et al.  Instrumental variables I: instrumental variables exploit natural variation in nonexperimental data to estimate causal relationships. , 2009, Journal of clinical epidemiology.

[9]  J. Tobias,et al.  Predicting ambient ultraviolet from routine meteorological data; its potential use as an instrumental variable for vitamin D status in pregnancy in a longitudinal birth cohort in the UK. , 2009, International journal of epidemiology.

[10]  Sandra E. Black,et al.  From the Cradle to the Labor Market? The Effect of Birth Weight on Adult Outcomes , 2005, SSRN Electronic Journal.

[11]  F. Windmeijer,et al.  A weak instrument F-test in linear IV models with multiple endogenous variables☆ , 2016, Journal of econometrics.

[12]  S. Reed,et al.  Method of delivery and neonatal outcome in very low-birthweight vertex-presenting fetuses. , 2008, American journal of obstetrics and gynecology.

[13]  Joshua C. Denny,et al.  Chapter 13: Mining Electronic Health Records in the Genomics Era , 2012, PLoS Comput. Biol..

[14]  M. Lechner Program Heterogeneity and Propensity Score Matching: An Application to the Evaluation of Active Labor Market Policies , 2002, Review of Economics and Statistics.

[15]  J. Sargan THE ESTIMATION OF ECONOMIC RELATIONSHIPS USING INSTRUMENTAL VARIABLES , 1958 .

[16]  B. Hamilton,et al.  Recent trends in cesarean delivery in the United States. , 2010, NCHS data brief.

[17]  J. Gore,et al.  Are Elderly Patients With Clinically Localized Prostate Cancer Overtreated? Exploring Heterogeneity in Survival Effects , 2015, Medical care.

[18]  Chunhua Weng,et al.  Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research , 2013, J. Am. Medical Informatics Assoc..

[19]  L. Hood,et al.  Systems medicine: the future of medical genomics and healthcare , 2009, Genome Medicine.

[20]  C. Mackenzie,et al.  A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. , 1987, Journal of chronic diseases.

[21]  Diane P. Martin,et al.  Association between method of delivery and maternal rehospitalization. , 2000, JAMA.

[22]  J. Denny,et al.  Extracting research-quality phenotypes from electronic health records to support precision medicine , 2015, Genome Medicine.

[23]  J. Pathak,et al.  Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[24]  J. Appleby,et al.  Application of patient-reported outcome measures (PROMs) data to estimate cost-effectiveness of hernia surgery in England , 2013, Journal of the Royal Society of Medicine.

[25]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[26]  W. Chung,et al.  Defining a comprehensive verotype using electronic health records for personalized medicine. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[27]  J. Boardman,et al.  Low birth weight, social factors, and developmental outcomes among children in the United States , 2002, Demography.

[28]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[29]  J. Hausman Specification tests in econometrics , 1978 .

[30]  Anirban Basu,et al.  Person-centered Treatment (PeT) Effects: Individualized Treatment Effects Using Instrumental Variables , 2015 .

[31]  C. Auffray,et al.  Systems medicine approaches for the definition of complex phenotypes in chronic diseases and ageing. From concept to implementation and policies. , 2014, Current pharmaceutical design.

[32]  Frank Windmeijer,et al.  A weak instrument F-test in linear IV models with multiple endogenous variables☆ , 2013, Journal of Econometrics.

[33]  George Hripcsak,et al.  Next-generation phenotyping of electronic health records , 2012, J. Am. Medical Informatics Assoc..