Role of electronic health records in comparative effectiveness research.

The gold standard in evaluating treatment effects are randomized controlled trials (RCTs). Their design minimizes bias and maximizes our ability to identify causality. By contrast, observational data, which is routinely collected for other purposes, has many well-known limitations, including selection bias. Why then are we seeing such enthusiasm for ‘big data’ in healthcare, driven in part by the relentless growth in adoption of electronic health records (EHRs)? The first part of the answer is that, despite their many strengths, RCTs also have limitations. First, they do not represent real-world populations or settings. Their stringent exclusion criteria mean that the evidence they produce will not directly replicate the circumstances of many at-risk patients with common comorbidities, who might also benefit from the interventions under trial [1]. Most RCTs are also geographically localized, both in terms of the demographics of the participants as well as in clinical setting. RCTs also often lack the size required to detect the small effect sizes and significant variance encountered in many comparative effectiveness studies, and tend to be too short to detect long-term effects of interventions [2,3]. These limitations are imposed by the need to create controlled conditions, as well as by funding and ethical constraints common to experimental studies. The second motivation for the renewed interest in observational data is the enormous amount of digital data now being collected by clinical institutions, industry and government, and our recent technical capacity to warehouse, link and analyze data in volumes unprecedented a decade ago. Not only are clinical data being accumulated rapidly, they are providing an increasingly detailed record of individual behaviors and journeys. Together, these new attributes of observational data may become a ‘game changer’. The reason we randomize is to deal with the effects of unmodeled variation, and current strategies to deal with such variation in observational studies remain controversial [4]. However, if we had access to health records that included deep phenotypic, genotypic and environmental data, then at some stage we should reach a crossover point where observational data and RCTs are of equivalent value. At that moment we should be able to pull together, through case-matching, a personalized ‘virtual cohort’ of individuals whose collective recorded clinical destiny is at least as predictive of treatment outcomes as any RCT for a given patient. There is ongoing discussion of the relative merits of observational studies and RCTs, and the complementary roles that different forms of evidence play in contributing to the evidence base [3,5–8]. Historically, RCTs were designed to overcome the problems encountered in observational analyses and have therefore been seen as superior to observational studies rather than complementary. However, the rapid growth in EHR data has generated an unprecedented source of information, making it essential that we reassess this artificial wedge separating RCTs and observational studies, and recognize the important complementary roles both must play. “...the recent availability of large electronic health records data sets is challenging us to reconsider the role that observational studies can play in evidence-based medicine.”

[1]  T C Chalmers,et al.  Randomized versus historical controls for clinical trials. , 1982, The American journal of medicine.

[2]  N. Black Why we need observational studies to evaluate the effectiveness of health care , 1996, BMJ.

[3]  T. Shakespeare,et al.  Observational Studies , 2003 .

[4]  U. Abel,et al.  The role of randomization in clinical studies: myths and beliefs. , 1999, Journal of clinical epidemiology.

[5]  J. Concato,et al.  Randomized, controlled trials, observational studies, and the hierarchy of research designs. , 2000, The New England journal of medicine.

[6]  A. Hartz,et al.  A comparison of observational studies and randomized, controlled trials , 2000, American journal of ophthalmology.

[7]  J. Ioannidis,et al.  Completeness of safety reporting in randomized trials: an evaluation of 7 medical areas. , 2001, JAMA.

[8]  R. Stables Observational research in the evidence based environment: eclipsed by the randomised controlled trial? , 2002, Heart.

[9]  M W Gillman,et al.  Interpretation of observational studies , 2004, Heart.

[10]  Jan P Vandenbroucke,et al.  When are observational studies as credible as randomised trials? , 2004, The Lancet.

[11]  B. Thiers,et al.  Eligibility Criteria of Randomized Controlled Trials Published in High-Impact General Medical Journals: A Systematic Sampling Review , 2008 .

[12]  Julie Evans,et al.  Model Formulation: The BRIDG Project: A Technical Report , 2008, J. Am. Medical Informatics Assoc..

[13]  Richard Platt,et al.  Design of a National Distributed Health Data Network , 2009, Annals of Internal Medicine.

[14]  Ben Shneiderman,et al.  Finding comparable temporal categorical records: A similarity measure with an interactive visualization , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[15]  S. Sacks,et al.  Incidence of neuropsychiatric adverse events in influenza patients treated with oseltamivir or no antiviral treatment , 2009, International journal of clinical practice.

[16]  Yuval Shahar,et al.  Intelligent visualization and exploration of time-oriented data of multiple patients , 2010, Artif. Intell. Medicine.

[17]  Sebastian Schneeweiss,et al.  Instrumental variable methods in comparative safety and effectiveness research , 2010, Pharmacoepidemiology and drug safety.

[18]  J. Overhage,et al.  Advancing the Science for Active Surveillance: Rationale and Design for the Observational Medical Outcomes Partnership , 2010, Annals of Internal Medicine.

[19]  J. Frankovich,et al.  Evidence-based medicine in the EMR era. , 2011, The New England journal of medicine.

[20]  Jimeng Sun,et al.  Visual cluster analysis in support of clinical decision intelligence. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[21]  R. Altman,et al.  Detecting Drug Interactions From Adverse‐Event Reports: Interaction Between Paroxetine and Pravastatin Increases Blood Glucose Levels , 2011, Clinical pharmacology and therapeutics.

[22]  Jihoon Kim,et al.  iDASH: integrating data for analysis, anonymization, and sharing , 2012, J. Am. Medical Informatics Assoc..

[23]  Marsha A Raebel,et al.  Design considerations, architecture, and use of the Mini‐Sentinel distributed data system , 2012, Pharmacoepidemiology and drug safety.

[24]  Enrico Coiera,et al.  Learning from Hackers: Open-Source Clinical Trials , 2012, Science Translational Medicine.

[25]  Xiaoqian Jiang,et al.  Privacy Technology to Support Data Sharing for Comparative Effectiveness Research: A Systematic Review , 2013, Medical care.

[26]  J. Ioannidis Mega-trials for blockbusters. , 2013, JAMA.

[27]  George Hripcsak,et al.  Caveats for the use of operational electronic health record data in comparative effectiveness research. , 2013, Medical care.

[28]  J. Goldberger,et al.  Personalized medicine vs guideline-based medicine. , 2013, JAMA.

[29]  Fiona Godlee,et al.  All trials must be registered and the results published , 2013, BMJ.