Electronic health records based phenotyping in next-generation clinical trials: a perspective from the NIH Health Care Systems Collaboratory.

Widespread sharing of data from electronic health records and patient-reported outcomes can strengthen the national capacity for conducting cost-effective clinical trials and allow research to be embedded within routine care delivery. While pragmatic clinical trials (PCTs) have been performed for decades, they now can draw on rich sources of clinical and operational data that are continuously fed back to inform research and practice. The Health Care Systems Collaboratory program, initiated by the NIH Common Fund in 2012, engages healthcare systems as partners in discussing and promoting activities, tools, and strategies for supporting active participation in PCTs. The NIH Collaboratory consists of seven demonstration projects, and seven problem-specific working group 'Cores', aimed at leveraging the data captured in heterogeneous 'real-world' environments for research, thereby improving the efficiency, relevance, and generalizability of trials. Here, we introduce the Collaboratory, focusing on its Phenotype, Data Standards, and Data Quality Core, and present early observations from researchers implementing PCTs within large healthcare systems. We also identify gaps in knowledge and present an informatics research agenda that includes identifying methods for the definition and appropriate application of phenotypes in diverse healthcare settings, and methods for validating both the definition and execution of electronic health records based phenotypes.

[1]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[2]  Edward G. Schilling,et al.  Juran's Quality Handbook , 1998 .

[3]  Thomas M. White,et al.  Model Formation: Extending the LOINC Conceptual Schema to Support Standardized Assessment Instruments , 2002, J. Am. Medical Informatics Assoc..

[4]  Nicolette de Keizer,et al.  Model Formulation: Defining and Improving Data Quality in Medical Registries: A Literature Review, Case Study, and Generic Framework , 2002, J. Am. Medical Informatics Assoc..

[5]  Philip J. B. Brown,et al.  Data quality probes - exploiting and improving the quality of electronic patient record data and patient care , 2002, Int. J. Medical Informatics.

[6]  Robert M Califf,et al.  Lessons learned from recent cardiovascular clinical trials: Part II. , 2002, Circulation.

[7]  Robert M Califf,et al.  Lessons learned from recent cardiovascular clinical trials: Part I. , 2002, Circulation.

[8]  D. Stryer,et al.  Practical clinical trials: increasing the value of clinical research for decision making in clinical and health policy. , 2003, JAMA.

[9]  Ram Chilukuri,et al.  Common Data Element (CDE) Management and Deployment in Clinical Trials , 2003, AMIA.

[10]  L. Etheredge,et al.  A rapid-learning health system. , 2007, Health affairs.

[11]  R. Kush,et al.  Electronic health records, medical research, and the Tower of Babel. , 2008, The New England journal of medicine.

[12]  C. Warren,et al.  Global youth tobacco surveillance, 2000-2007. , 2008, Morbidity and mortality weekly report. Surveillance summaries.

[13]  Holly Hedegaard,et al.  Strategies to improve external cause-of-injury coding in state-based hospital discharge and emergency department data systems: recommendations of the CDC Workgroup for Improvement of External Cause-of-Injury Coding. , 2008, MMWR. Recommendations and reports : Morbidity and mortality weekly report. Recommendations and reports.

[14]  J. Lellouch,et al.  Explanatory and pragmatic attitudes in therapeutical trials. , 1967, Journal of chronic diseases.

[15]  Daniel J. Vreeman,et al.  LOINC®: a universal catalogue of individual clinical observations and uniform representation of enumerated collections , 2010, Int. J. Funct. Informatics Pers. Medicine.

[16]  Huaqin Pan,et al.  The PhenX Toolkit: Get the Most From Your Measures , 2011, American journal of epidemiology.

[17]  Advancing research data infrastructure for patient-centered outcomes research. , 2011, JAMA.

[18]  V. Luketic,et al.  PROMIS computerised adaptive tests are dynamic instruments to measure health‐related quality of life in patients with cirrhosis , 2011, Alimentary pharmacology & therapeutics.

[19]  Nikolaos A. Patsopoulos,et al.  A pragmatic view on pragmatic trials , 2011, Dialogues in clinical neuroscience.

[20]  M. Fava,et al.  Using electronic medical records to enable large-scale studies in psychiatry: treatment resistant depression as a model , 2011, Psychological Medicine.

[21]  Cui Tao,et al.  Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: The SHARPn project , 2012, J. Biomed. Informatics.

[22]  Sarah M. Greene,et al.  Implementing the Learning Health System: From Concept to Action , 2012, Annals of Internal Medicine.

[23]  Marsha A Raebel,et al.  Design considerations, architecture, and use of the Mini‐Sentinel distributed data system , 2012, Pharmacoepidemiology and drug safety.

[24]  Katherine M. Newton,et al.  Learning Health Care Systems: Leading Through Research , 2012, Clinical Medicine & Research.

[25]  Lin Chen,et al.  Importance of multi-modal approaches to effectively identify cataract cases from electronic health records , 2012, J. Am. Medical Informatics Assoc..

[26]  J. Steiner,et al.  A pragmatic framework for single-site and multisite data quality assessment in electronic health record-based clinical research. , 2012, Medical care.

[27]  Steven N Goodman,et al.  The research-treatment distinction: a problematic approach for determining which activities should have ethical oversight. , 2013, The Hastings Center report.

[28]  Chunhua Weng,et al.  Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research , 2013, J. Am. Medical Informatics Assoc..

[29]  David McManus,et al.  Validation of acute myocardial infarction in the Food and Drug Administration's Mini‐Sentinel program , 2013, Pharmacoepidemiology and drug safety.

[30]  T. Murdoch,et al.  The inevitable application of big data to health care. , 2013, JAMA.

[31]  Steven N Goodman,et al.  An ethics framework for a learning health care system: a departure from traditional research ethics and clinical ethics. , 2013, The Hastings Center report.

[32]  Melissa A. Basford,et al.  Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[33]  George Hripcsak,et al.  Next-generation phenotyping of electronic health records , 2012, J. Am. Medical Informatics Assoc..