Supporting Reuse of EHR Data in Healthcare Organizations: The CARED Research Infrastructure Framework

Healthcare organizations have in recent years started assembling their Electronic Health Record (EHR) data in data repositories to unlock their value using data analysis techniques. There are however a number of technical, organizational and ethical challenges that should be considered when reusing EHR data, which infrastructure technology consisting of appropriate software and hardware components can address. In a case study in the University Medical Center Utrecht (UMCU) in the Netherlands, we identified nine requirements of a modern technical infrastructure for reusing EHR data: (1) integrate data sources, (2) preprocess data, (3) store data, (4) support collaboration and documentation, (5) support various software and tooling packages, (6) enhance repeatability, (7) enhance privacy and security, (8) automate data process and (9) support analysis applications. We propose the CApable Reuse of EHR Data (CARED) framework for infrastructure that addresses these requirements, which consists of five consecutive data processing layers, and a control layer that governs the data processing. We then evaluate the framework with respect to the requirements, and finally describe its successful implementation in the Psychiatry Department of the UMCU along with three analysis cases. Our CARED research infrastructure framework can support healthcare organizations that aim to successfully reuse their EHR data.

[1]  Marco R. Spruit,et al.  Transitioning to a Data Driven Mental Health Practice: Collaborative Expert Sessions for Knowledge and Hypothesis Finding , 2016, Comput. Math. Methods Medicine.

[2]  George Hripcsak,et al.  Caveats for the use of operational electronic health record data in comparative effectiveness research. , 2013, Medical care.

[3]  Marzyeh Ghassemi,et al.  Metadata Correction: Making Big Data Useful for Health Care: A Summary of the Inaugural MIT Critical Data Conference , 2015, JMIR medical informatics.

[4]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[5]  Yichuan Wang,et al.  Exploring the path to big data analytics success in healthcare , 2017 .

[6]  Cui Tao,et al.  Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: The SHARPn project , 2012, J. Biomed. Informatics.

[7]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[8]  B. Dean,et al.  Review: Use of Electronic Medical Records for Health Outcomes Research , 2009, Medical care research and review : MCRR.

[9]  Ian M. Mitchell,et al.  Best Practices for Scientific Computing , 2012, PLoS biology.

[10]  Shawn N Murphy,et al.  Current State of Information Technologies for the Clinical Research Enterprise across Academic Medical Centers , 2012, Clinical and translational science.

[11]  Harsh Kupwade Patil,et al.  Big Data Security and Privacy Issues in Healthcare , 2014, 2014 IEEE International Congress on Big Data.

[12]  Christoph U. Lehmann,et al.  Clinical Data Reuse or Secondary Use: Current Status and Potential Future Progress , 2017, Yearbook of Medical Informatics.

[13]  Christopher Klekar,et al.  Developing Electronic Data Methods Infrastructure to Participate in Collaborative Research Networks , 2014, EGEMS.

[14]  Brian Hazlehurst,et al.  CER Hub: An informatics platform for conducting comparative effectiveness research using multi-institutional, heterogeneous, electronic clinical data , 2015, Int. J. Medical Informatics.

[15]  Helvi Kyngäs,et al.  The qualitative content analysis process. , 2008, Journal of advanced nursing.

[16]  Yichuan Wang,et al.  An integrated big data analytics-enabled transformation model: Application to health care , 2018, Inf. Manag..

[17]  Zhengwu Lu,et al.  Clinical data management: Current status, challenges, and future directions from industry perspectives , 2010 .

[18]  Ziad Obermeyer,et al.  Lost in Thought - The Limits of the Human Mind and the Future of Medicine. , 2017, The New England journal of medicine.

[19]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[20]  Paul G. Biondich,et al.  Enabling Better Interoperability for HealthCare: Lessons in Developing a Standards Based Application Programing Interface for Electronic Medical Record Systems , 2015, Journal of Medical Systems.

[21]  Wendy A. Wolf,et al.  The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies , 2011, BMC Medical Genomics.

[22]  Griffin M. Weber,et al.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2) , 2010, J. Am. Medical Informatics Assoc..

[23]  C Safran,et al.  Reuse Of Clinical Data , 2014, Yearbook of Medical Informatics.

[24]  Richard Platt,et al.  Launching PCORnet, a national patient-centered clinical research network , 2014, Journal of the American Medical Informatics Association : JAMIA.

[25]  Peter Groves,et al.  The 'big data' revolution in healthcare: Accelerating value and innovation , 2016 .

[26]  Murtaza Haider,et al.  Beyond the hype: Big data concepts, methods, and analytics , 2015, Int. J. Inf. Manag..

[27]  S. Lokhandwala,et al.  Objectives of the Secondary Analysis of Electronic Health Record Data , 2016 .

[28]  M. Singh,et al.  A Survey: Big Data Analytics in Health , 2017 .

[29]  Peter Tarczy-Hornoch,et al.  Characterizing Secondary Use of Clinical Data , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[30]  V. Santhosh Kumar,et al.  Data Warehouse Design Considerations for a Healthcare Business Intelligence System , 2015 .

[31]  Marco Spruit,et al.  DEDUCE: A pattern matching method for automatic de-identification of Dutch medical text , 2017, Telematics Informatics.

[32]  Peter J. Haug,et al.  Data Preparation Framework for Preprocessing Clinical Data in Data Mining , 2006, AMIA.

[33]  George Hripcsak,et al.  Recommendations for the Use of Operational Electronic Health Record Data in Comparative Effectiveness Research , 2013, EGEMS.

[34]  R. Peng Reproducible Research in Computational Science , 2011, Science.

[35]  D Kalra,et al.  Electronic health records: new opportunities for clinical research , 2013, Journal of internal medicine.

[36]  Karen J. Wernli,et al.  How the Provenance of Electronic Health Record Data Matters for Research: A Case Example Using System Mapping , 2014, EGEMS.

[37]  Joachim Roski,et al.  Creating value in health care through big data: opportunities and policy implications. , 2014, Health affairs.

[38]  Ahmed E. Youssef AFRAMEWORK FOR SECURE H EALTHCARE SYSTEMS BASED ON BIGDATA ANALYTICS IN M OBILE CLOUD COMPUTING ENVIRONMENTS , 2014 .

[39]  Christoph U. Lehmann,et al.  Clinical Data Reuse or Secondary Use: Current Status and Potential Future Progress. , 2017, Yearbook of medical informatics.

[40]  Christel Daniel-Le Bozec,et al.  Using electronic health records for clinical research: The case of the EHR4CR project , 2015, J. Biomed. Informatics.

[41]  Ahmed Hadj Kacem,et al.  Interoperability of healthcare information systems , 2014, The 2014 International Symposium on Networks, Computers and Communications.

[42]  Marco Spruit,et al.  Comparing Deep Learning and Classical Machine Learning Approaches for Predicting Inpatient Violence Incidents from Clinical Text , 2018, Applied Sciences.

[43]  P. Gill,et al.  Methods of data collection in qualitative research: interviews and focus groups , 2008, BDJ.

[44]  E. Larson,et al.  Using Electronically Available Inpatient Hospital Data for Research , 2011, Clinical and translational science.

[45]  Douglas MacFadden,et al.  Application of Information Technology The Shared Health Research Information Network ( SHRINE ) : A Prototype Federated Query Tool for Clinical Data Repositories , 2014 .

[46]  Binny Krishnankutty,et al.  Data management in clinical research: An overview , 2012, Indian journal of pharmacology.

[47]  Paul T. Groth,et al.  Ten Simple Rules for the Care and Feeding of Scientific Data , 2014, PLoS Comput. Biol..

[48]  Paul A. Harris,et al.  Secondary use of clinical data: The Vanderbilt approach , 2014, J. Biomed. Informatics.

[49]  N Peek,et al.  Technical Challenges for Big Data in Biomedicine and Health: Data Sources, Infrastructure, and Analytics , 2014, Yearbook of Medical Informatics.

[50]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[51]  T. Murdoch,et al.  The inevitable application of big data to health care. , 2013, JAMA.

[52]  Yolanda Gil,et al.  Privacy enforcement in data analysis workflows , 2007 .

[53]  Avita Katal,et al.  Big data: Issues, challenges, tools and Good practices , 2013, 2013 Sixth International Conference on Contemporary Computing (IC3).

[54]  T Ganslandt,et al.  Integrated Data Repository Toolkit (IDRT). A Suite of Programs to Facilitate Health Analytics on Heterogeneous Medical Data. , 2016, Methods of information in medicine.

[55]  P. Soranno,et al.  Creating and maintaining high-performing collaborative research teams: the importance of diversity and interpersonal skills , 2014 .

[56]  William W. Stead,et al.  Toward a science of learning systems: a research agenda for the high-functioning Learning Health System , 2014, J. Am. Medical Informatics Assoc..

[57]  Leo Anthony Celi,et al.  Challenges and Opportunities in Secondary Analyses of Electronic Health Record Data , 2016 .