Developing HL7 CDA-Based Data Warehouse for the Use of Electronic Health Record Data for Secondary Purposes

Background The growing availability of clinical and administrative data collected in electronic health records (EHRs) have led researchers and policy makers to implement data warehouses to improve the reuse of EHR data for secondary purposes. This approach can take advantages from a unique source of information that collects data from providers across multiple organizations. Moreover, the development of a data warehouse benefits from the standards adopted to exchange data provided by heterogeneous systems. Objective This article aims to design and implement a conceptual framework that semiautomatically extracts information collected in Health Level 7 Clinical Document Architecture (CDA) documents stored in an EHR and transforms them to be loaded in a target data warehouse. Results The solution adopted in this article supports the integration of the EHR as an operational data store in a data warehouse infrastructure. Moreover, data structure of EHR clinical documents and the data warehouse modeling schemas are analyzed to define a semiautomatic framework that maps the primitives of the CDA with the concepts of the dimensional model. The case study successfully tests this approach. Conclusion The proposed solution guarantees data quality using structured documents already integrated in a large-scale infrastructure, with a timely updated information flow. It ensures data integrity and consistency and has the advantage to be based on a sample size that covers a broad target population. Moreover, the use of CDAs simplifies the definition of extract, transform, and load tools through the adoption of a conceptual framework that load the information stored in the CDA in the data warehouse.

[1]  O. Lupse,et al.  Using HL7 CDA and CCD standards to improve communication between healthcare information systems , 2011, 2011 IEEE 9th International Symposium on Intelligent Systems and Informatics.

[2]  Sabine Loudcher,et al.  X-WACoDa: An XML-based approach for Warehousing and Analyzing Complex Data , 2017, ArXiv.

[3]  Hyoil Han,et al.  XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses , 2005, DaWaK.

[4]  Marleen de Mul,et al.  Development of a clinical data warehouse from an intensive care clinical information system , 2012, Comput. Methods Programs Biomed..

[5]  Paul A. Harris,et al.  Secondary use of clinical data: The Vanderbilt approach , 2014, J. Biomed. Informatics.

[6]  Cui Tao,et al.  Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: The SHARPn project , 2012, J. Biomed. Informatics.

[7]  Mary Roth,et al.  Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources , 1997, VLDB.

[8]  Tobias Mettler,et al.  Supplier Relationship Management: A Case Study in the Context of Health Care , 2009, J. Theor. Appl. Electron. Commer. Res..

[9]  Clement J. McDonald,et al.  Standardizing clinical laboratory data for secondary use , 2012, J. Biomed. Informatics.

[10]  Philippe Lambin,et al.  Benefits of a clinical data warehouse with data mining tools to collect data for a radiotherapy trial. , 2013, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[11]  Barbara Wixom,et al.  The benefits of data warehousing: why some organizations realize exceptional payoffs , 2002, Inf. Manag..

[12]  E. Balas,et al.  Improving clinical practice using clinical decision support systems: a systematic review of trials to identify features critical to success , 2005, BMJ : British Medical Journal.

[13]  Jonathan S. Einbinder,et al.  Evaluation of a data warehouse in an academic health sciences center , 2000, Int. J. Medical Informatics.

[14]  Xiaoyan Wang,et al.  Active computerized pharmacovigilance using natural language processing, statistics, and electronic health records: a feasibility study. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[15]  John A. Zachman,et al.  Data Stores, Data Warehousing, and the Zachman Framework: Managing Enterprise Knowledge , 1997 .

[16]  Fabrizio L. Ricci,et al.  Secondary uses of EHR systems: A feasibility study , 2013, 2013 E-Health and Bioengineering Conference (EHB).

[17]  Fabrizio L. Ricci,et al.  EHR-centric integration of health information systems , 2013, 2013 E-Health and Bioengineering Conference (EHB).

[18]  Daniel L Rubin,et al.  A data warehouse for integrating radiologic and pathologic data. , 2008, Journal of the American College of Radiology : JACR.

[19]  Charles N. Mead,et al.  The HL7 Reference Information Model Under Scrutiny , 2006, MIE.

[20]  Omar Boussaïd,et al.  X-Warehousing: An XML-Based Approach for Warehousing Complex Data , 2006, ADBIS.

[21]  Baoyan Liu,et al.  Development of traditional Chinese medicine clinical data warehouse for medical knowledge discovery and decision support , 2010, Artif. Intell. Medicine.

[22]  Thomas J. Eggebraaten,et al.  A health-care data model based on the HL7 Reference Information Model , 2007, IBM Syst. J..

[23]  Rinaldo Bellomo,et al.  Development and implementation of a high-quality clinical database: the Australian and New Zealand Intensive Care Society Adult Patient Database. , 2006, Journal of critical care.

[24]  Farzad Mostashari,et al.  Using electronic health record alerts to provide public health situational awareness to clinicians , 2010, J. Am. Medical Informatics Assoc..

[25]  Tony R. Sahama,et al.  A Data Warehouse Architecture for Clinical Data Warehousing , 2007, ACSW.

[26]  Jun Gao,et al.  DW4TR: A Data Warehouse for Translational Research , 2011, J. Biomed. Informatics.

[27]  N. Adler,et al.  Using Electronic Health Records for Population Health Research: A Review of Methods and Applications. , 2016, Annual review of public health.

[28]  Charles Safran,et al.  Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[29]  J. Selby,et al.  Network News: Powering Clinical Research , 2013, Science Translational Medicine.

[30]  H. Prokosch,et al.  Perspectives for Medical Informatics , 2009, Methods of Information in Medicine.

[31]  Wolfgang Hümmer,et al.  XCube: XML for data warehouses , 2003, DOLAP '03.

[32]  Søren Brunak,et al.  Using Electronic Patient Records to Discover Disease Correlations and Stratify Patient Cohorts , 2011, PLoS Comput. Biol..