CLINICAL DATA WAREHOUSE: A REVIEW

Clinical decisions are critical because they affect human lives. Managers and decision makers in the clinical environment therefore seek new solutions that can support their decisions. A clinical data warehouse (CDW) is one such solution: it merges heterogeneous data sources into a central repository and uses that repository to answer questions in the strategic clinical domain, thereby supporting clinical decisions. CDW implementation faces numerous obstacles, starting with the data sources and ending with the tools used to view the clinical information. This paper presents a systematic overview of the purpose of CDWs as well as their characteristics; requirements; data sources; extract, transform and load (ETL) process; security and privacy concerns; design approach; architecture; and the challenges and difficulties of implementing a successful CDW. PubMed and Google Scholar were used to find papers related to CDWs. Of the 784 papers found, only 42 are included in the literature review. These papers are classified from five perspectives, namely methodology, data, system, ETL tool and purpose, to find insights related to aspects of CDWs. This review can contribute answers to questions related to CDWs and provide recommendations for implementing a successful CDW.

Index Terms: Clinical Data Warehouse, Data Warehouse, ETL, Clinical Operational Systems, Electronic Medical Records.
