Enhanced data extraction, transforming and loading processing for Traditional Chinese Medicine clinical data warehouse

Clinical data warehouses have been developed as a fundamental data infrastructure for large-scale Traditional Chinese Medicine (TCM) clinical data management and decision support services. However, data extraction, transformation and loading (ETL), a key component, is a complicated and labor-intensive task that must ensure high data quality before any data analysis. This paper introduces an enhanced ETL framework, comprising an operational data store (ODS) model and two-step data preprocessing subcomponents, to perform the ETL tasks. The ODS data model was designed to integrate heterogeneous clinical data sources and to support direct copying from these sources into the ODS database by ETL. The ETL task is therefore separated into two core steps in the enhanced ETL component: (1) dynamic filtering and copying of the original operational data sources to the ODS; (2) specialized transformation of the ODS data into the detailed clinical data warehouse. This enhanced framework improves ETL performance for use in a clinical data center, where various kinds of operational data sources need to be integrated. This paper describes the enhanced ETL framework and proposes some key procedures to accomplish these tasks.
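The two-step separation described above can be illustrated with a minimal sketch. It is not the paper's implementation: the table names (`ods_visit`, `fact_prescription`), the columns, the sample records, and the filter rule (drop records without a patient identifier) are all hypothetical, and an in-memory SQLite database stands in for both the ODS and the warehouse.

```python
import sqlite3


def copy_to_ods(source_rows, ods, keep):
    """Step 1: filter source records and copy them unchanged into the ODS."""
    kept = [row for row in source_rows if keep(row)]
    ods.executemany(
        "INSERT INTO ods_visit (source, patient_id, visit_date, herbs) "
        "VALUES (?, ?, ?, ?)",
        kept,
    )
    return len(kept)


def transform_to_warehouse(ods, dw):
    """Step 2: turn each ODS visit into one warehouse row per prescribed herb."""
    rows = ods.execute(
        "SELECT patient_id, visit_date, herbs FROM ods_visit"
    ).fetchall()
    loaded = 0
    for patient_id, visit_date, herbs in rows:
        for herb in herbs.split(";"):
            herb = herb.strip().lower()  # normalize spelling variants
            if herb:
                dw.execute(
                    "INSERT INTO fact_prescription VALUES (?, ?, ?)",
                    (patient_id, visit_date, herb),
                )
                loaded += 1
    return loaded


def run_demo():
    # One in-memory database holds both the (hypothetical) ODS and warehouse tables.
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE ods_visit "
        "(source TEXT, patient_id TEXT, visit_date TEXT, herbs TEXT)"
    )
    db.execute(
        "CREATE TABLE fact_prescription "
        "(patient_id TEXT, visit_date TEXT, herb TEXT)"
    )
    # Invented records from two hypothetical operational sources.
    source_rows = [
        ("his_a", "P001", "2009-03-01", "Huangqi; Danggui"),
        ("his_a", "", "2009-03-02", "Gancao"),  # no patient id: filtered in step 1
        ("emr_b", "P002", "2009-03-05", "gancao;"),
    ]
    copied = copy_to_ods(source_rows, db, keep=lambda row: bool(row[1]))
    loaded = transform_to_warehouse(db, db)
    return copied, loaded
```

Keeping step 1 a plain filtered copy means a new operational source only needs a mapping into the ODS layout, while the more specialized transformation logic in step 2 is written once against the ODS.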
