Presenting and sharing clinical data using the eTRIKS Standards Master Tree for tranSMART

Abstract

Motivation: Standardization and semantic alignment are considered among the major challenges for data integration in clinical research. Incorporating the CDISC SDTM clinical data standard into the tranSMART i2b2 tree via a guiding master ontology tree supports data sharing, visualization and exploration across datasets.

Results: We present a schema for organizing SDTM variables in the tranSMART i2b2 tree, along with a script and a test dataset that exemplify the mapping strategy. The eTRIKS master tree concept is demonstrated using fictitious data generated for four patients, covering 16 SDTM clinical domains. We describe how using correct visit names and data labels helps to integrate multiple readouts per patient and to avoid ETL crashes when running a tranSMART loading routine.

Availability and implementation: The eTRIKS Master Tree package and test datasets are publicly available at https://doi.org/10.5281/zenodo.1009098. A functional demo installation is available at https://public.etriks.org/transmart/datasetExplorer/, where the discussed examples can be visualized under the eTRIKS - Master Tree branch.
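The role of visit names and data labels in keeping multiple readouts per patient apart can be illustrated with a minimal sketch. It is not the script distributed with the eTRIKS Master Tree package: the folder names, the path layout (folder, then test label, then visit name), and the helpers `concept_path` and `index_readouts` are all assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical mapping of SDTM domain codes to top-level tree folders.
# The actual eTRIKS master tree layout may differ.
DOMAIN_FOLDERS = {
    "VS": "Vital Signs",
    "LB": "Laboratory Test Results",
}


def concept_path(domain, test_label, visit_name):
    """Build a tree path from folder, test label and visit name (assumed layout)."""
    folder = DOMAIN_FOLDERS[domain]
    return "\\".join([folder, test_label.strip(), visit_name.strip()])


def index_readouts(records):
    """Group values by (subject, path) and report keys holding more than one value."""
    index = defaultdict(list)
    for rec in records:
        path = concept_path(rec["domain"], rec["test"], rec["visit"])
        index[(rec["subject"], path)].append(rec["value"])
    collisions = [key for key, values in index.items() if len(values) > 1]
    return dict(index), collisions


# Fictitious readouts: one patient, one test, two distinct visits.
records = [
    {"subject": "P001", "domain": "VS", "test": "Systolic Blood Pressure",
     "visit": "Baseline", "value": 120},
    {"subject": "P001", "domain": "VS", "test": "Systolic Blood Pressure",
     "visit": "Week 4", "value": 118},
]

index, collisions = index_readouts(records)
for (subject, path), values in index.items():
    print(subject, path, values)
```

In this sketch, two readouts that share a subject, test label and visit name would land on the same path and be reported in `collisions`. That is the kind of ambiguity that distinct visit names are meant to prevent before a loading routine runs.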
