Data Harmonization for a Molecularly Driven Health System

Data commons have emerged as the best current method for enabling data aggregation across multiple projects and multiple data sources. Good data harmonization techniques are critical to maintain quality of data within a data commons, as well as to allow future meta-analysis across different data commons. We present some of the current best practices for data harmonization.

[1]  S. H. van der Burg,et al.  Rationally combining immunotherapies to improve efficacy of immune checkpoint blockade in solid tumors. , 2017, Cytokine & growth factor reviews.

[2]  Henry Rodriguez,et al.  Revolutionizing Precision Oncology through Collaborative Proteogenomics and Data Sharing , 2018, Cell.

[3]  Harald Barsnes,et al.  BioContainers: an open-source and community-driven framework for software standardization , 2017, Bioinform..

[4]  Benjamin E. Gross,et al.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. , 2012, Cancer discovery.

[5]  Benjamin E. Gross,et al.  Integrative Analysis of Complex Cancer Genomics and Clinical Profiles Using the cBioPortal , 2013, Science Signaling.

[6]  Benedict Paten,et al.  The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows , 2017, F1000Research.

[7]  Huanyu Chen,et al.  FDA approval summary: crizotinib for the treatment of metastatic non-small cell lung cancer with anaplastic lymphoma kinase rearrangements. , 2014, The oncologist.

[8]  Samuel V. Angiuoli,et al.  Collaborating to Compete: Blood Profiling Atlas in Cancer (BloodPAC) Consortium , 2017, Clinical pharmacology and therapeutics.

[9]  G. Parmigiani,et al.  The Consensus Coding Sequences of Human Breast and Colorectal Cancers , 2006, Science.

[10]  J. Sicklick,et al.  Prevalence of PDL1 Amplification and Preliminary Response to Immune Checkpoint Blockade in Solid Tumors , 2018, JAMA oncology.

[11]  Stephen R. Piccolo,et al.  A cloud-based workflow to quantify transcript-expression levels in public cancer compendia , 2016, Scientific Reports.

[12]  AACR Project GENIE: Powering Precision Medicine through an International Consortium. , 2017, Cancer discovery.