Translational research platforms integrating clinical and omics data: a review of publicly available solutions

The rise of personalized medicine and the availability of high-throughput molecular analyses in clinical care have increased the need for tools that let translational researchers manage and explore these data. We reviewed the biomedical literature for translational platforms that support the management and exploration of clinical and omics data, and identified seven platforms: BRISK, caTRIP, cBio Cancer Portal, G-DOC, iCOD, iDASH and tranSMART. We analyzed these platforms along seven major axes. (1) The community axis grouped information on the initiators and funders of each project, as well as its availability status and references. (2) The information content axis covered the nature of the clinical and omics data handled by each system. (3) The privacy management environment axis encompassed functionalities allowing control over data privacy. (4) The analysis support axis detailed the analytical and statistical tools provided by the platforms. We also explored (5) interoperability support and (6) system requirements. The final axis, (7) platform support, listed the availability of documentation and installation procedures. The platforms were highly heterogeneous in their capability to manage phenotype information in addition to omics data, and in their security and interoperability features. Analytical and visualization features also differ strongly from one platform to another, as does the availability of the systems. This review aims to give readers the background to choose the platform best suited to their needs. To conclude, we discuss the desiderata for optimal translational research platforms in terms of privacy, interoperability and technical features.
