Data and knowledge management in translational research: implementation of the eTRIKS platform for the IMI OncoTrack consortium

BackgroundFor large international research consortia, such as those funded by the European Union’s Horizon 2020 programme or the Innovative Medicines Initiative, good data coordination practices and tools are essential for the successful collection, organization and analysis of the resulting data. Research consortia are attempting ever more ambitious science to better understand disease, by leveraging technologies such as whole genome sequencing, proteomics, patient-derived biological models and computer-based systems biology simulations.ResultsThe IMI eTRIKS consortium is charged with the task of developing an integrated knowledge management platform capable of supporting the complexity of the data generated by such research programmes. In this paper, using the example of the OncoTrack consortium, we describe a typical use case in translational medicine. The tranSMART knowledge management platform was implemented to support data from observational clinical cohorts, drug response data from cell culture models and drug response data from mouse xenograft tumour models. The high dimensional (omics) data from the molecular analyses of the corresponding biological materials were linked to these collections, so that users could browse and analyse these to derive candidate biomarkers.ConclusionsIn all these steps, data mapping, linking and preparation are handled automatically by the tranSMART integration platform. Therefore, researchers without specialist data handling skills can focus directly on the scientific questions, without spending undue effort on processing the data and data integration, which are otherwise a burden and the most time-consuming part of translational research data analysis.

[1]  Patrice Degoulet,et al.  Translational research platforms integrating clinical and omics data: a review of publicly available solutions , 2014, Briefings Bioinform..

[2]  Thomas Lumley,et al.  Review of Statistical Learning Methods in Integrated Omics Studies (An Integrated Information Science) , 2018, Bioinformatics and biology insights.

[3]  Catherine L. Worth,et al.  Molecular dissection of colorectal cancer in pre-clinical models identifies biomarkers predicting sensitivity to EGFR inhibitors , 2017, Nature Communications.

[4]  Lori C. Phillips,et al.  Using the i2b2 hive for clinical discovery: an example. , 2007, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[5]  Philip R. O. Payne,et al.  TRIAD: The Translational Research Informatics and Data Management Grid , 2011, Applied Clinical Informatics.

[6]  Benjamin E. Gross,et al.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. , 2012, Cancer discovery.

[7]  Joaquín Dopazo,et al.  Paintomics: a web based tool for the joint visualization of transcriptomics and metabolomics data , 2010, Bioinform..

[8]  Tomasz Waller,et al.  DNA microarray integromics analysis platform , 2015, BioData Mining.

[9]  Ghita Rahal,et al.  eTRIKS platform: Conception and operation of a highly scalable cloud-based platform for translational research and applications development , 2018, Comput. Biol. Medicine.

[10]  Reinhard Schneider,et al.  Fractalis: a scalable open-source service for platform-independent interactive visual analysis of biomedical data , 2018, GigaScience.

[11]  X. Montalban,et al.  Clinical practice of analysis of anti-drug antibodies against interferon beta and natalizumab in multiple sclerosis patients in Europe: A descriptive study of test results , 2017, PloS one.

[12]  Joel H. Saltz,et al.  Model Formulation: caGrid 1.0: An Enterprise Grid Infrastructure for Biomedical Research , 2008, J. Am. Medical Informatics Assoc..

[13]  Subha Madhavan,et al.  Platform for Personalized Oncology: Integrative analyses reveal novel molecular signatures associated with colorectal cancer relapse. , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[14]  Anita Burgun-Parenthoine,et al.  Exploring and visualizing multidimensional data in translational research platforms , 2016, Briefings Bioinform..

[15]  Susan C. Weber,et al.  STRIDE - An Integrated Standards-Based Translational Research Informatics Platform , 2009, AMIA.

[16]  Christina Backes,et al.  Multi-omics enrichment analysis using the GeneTrail2 web service , 2016, Bioinform..

[17]  Egon L. Willighagen,et al.  Automatically visualise and analyse data on pathways using PathVisioRPC from any programming environment , 2015, BMC Bioinformatics.

[18]  Keith Marsolo,et al.  An i2b2-based, generalizable, open source, self-scaling chronic disease registry , 2012, J. Am. Medical Informatics Assoc..

[19]  Subha Madhavan,et al.  G-DOC Plus – an integrative bioinformatics platform for precision medicine , 2016, BMC Bioinformatics.

[20]  Yufeng J. Tseng,et al.  3Omics: a web-based systems biology tool for analysis, integration and visualization of human transcriptomic, proteomic and metabolomic data , 2013, BMC Systems Biology.

[21]  Andriani Daskalaki,et al.  Prediction in the face of uncertainty: a Monte Carlo-based approach for systems biology of cancer treatment. , 2012, Mutation research.

[22]  Alan Tan,et al.  BRISK - research-oriented storage kit for biology-related data , 2011, Bioinform..

[23]  Thomas Heinis,et al.  eTRIKS analytical environment: A modular high performance framework for medical data analysis , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[24]  Jeremy Miller,et al.  A Pilot Trial Testing the Feasibility of Using Molecular-Guided Therapy in Patients with Recurrent Neuroblastoma , 2012 .

[25]  Charles Auffray,et al.  Application of ’omics technologies to biomarker discovery in inflammatory lung diseases , 2013, European Respiratory Journal.

[26]  Parnesh Raniga,et al.  Design, implementation and operation of a multimodality research imaging informatics repository , 2015, Health Information Science and Systems.

[27]  Marek Ostaszewski,et al.  Integration and Visualization of Translational Medicine Data for Better Understanding of Human Diseases , 2016, Big Data.

[28]  N. Vu,et al.  Good manufacturing practice-compliant isolation and culture of human umbilical cord blood-derived mesenchymal stem cells , 2014, Journal of Translational Medicine.

[29]  Haiming Wang,et al.  EuPathDB: the eukaryotic pathogen genomics database resource , 2016, Nucleic Acids Res..

[30]  Anita Burgun,et al.  Integrating Heterogeneous Biomedical Data for Cancer Research: the CARPEM infrastructure , 2016, Applied Clinical Informatics.

[31]  Tian Xia,et al.  OmicsAnalyzer: a Cytoscape plug-in suite for modeling omics data , 2010, Bioinform..

[32]  Jihoon Kim,et al.  iDASH: integrating data for analysis, anonymization, and sharing , 2012, J. Am. Medical Informatics Assoc..

[33]  Ralf Herwig,et al.  DIPSBC - data integration platform for systems biology collaborations , 2012, BMC Bioinformatics.

[34]  Arthur W. Toga,et al.  Big biomedical data as the key resource for discovery science , 2015, J. Am. Medical Informatics Assoc..

[35]  Adriano Barbosa-Silva,et al.  SmartR: an open-source platform for interactive visual analytics for translational research data , 2017, Bioinform..

[36]  Marc-Thorsten Hütt,et al.  Interdisciplinary approach towards a systems medicine toolbox using the example of inflammatory diseases , 2016, Briefings Bioinform..

[37]  Ulrich Keilholz,et al.  Personalized medicine approaches for colon cancer driven by genomics and systems biology: OncoTrack , 2014, Biotechnology journal.

[38]  Ryan Ramanujam,et al.  Occurrence of Anti-Drug Antibodies against Interferon-Beta and Natalizumab in Multiple Sclerosis: A Collaborative Cohort Analysis , 2016, PloS one.

[39]  David Gomez-Cabrero,et al.  The COPD Knowledge Base: enabling data analysis and computational simulation in translational COPD research , 2014, Journal of Translational Medicine.

[40]  E. Perakslis,et al.  Effective knowledge management in translational medicine , 2010, Journal of Translational Medicine.

[41]  Kay Nieselt,et al.  Mayday SeaSight: Combined Analysis of Deep Sequencing and Microarray Data , 2011, PloS one.