Use of "big data" in drug discovery and clinical trials.

Oncology is undergoing a data-driven metamorphosis. Armed with new and ever more efficient molecular and information technologies, we have entered an era where data is helping us spearhead the fight against cancer. This technology driven data explosion, often referred to as "big data", is not only expediting biomedical discovery, but it is also rapidly transforming the practice of oncology into an information science. This evolution is critical, as results to-date have revealed the immense complexity and genetic heterogeneity of patients and their tumors, a sobering reminder of the challenge facing every patient and their oncologist. This can only be addressed through development of clinico-molecular data analytics that provide a deeper understanding of the mechanisms controlling the biological and clinical response to available therapeutic options. Beyond the exciting implications for improved patient care, such advancements in predictive and evidence-based analytics stand to profoundly affect the processes of cancer drug discovery and associated clinical trials.

[1]  D. Lindberg,et al.  Building the Unified Medical Language System , 1989 .

[2]  Steven J. M. Jones,et al.  Integrated genomic characterization of endometrial carcinoma , 2013, Nature.

[3]  J. Ross,et al.  MammaPrint™ 70-gene signature: another milestone in personalized medical care for breast cancer patients , 2009, Expert review of molecular diagnostics.

[4]  Jingyuan Fu,et al.  Systems genetics: From GWAS to disease pathways. , 2014, Biochimica et biophysica acta.

[5]  B. Monk,et al.  The vaginal and gastrointestinal microbiomes in gynecologic cancers: a review of applications in etiology, symptoms and treatment. , 2015, Gynecologic oncology.

[6]  Sungsam Gong,et al.  MetaBase—the wiki-database of biological databases , 2011, Nucleic Acids Res..

[7]  Stephen D. Wang,et al.  Redefining big-data clinical trial (BCT). , 2014, Annals of translational medicine.

[8]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[9]  Gary D. Bader,et al.  Pathguide: a Pathway Resource List , 2005, Nucleic Acids Res..

[10]  R. Iyengar,et al.  Systems pharmacology: network analysis to identify multiscale mechanisms of drug action. , 2012, Annual review of pharmacology and toxicology.

[11]  R Bellazzi,et al.  Big Data and Biomedical Informatics: A Challenging Opportunity , 2014, Yearbook of Medical Informatics.

[12]  Alex Bateman,et al.  Databases, data tombs and dust in the wind , 2008, Bioinform..

[13]  Kyung-Ah Sohn,et al.  Integrative network analysis for survival-associated gene-gene interactions across multiple genomic profiles in ovarian cancer , 2015, Journal of Ovarian Research.

[14]  A. Nobel,et al.  Concordance among Gene-Expression – Based Predictors for Breast Cancer , 2011 .

[15]  Min Zhao,et al.  ECGene: A Literature‐Based Knowledgebase of Endometrial Cancer Genes , 2016, Human mutation.

[16]  Euan A Ashley,et al.  The precision medicine initiative: a new national effort. , 2015, JAMA.

[17]  C. Creighton,et al.  Reverse phase protein arrays in signaling pathways: a data integration perspective , 2015, Drug design, development and therapy.

[18]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[19]  J. Lee,et al.  An overview of the design and conduct of the BATTLE trials. , 2015, Chinese clinical oncology.

[20]  Theodoros G. Soldatos,et al.  Correction: Corrigendum: Src activation by β-adrenoreceptors is a key switch for tumour metastasis , 2013, Nature Communications.

[21]  Thomas A Trikalinos,et al.  The appropriateness of asymmetry tests for publication bias in meta-analyses: a large survey , 2007, Canadian Medical Association Journal.

[22]  Sivanesan Dakshanamurthy,et al.  Big data: the next frontier for innovation in therapeutics and healthcare , 2014, Expert review of clinical pharmacology.

[23]  R. Altman,et al.  Data-Driven Prediction of Drug Effects and Interactions , 2012, Science Translational Medicine.

[24]  Naren Ramakrishnan,et al.  Experiences with mining temporal event sequences from electronic medical records: initial successes and some challenges , 2011, KDD.

[25]  Eytan Domany,et al.  Outcome signature genes in breast cancer: is there a unique set? , 2004, Breast Cancer Research.

[26]  I. Lipkus,et al.  Treatment Choices Based on OncotypeDx in the Breast Oncology Care Setting , 2012, Journal of cancer epidemiology.

[27]  Avi Ma'ayan,et al.  Lean Big Data integration in systems biology and systems pharmacology. , 2014, Trends in pharmacological sciences.

[28]  David B Jackson,et al.  Src activation by β-adrenoreceptors is a key switch for tumor metastasis , 2012, Nature Communications.

[29]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[30]  H. Goldstein,et al.  Evaluating bias due to data linkage error in electronic healthcare records , 2014, BMC Medical Research Methodology.

[31]  K. Bretonnel Cohen,et al.  Text mining and manual curation of chemical-gene-disease networks for the Comparative Toxicogenomics Database (CTD) , 2009, BMC Bioinformatics.

[32]  David B Jackson,et al.  Genetic determinants of anticancer drug activity: towards a global approach to personalized cancer medicine , 2011, Expert review of molecular diagnostics.

[33]  Rui Chen,et al.  Promise of personalized omics to precision medicine , 2013, Wiley interdisciplinary reviews. Systems biology and medicine.

[34]  Zhiwei Wang,et al.  Forkhead box M1 transcription factor: a novel target for cancer therapy. , 2010, Cancer treatment reviews.

[35]  Olivier Bodenreider Using SNOMED CT in combination with MedDRA for reporting signal detection and adverse drug reactions reporting , 2009, AMIA.

[36]  L. Hood,et al.  P4 medicine: how systems medicine will transform the healthcare sector and society. , 2013, Personalized medicine.

[37]  David Haussler,et al.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM , 2010, Bioinform..

[38]  David F. Horrobin,et al.  Modern biomedical research: an internally self-consistent universe with little contact with medical reality? , 2003, Nature Reviews Drug Discovery.

[39]  R. Costa,et al.  Chk2 Mediates Stabilization of the FoxM1 Transcription Factor To Stimulate Expression of DNA Repair Genes , 2006, Molecular and Cellular Biology.

[40]  Qingyang Zhang,et al.  Integrative network analysis of TCGA data for ovarian cancer , 2014, BMC Systems Biology.

[41]  Griffin M. Weber,et al.  Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2) , 2010, J. Am. Medical Informatics Assoc..

[42]  C. Ré,et al.  A Machine Reading System for Assembling Synthetic Paleontological Databases , 2014, PloS one.

[43]  Fabrizio De Ponti,et al.  Data Mining Techniques in Pharmacovigilance: Analysis of the Publicly Accessible FDA Adverse Event Reporting System (AERS) , 2012 .