Perspective on Oncogenic Processes at the End of the Beginning of Cancer Genomics

The Cancer Genome Atlas (TCGA) has catalyzed systematic characterization of diverse genomic alterations underlying human cancers. At this historic junction marking the completion of genomic characterization of over 11,000 tumors from 33 cancer types, we present our current understanding of the molecular processes governing oncogenesis. We illustrate our insights into cancer through synthesis of the findings of the TCGA PanCancer Atlas project on three facets of oncogenesis: (1) somatic driver mutations, germline pathogenic variants, and their interactions in the tumor; (2) the influence of the tumor genome and epigenome on transcriptome and proteome; and (3) the relationship between tumor and the microenvironment, including implications for drugs targeting driver events and immunotherapies. These results will anchor future characterization of rare and common tumor types, primary and relapsed tumors, and cancers across ancestry groups and will guide the deployment of clinical genomic sequencing.

[1]  F. Supek,et al.  The rules and impact of nonsense-mediated mRNA decay in human cancers , 2016, Nature Genetics.

[2]  Ludmila V. Danilova,et al.  Association of PD-1/PD-L axis expression with cytolytic activity, mutational load, and prognosis in melanoma and other solid tumors , 2016, Proceedings of the National Academy of Sciences.

[3]  A. Godzik,et al.  Mutation Drivers of Immunological Responses to Cancer , 2016, Cancer Immunology Research.

[4]  Derek W Wright,et al.  Gateways to the FANTOM5 promoter level mammalian expression atlas , 2015, Genome Biology.

[5]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[6]  Marc E. Lenburg,et al.  Gene Co-Expression Modules as Clinically Relevant Hallmarks of Breast Cancer Diversity , 2014, PloS one.

[7]  Ashton C. Berger,et al.  Genomic and Functional Approaches to Understanding Cancer Aneuploidy. , 2018, Cancer cell.

[8]  Joshua M. Stuart,et al.  RADIA: RNA and DNA Integrated Analysis for Somatic Mutation Detection , 2014, PloS one.

[9]  L. Attardi,et al.  Unravelling mechanisms of p53-mediated tumour suppression , 2014, Nature Reviews Cancer.

[10]  Ben Lehner,et al.  Cancer type-dependent genetic interactions between cancer driver alterations indicate plasticity of epistasis across cell types , 2015, Molecular systems biology.

[11]  D. Bates,et al.  Hallmarks of alternative splicing in cancer , 2014, Oncogene.

[12]  S. Sugano,et al.  Frequent pathway mutations of splicing machinery in myelodysplasia , 2011, Nature.

[13]  Michael T. Zimmermann,et al.  Genomic and Molecular Landscape of DNA Damage Repair Deficiency across The Cancer Genome Atlas , 2018, Cell reports.

[14]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[15]  Eunhee Kim,et al.  RNA splicing factors as oncoproteins and tumour suppressors , 2016, Nature Reviews Cancer.

[16]  Kern Rei Chng,et al.  Patient-specific driver gene prediction and risk assessment through integrated network analysis of cancer omics profiles , 2015, Nucleic acids research.

[17]  M. Nielsen,et al.  NetMHCpan-3.0; improved prediction of binding to MHC class I molecules integrating information from multiple receptor and peptide length datasets , 2016, Genome Medicine.

[18]  Charles H. Yoon,et al.  An immunogenic personal neoantigen vaccine for patients with melanoma , 2017, Nature.

[19]  Steven J. M. Jones,et al.  Comprehensive Characterization of Cancer Driver Genes and Mutations , 2018, Cell.

[20]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[21]  Matthew D. Wilkerson,et al.  ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking , 2010, Bioinform..

[22]  Peter W. Laird,et al.  Cell-of-Origin Patterns Dominate the Molecular Classification of 10,000 Tumors from 33 Types of Cancer , 2018, Cell.

[23]  F. Stossi,et al.  Mutual Regulation of Tumour Vessel Normalization and Immunostimulatory Reprogramming , 2017, Nature.

[24]  Ash A. Alizadeh,et al.  Robust enumeration of cell subsets from tissue expression profiles , 2015, Nature Methods.

[25]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[26]  Ken Chen,et al.  SomaticSniper: identification of somatic point mutations in whole genome sequencing data , 2012, Bioinform..

[27]  Li Ding,et al.  Driver Fusions and Their Implications in the Development and Treatment of Human Cancers , 2018, Cell reports.

[28]  Su-In Lee,et al.  The Proteomic Landscape of Triple-Negative Breast Cancer. , 2015, Cell reports.

[29]  Stephen R. Piccolo,et al.  A cloud-based workflow to quantify transcript-expression levels in public cancer compendia , 2016, Scientific Reports.

[30]  David Tamborero,et al.  OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes , 2013, Bioinform..

[31]  Li Ding,et al.  MIRMMR: binary classification of microsatellite instability using methylation and mutations , 2017, Bioinform..

[32]  Gianluca Bontempi,et al.  Moonlight: a tool for biological interpretation and driver genes discovery , 2018, bioRxiv.

[33]  Rui-Ru Ji,et al.  An immune-active tumor microenvironment favors clinical response to ipilimumab , 2012, Cancer Immunology, Immunotherapy.

[34]  Benjamin J. Raphael,et al.  Mutational landscape and significance across 12 major cancer types , 2013, Nature.

[35]  A. Godzik,et al.  Comparison of algorithms for the detection of cancer drivers at subgene resolution , 2017, Nature Methods.

[36]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[37]  Axel Benner,et al.  Effects of infiltrating lymphocytes and estrogen receptor on gene expression and prognosis in breast cancer , 2009, Breast Cancer Research and Treatment.

[38]  Gianluca Bontempi,et al.  TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages , 2016, F1000Research.

[39]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[40]  Ping Zhu,et al.  Somatic Mutational Landscape of Splicing Factor Genes and Their Functional Consequences across 33 Cancer Types. , 2018, Cell reports.

[41]  Thomas D. Wu,et al.  A comprehensive transcriptional portrait of human cancer cell lines , 2014, Nature Biotechnology.

[42]  Guey-Shin Wang,et al.  Splicing in disease: disruption of the splicing code and the decoding machinery , 2007, Nature Reviews Genetics.

[43]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[44]  S. Gabriel,et al.  Discovery and saturation analysis of cancer genes across 21 tumor types , 2014, Nature.

[45]  Matthew B. Callaway,et al.  MuSiC: Identifying mutational significance in cancer genomes , 2012, Genome research.

[46]  P. A. Futreal,et al.  MuSE: accounting for tumor heterogeneity using a sample-specific error model improves sensitivity and specificity in mutation calling from sequencing data , 2016, Genome Biology.

[47]  A. Sivachenko,et al.  Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples , 2013, Nature Biotechnology.

[48]  Benjamin Schubert,et al.  OptiType: precision HLA typing from next-generation sequencing data , 2014, Bioinform..

[49]  Steven J. M. Jones,et al.  Comprehensive, Integrative Genomic Analysis of Diffuse Lower-Grade Gliomas. , 2015, The New England journal of medicine.

[50]  Steven J. M. Jones,et al.  The Immune Landscape of Cancer , 2018, Immunity.

[51]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[52]  Rajarsi R. Gupta,et al.  Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images. , 2018, Cell reports.

[53]  A. Gonzalez-Perez,et al.  OncodriveFML: a general framework to identify coding and non-coding regions with cancer driver mutations , 2016, Genome Biology.

[54]  Gary D Bader,et al.  Systematic analysis of somatic mutations in phosphorylation signaling predicts novel cancer drivers , 2013 .

[55]  Derek Y. Chiang,et al.  MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery , 2010, Nucleic acids research.

[56]  J. Kench,et al.  Whole genomes redefine the mutational landscape of pancreatic cancer , 2015, Nature.

[57]  Matthew A. Wyczalkowski,et al.  Divergent viral presentation among human tumors and adjacent normal tissues , 2016, Scientific Reports.

[58]  A. Scorilas,et al.  A new tumor suppressor role for the Notch pathway in bladder cancer , 2014, Nature Medicine.

[59]  L. Riboni,et al.  HMGB1 as an autocrine stimulus in human T98G glioblastoma cells: role in cell growth and migration , 2008, Journal of Neuro-Oncology.

[60]  Adam Godzik,et al.  e-Driver: a novel method to identify protein regions driving cancer , 2014, Bioinform..

[61]  Steven J. M. Jones,et al.  Genomic Classification of Cutaneous Melanoma , 2015, Cell.

[62]  Ash A. Alizadeh,et al.  Gene Expression Signature of Fibroblast Serum Response Predicts Human Cancer Progression: Similarities between Tumors and Wounds , 2004, PLoS biology.

[63]  Luca Scrucca,et al.  mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models , 2016, R J..

[64]  N. Hacohen,et al.  Molecular and Genetic Properties of Tumors Associated with Local Immune Cytolytic Activity , 2015, Cell.

[65]  Alex Arenas,et al.  Improved prognostic classification of breast cancer defined by antagonistic activation patterns of immune response pathway modules , 2010, BMC Cancer.

[66]  Andrew H. Beck,et al.  The Macrophage Colony-Stimulating Factor 1 Response Signature in Breast Carcinoma , 2009, Clinical Cancer Research.

[67]  C. Yau,et al.  A pan-cancer genome-wide analysis reveals tumour dependencies by induction of nonsense-mediated decay , 2017, Nature Communications.

[68]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[69]  E. Birney,et al.  HRDetect is a predictor of BRCA1 and BRCA2 deficiency based on mutational signatures , 2017, Nature Medicine.

[70]  Benjamin J. Raphael,et al.  Multiplatform Analysis of 12 Cancer Types Reveals Molecular Classification within and across Tissues of Origin , 2014, Cell.

[71]  G. Parmigiani,et al.  The Consensus Coding Sequences of Human Breast and Colorectal Cancers , 2006, Science.

[72]  J. Costello,et al.  Isocitrate dehydrogenase mutations suppress STAT1 and CD8+ T cell accumulation in gliomas , 2017, The Journal of clinical investigation.

[73]  P. Lønning,et al.  Impact of KRAS, BRAF, PIK3CA, TP53 status and intraindividual mutation heterogeneity on outcome after liver resection for colorectal cancer metastases , 2016, International journal of cancer.

[74]  M. Stratton,et al.  High burden and pervasive positive selection of somatic mutations in normal human skin , 2015, Science.

[75]  Markus K. Muellner,et al.  Gain- and Loss-of-Function Mutations in the Breast Cancer Gene GATA3 Result in Differential Drug Sensitivity , 2016, PLoS genetics.

[76]  T. Cedena,et al.  Prognostic value of deep sequencing method for minimal residual disease detection in multiple myeloma. , 2014, Blood.

[77]  Christopher D. Brown,et al.  Robust stratification of breast cancer subtypes using differential patterns of transcript isoform expression , 2017, PLoS genetics.

[78]  K. Kinzler,et al.  Evaluating the evaluation of cancer driver genes , 2016, Proceedings of the National Academy of Sciences.

[79]  Gianluca Bontempi,et al.  TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data , 2015, Nucleic acids research.

[80]  B. Taylor,et al.  Recurrent activating mutations of G-protein-coupled receptor CYSLTR2 in uveal melanoma , 2016, Nature Genetics.

[81]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[82]  M. Miyazaki,et al.  Identification of a novel serum biomarker for pancreatic cancer, C4b-binding protein α-chain (C4BPA) by quantitative proteomic analysis using tandem mass tags , 2016, British Journal of Cancer.

[83]  Ludmila V. Danilova,et al.  Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade , 2017, Science.

[84]  Knut Reinert,et al.  Fast and accurate read mapping with approximate seeds and multiple backtracking , 2012, Nucleic acids research.

[85]  A. Drilon,et al.  Safety and Antitumor Activity of the Multitargeted Pan-TRK, ROS1, and ALK Inhibitor Entrectinib: Combined Results from Two Phase I Trials (ALKA-372-001 and STARTRK-1). , 2017, Cancer discovery.

[86]  Li Ding,et al.  Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines. , 2018, Cell systems.

[87]  Li Ding,et al.  Protein-structure-guided discovery of functional mutations across 19 cancer types , 2016, Nature Genetics.

[88]  Kai Ye,et al.  MSIsensor: microsatellite instability detection using paired tumor-normal sequence data , 2014, Bioinform..

[89]  Trevor J Pugh,et al.  Initial genome sequencing and analysis of multiple myeloma , 2011, Nature.

[90]  Li Ding,et al.  Patterns and functional implications of rare germline variants across 12 cancer types , 2015, Nature Communications.

[91]  Adrian V. Lee,et al.  An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics , 2018, Cell.

[92]  Steven J. M. Jones,et al.  Pathogenic Germline Variants in 10,389 Adult Cancers. , 2018, Cell.

[93]  Benjamin J. Raphael,et al.  A weighted exact test for mutually exclusive mutations in cancer , 2016, Bioinform..

[94]  Matthew A. Wyczalkowski,et al.  Systematic Analysis of Splice-Site-Creating Mutations in Cancer , 2018, Cell reports.

[95]  Benjamin J. Raphael,et al.  GenomeVIP: a cloud platform for genomic variant discovery and interpretation , 2017, Genome research.

[96]  Justin Guinney,et al.  GSVA: gene set variation analysis for microarray and RNA-Seq data , 2013, BMC Bioinformatics.

[97]  Junjie Hou,et al.  Integrated proteomic and N-glycoproteomic analyses of doxorubicin sensitive and resistant ovarian cancer cells reveal glycoprotein alteration in protein abundance and glycosylation , 2017, Oncotarget.

[98]  Benjamin J. Raphael,et al.  Integrated Analysis of Germline and Somatic Variants in Ovarian Cancer , 2014, Nature Communications.

[99]  Mariano J. Alvarez,et al.  Network-based inference of protein activity helps functionalize the genetic landscape of cancer , 2016, Nature Genetics.

[100]  Wenjun Chang,et al.  Androgen receptor inhibitor–induced “BRCAness” and PARP inhibition are synthetically lethal for castration-resistant prostate cancer , 2017, Science Signaling.