Network-based stratification of tumor mutations

Many forms of cancer have multiple subtypes with different causes and clinical outcomes. Somatic tumor genome sequences provide a rich new source of data for uncovering these subtypes but have proven difficult to compare, as two tumors rarely share the same mutations. Here we introduce network-based stratification (NBS), a method to integrate somatic tumor genomes with gene networks. This approach allows for stratification of cancer into informative subtypes by clustering together patients with mutations in similar network regions. We demonstrate NBS in ovarian, uterine and lung cancer cohorts from The Cancer Genome Atlas. For each tissue, NBS identifies subtypes that are predictive of clinical outcomes such as patient survival, response to therapy or tumor histology. We identify network regions characteristic of each subtype and show how mutation-derived subtypes can be used to train an mRNA expression signature, which provides similar information in the absence of DNA sequence.

[1]  C. Waddington Canalization of Development and the Inheritance of Acquired Characters , 1942, Nature.

[2]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[3]  C. K. Yuen,et al.  Theory and Application of Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  R. Gill,et al.  Cox's regression model for counting processes: a large sample study : (preprint) , 1982 .

[5]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[6]  B. Strauss Role in tumorigenesis of silent mutations in the TP53 gene. , 2000, Mutation research.

[7]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[9]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Jill P. Mesirov,et al.  Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data , 2003, Machine Learning.

[11]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[12]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Kevin Coombes,et al.  Prognostic Role of a Multigene Reverse Transcriptase-PCR Assay in Patients with Node-Negative Breast Cancer Not Receiving Adjuvant Systemic Therapy , 2005, Clinical Cancer Research.

[14]  T J Hudson,et al.  Gene expression profiling of primary cultures of ovarian epithelial cells identifies novel molecular classifiers of ovarian cancer , 2006, British Journal of Cancer.

[15]  Lippincott-Schwartz,et al.  Supporting Online Material Materials and Methods Som Text Figs. S1 to S8 Table S1 Movies S1 to S3 a " Silent " Polymorphism in the Mdr1 Gene Changes Substrate Specificity Corrected 30 November 2007; See Last Page , 2022 .

[16]  Ross S Berkowitz,et al.  Whole genome oligonucleotide-based array comparative genomic hybridization analysis identified fibroblast growth factor 1 as a prognostic marker for advanced-stage serous ovarian adenocarcinomas. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[17]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[18]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[19]  R. Tothill,et al.  Novel Molecular Subtypes of Serous and Endometrioid Ovarian Cancer Linked to Clinical Outcome , 2008, Clinical Cancer Research.

[20]  Jiawei Han,et al.  Non-negative Matrix Factorization on Manifold , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[21]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[22]  Laurent Ozbun,et al.  A gene signature predicting for survival in suboptimally debulked patients with ovarian cancer. , 2008, Cancer research.

[23]  S. Cannistra,et al.  Gene-expression profiling in epithelial ovarian cancer , 2008, Nature Clinical Practice Oncology.

[24]  Weixiong Zhang,et al.  Identifying network communities with a high resolution. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Michael O Dorschner,et al.  Sequencing newly replicated DNA reveals widespread plasticity in human replication timing , 2009, Proceedings of the National Academy of Sciences.

[26]  Leyla Isik,et al.  Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations. , 2009, Cancer research.

[27]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[28]  M. Somerfield,et al.  American Society of Clinical Oncology provisional clinical opinion: testing for KRAS gene mutations in patients with metastatic colorectal carcinoma to predict response to anti-epidermal growth factor receptor monoclonal antibody therapy. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[29]  J. Stamatoyannopoulos,et al.  Human mutation rate associated with DNA replication timing , 2009, Nature Genetics.

[30]  Pamela K. Kreeger,et al.  Cancer systems biology: a network modeling perspective , 2009, Carcinogenesis.

[31]  S. Gabriel,et al.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. , 2010, Cancer cell.

[32]  David Haussler,et al.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM , 2010, Bioinform..

[33]  Caroline Dive,et al.  Inhibition of FGFR2 and FGFR1 increases cisplatin sensitivity in ovarian cancer , 2010, Cancer biology & therapy.

[34]  klaguia International Network of Cancer Genome Projects , 2010 .

[35]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[36]  N. Turner,et al.  Fibroblast growth factor signalling: from development to cancer , 2010, Nature Reviews Cancer.

[37]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[38]  B. Karlan,et al.  Gene expression profile of BRCAness that correlates with responsiveness to chemotherapy and with outcome in patients with epithelial ovarian cancer. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[39]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[40]  C. Sander,et al.  Predicting the functional impact of protein mutations: application to cancer genomics , 2011, Nucleic acids research.

[41]  J. Weidhaas,et al.  miRNAs in the spotlight: Making 'silent' mutations speak up , 2011, Nature Medicine.

[42]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[43]  C. Kimchi-Sarfaty,et al.  Understanding the contribution of synonymous mutations to human disease , 2011, Nature Reviews Genetics.

[44]  A. Cheema,et al.  Small Changes Huge Impact: The Role of Protein Posttranslational Modifications in Cellular Homeostasis and Disease , 2011, Journal of amino acids.

[45]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[46]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[47]  K. Wilson,et al.  The nucleoskeleton as a genome-associated dynamic 'network of networks' , 2011, Nature Reviews Molecular Cell Biology.

[48]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[49]  Trey Ideker,et al.  Protein Networks as Logic Functions in Development and Cancer , 2011, PLoS Comput. Biol..

[50]  L. Pusztai,et al.  Gene expression profiling in breast cancer: classification, prognostication, and prediction , 2011, The Lancet.

[51]  Zhengyan Kan,et al.  Exome sequencing identifies frequent mutation of ARID1A in molecular subtypes of gastric cancer , 2011, Nature Genetics.

[52]  Damian Szklarczyk,et al.  The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored , 2010, Nucleic Acids Res..

[53]  K. Hess,et al.  Association of BRCA1 and BRCA2 mutations with survival, chemotherapy sensitivity, and gene mutator phenotype in patients with ovarian cancer. , 2011, JAMA.

[54]  Jannik N. Andersen,et al.  Cancer genomics: from discovery science to personalized medicine , 2011, Nature Medicine.

[55]  Jay W. Wright,et al.  BRCAness Profile of Sporadic Ovarian Cancer Predicts Disease Recurrence , 2012, PloS one.

[56]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[57]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of human colon and rectal cancer , 2012, Nature.

[58]  Michael R. Hayden,et al.  Whole-Genome Sequencing: The New Standard of Care? , 2012, Science.

[59]  M. Roizen,et al.  Hallmarks of Cancer: The Next Generation , 2012 .

[60]  E. Mardis Genome sequencing and cancer. , 2012, Current opinion in genetics & development.

[61]  G. Berx,et al.  Gene expression profiling to dissect the complexity of cancer biology: pitfalls and promise. , 2012, Seminars in cancer biology.

[62]  Z. Szallasi,et al.  Implementing an online tool for genome-wide validation of survival-associated biomarkers in ovarian-cancer using microarray data from 1287 patients. , 2012, Endocrine-related cancer.

[63]  P. Stenson,et al.  The Human Gene Mutation Database (HGMD) and Its Exploitation in the Fields of Personalized Genomics and Molecular Evolution , 2012, Current protocols in bioinformatics.

[64]  Ilya Shmulevich,et al.  Integrated Analysis of Gene Expression and Tumor Nuclear Image Profiles Associated with Chemotherapy Response in Serous Ovarian Carcinoma , 2012, PloS one.

[65]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[66]  A. McKenna,et al.  Exome and whole genome sequencing of esophageal adenocarcinoma identifies recurrent driver events and mutational complexity , 2013, Nature Genetics.

[67]  H. Carter,et al.  Identifying Mendelian disease genes with the Variant Effect Scoring Tool , 2013, BMC Genomics.

[68]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[69]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[70]  Hannah Carter,et al.  CRAVAT: cancer-related analysis of variants toolkit , 2013, Bioinform..

[71]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer-associated genes , 2013 .

[72]  K. Cibulskis,et al.  Prognostically relevant gene signatures of high-grade serous ovarian carcinoma. , 2012, The Journal of clinical investigation.

[73]  Steven J. M. Jones,et al.  Integrated genomic characterization of endometrial carcinoma , 2013, Nature.

[74]  G. Barry,et al.  The emerging role of RNA and DNA editing in cancer. , 2014, Biochimica et biophysica acta.