NetSig: network-based discovery from cancer genomes

Methods that integrate molecular network information and tumor genome data could complement gene-based statistical tests to identify likely new cancer genes; but such approaches are challenging to validate at scale, and their predictive value remains unclear. We developed a robust statistic (NetSig) that integrates protein interaction networks with data from 4,742 tumor exomes. NetSig can accurately classify known driver genes in 60% of tested tumor types and predicts 62 new driver candidates. Using a quantitative experimental framework to determine in vivo tumorigenic potential in mice, we found that NetSig candidates induce tumors at rates that are comparable to those of known oncogenes and are ten-fold higher than those of random genes. By reanalyzing nine tumor-inducing NetSig candidates in 242 patients with oncogene-negative lung adenocarcinomas, we find that two (AKT2 and TFDP2) are significantly amplified. Our study presents a scalable integrated computational and experimental workflow to expand discovery from cancer genomes.

[1]  Steve D. M. Brown,et al.  Defective myosin VIIA gene responsible for Usher syndrome type IB , 1995, Nature.

[2]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[3]  Anthony E. Reeve,et al.  E-cadherin germline mutations in familial gastric cancer , 1998, Nature.

[4]  U. Wolfrum,et al.  Vezatin, a novel transmembrane protein, bridges myosin VIIA to the cadherin–catenins complex , 2000, The EMBO journal.

[5]  C. Baird,et al.  The pilot study. , 2000, Orthopedic nursing.

[6]  M. Maniak,et al.  Cell adhesion: Ushering in a new understanding of myosin VII , 2001, Current Biology.

[7]  Toshio Kuroki,et al.  Role of Smad4 (DPC4) inactivation in human cancer. , 2003, Biochemical and biophysical research communications.

[8]  Min Wu,et al.  Expression of cell cycle/growth regulator genes in human hepatocellular carcinoma and adjacent normal liver tissues. , 2003, Oncology reports.

[9]  C. Hill,et al.  Molecular and functional consequences of Smad4 C-terminal missense mutations in colorectal tumour cells. , 2004, The Biochemical journal.

[10]  R. Bast,et al.  Biochemistry and biology of ARHI (DIRAS3), an imprinted tumor suppressor gene whose expression is lost in ovarian and breast cancers. , 2006, Methods in enzymology.

[11]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[12]  Eric S. Lander,et al.  Integrative Genomic Approaches Identify IKBKE as a Breast Cancer Oncogene , 2007, Cell.

[13]  Francisco S. Roque,et al.  A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes , 2008, Proceedings of the National Academy of Sciences.

[14]  C. Yeang,et al.  Combinatorial patterns of somatic gene mutations in cancer , 2008, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[15]  C. Sander,et al.  Functional Copy-Number Alterations in Cancer , 2008, PloS one.

[16]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[17]  Christopher A. Miller,et al.  Discovering functional modules by identifying recurrent and mutually exclusive mutational patterns in tumors , 2011, BMC Medical Genomics.

[18]  M. Daly,et al.  Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology , 2011, PLoS genetics.

[19]  G. Getz,et al.  GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers , 2011, Genome Biology.

[20]  Lee T. Sam,et al.  Personalized Oncology Through Integrative High-Throughput Sequencing: A Pilot Study , 2011, Science Translational Medicine.

[21]  T. Schlomm,et al.  Transcription alterations of members of the ubiquitin–proteasome network in prostate carcinoma , 2011, Prostate Cancer and Prostatic Diseases.

[22]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[23]  D. Bar-Sagi,et al.  RAS oncogenes: weaving a tumorigenic web , 2011, Nature Reviews Cancer.

[24]  T. Nakazawa,et al.  Transcription factor Runx2 is a regulator of epithelial–mesenchymal transition and invasion in thyroid carcinomas , 2012, Laboratory Investigation.

[25]  A. Gonzalez-Perez,et al.  Functional impact bias reveals cancer drivers , 2012, Nucleic acids research.

[26]  Nikhil Wagle,et al.  High-throughput detection of actionable genomic alterations in clinical tumor samples by targeted, massively parallel sequencing. , 2012, Cancer discovery.

[27]  Martha L Slattery,et al.  MAP kinase genes and colon and rectal cancer. , 2012, Carcinogenesis.

[28]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[29]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[30]  D. Bae,et al.  Identification of differentially expressed genes according to chemosensitivity in advanced ovarian serous adenocarcinomas: expression of GRIA2 predicts better survival , 2012, British Journal of Cancer.

[31]  Eric S. Lander,et al.  Discovery and prioritization of somatic mutations in diffuse large B-cell lymphoma (DLBCL) by whole-exome sequencing , 2012, Proceedings of the National Academy of Sciences.

[32]  Takashi Suzuki,et al.  Runt‐related transcription factor 2 in human colon carcinoma: A potent prognostic factor associated with estrogen receptor , 2012, International journal of cancer.

[33]  Angela N. Brooks,et al.  Mapping the Hallmarks of Lung Adenocarcinoma with Massively Parallel Sequencing , 2012, Cell.

[34]  Gabor T. Marth,et al.  Integrative Annotation of Variants from 1092 Humans: Application to Cancer Genomics , 2013, Science.

[35]  Andrew M. Gross,et al.  Network-based stratification of tumor mutations , 2013, Nature Methods.

[36]  J. Mesirov,et al.  β-Catenin-Driven Cancers Require a YAP1 Transcriptional Complex for Survival and Tumorigenesis , 2013, Cell.

[37]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[38]  Alex M. Fichtenholtz,et al.  Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing , 2013, Nature Biotechnology.

[39]  E. Lander,et al.  Lessons from the Cancer Genome , 2013, Cell.

[40]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[41]  X. Miao,et al.  NT5E and FcGBP as key regulators of TGF-1-induced epithelial–mesenchymal transition (EMT) are associated with tumor progression and survival of patients with gallbladder cancer , 2013, Cell and Tissue Research.

[42]  David Tamborero,et al.  OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes , 2013, Bioinform..

[43]  D. Beer,et al.  Inactivation of SAG/RBX2 E3 ubiquitin ligase suppresses KrasG12D-driven lung tumorigenesis. , 2014, The Journal of clinical investigation.

[44]  C. Rommel,et al.  PI3K and cancer: lessons, challenges and opportunities , 2014, Nature Reviews Drug Discovery.

[45]  S. Gabriel,et al.  Somatic ERCC2 mutations correlate with cisplatin sensitivity in muscle-invasive urothelial carcinoma. , 2014, Cancer discovery.

[46]  Steven J. M. Jones,et al.  Comprehensive molecular profiling of lung adenocarcinoma , 2014, Nature.

[47]  Peilin Jia,et al.  VarWalker: Personalized Mutation Network Analysis of Putative Cancer Genes from Next-Generation Sequencing Data , 2014, PLoS Comput. Biol..

[48]  S. Gabriel,et al.  Discovery and saturation analysis of cancer genes across 21 tumor types , 2014, Nature.

[49]  W. Hahn,et al.  In vivo multiplexed interrogation of amplified genes identifies GAB2 as an ovarian cancer oncogene , 2014, Proceedings of the National Academy of Sciences.

[50]  Steven J. M. Jones,et al.  Comprehensive molecular profiling of lung adenocarcinoma , 2014, Nature.

[51]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[52]  Pradip De,et al.  MYC-xing it up with PIK3CA mutation and resistance to PI3K inhibitors: summit of two giants in breast cancers. , 2015, American journal of cancer research.

[53]  S. Cook,et al.  Identification of DYRK1B as a substrate of ERK1/2 and characterisation of the kinase activity of DYRK1B mutants from cancer and metabolic syndrome , 2015, Cellular and Molecular Life Sciences.

[54]  Lawrence D. True,et al.  Integrative Clinical Genomics of Advanced Prostate Cancer , 2015, Cell.

[55]  C. Sander,et al.  Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations , 2014, Genome Biology.

[56]  Todd R. Golub,et al.  An ecosystem of cancer cell line factories to support a cancer dependency map , 2015, Nature Reviews Genetics.

[57]  Gary D Bader,et al.  Pathway and network analysis of cancer genomes , 2015, Nature Methods.

[58]  Pablo Tamayo,et al.  Systematic Functional Interrogation of Rare Cancer Variants Identifies Oncogenic Alleles. , 2016, Cancer discovery.

[59]  S. K. Zaidi,et al.  Thyroid Hormone Receptor-β (TRβ) Mediates Runt-Related Transcription Factor 2 (Runx2) Expression in Thyroid Cancer Cells: A Novel Signaling Pathway in Thyroid Cancer. , 2016, Endocrinology.

[60]  Angela N. Brooks,et al.  High-throughput Phenotyping of Lung Cancer Somatic Mutations. , 2016, Cancer cell.

[61]  Jianxing He,et al.  Identification of TRPCs genetic variants that modify risk for lung cancer based on the pathway and two-stage study , 2016, Meta gene.

[62]  Chandra Sekhar Pedamallu,et al.  Distinct patterns of somatic genome alterations in lung adenocarcinomas and squamous cell carcinomas , 2016, Nature Genetics.

[63]  F. Supek,et al.  MUFFINN: cancer gene discovery via network analysis of somatic mutation data , 2016, Genome Biology.

[64]  Daniel Marbach,et al.  Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases , 2016, Nature Methods.

[65]  S. Brunak,et al.  A scored human protein–protein interaction network to catalyze genomic interpretation , 2017, Nature Methods.

[66]  S. Chakrabarti,et al.  Integrative genomic and network analysis identified novel genes associated with the development of advanced cervical squamous cell carcinoma. , 2017, Biochimica et biophysica acta. General subjects.