Identification of driver genes based on gene mutational effects and network centrality

Background As one of the deadliest diseases in the world, cancer is driven by a few somatic mutations that disrupt the normal growth of cells, and leads to abnormal proliferation and tumor development. The vast majority of somatic mutations did not affect the occurrence and development of cancer; thus, identifying the mutations responsible for tumor occurrence and development is one of the main targets of current cancer treatments. Results To effectively identify driver genes, we adopted a semi-local centrality measure and gene mutation effect function to assess the effect of gene mutations on changes in gene expression patterns. Firstly, we calculated the mutation score for each gene. Secondly, we identified differentially expressed genes (DEGs) in the cohort by comparing the expression profiles of tumor samples and normal samples, and then constructed a local network for each mutation gene using DEGs and mutant genes according to the protein–protein interaction network. Finally, we calculated the score of each mutant gene according to the objective function. The top-ranking mutant genes were selected as driver genes. We name the proposed method as mutations effect and network centrality. Conclusions Four types of cancer data in The Cancer Genome Atlas were tested. The experimental data proved that our method was superior to the existing network-centric method, as it was able to quickly and easily identify driver genes and rare driver factors.

[1]  Xing-Ming Zhao,et al.  Identifying cancer-related microRNAs based on gene expression data , 2015, Bioinform..

[2]  K. Aihara,et al.  Personalized characterization of diseases using sample-specific networks , 2016, bioRxiv.

[3]  David Haussler,et al.  PARADIGM-SHIFT predicts the function of mutations in multiple cancers using pathway impact analysis , 2012, Bioinform..

[4]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[5]  J. P. Hou,et al.  DawnRank: discovering personalized driver genes in cancer , 2014, Genome Medicine.

[6]  Kern Rei Chng,et al.  Patient-specific driver gene prediction and risk assessment through integrated network analysis of cancer omics profiles , 2015, Nucleic acids research.

[7]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[8]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[9]  H. Carter,et al.  Identifying Mendelian disease genes with the Variant Effect Scoring Tool , 2013, BMC Genomics.

[10]  R. Shamir,et al.  PRODIGY: personalized prioritization of driver genes , 2018, bioRxiv.

[11]  M. Guyer,et al.  Charting a course for genomic medicine from base pairs to bedside , 2011, Nature.

[12]  J. Soulier,et al.  Copy-number analysis identified new prognostic marker in acute myeloid leukemia , 2017, Leukemia.

[13]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[14]  Tom R. Gaunt,et al.  Predicting the functional consequences of cancer-associated amino acid substitutions , 2013, Bioinform..

[15]  Dennis C. Friedrich,et al.  Whole-exome sequencing and clinical interpretation of formalin-fixed , paraffin-embedded tumor samples to guide precision cancer medicine , 2014 .

[16]  Kyusam Choi,et al.  The expression of phospho-AKT1 and phospho-MTOR is associated with a favorable prognosis independent of PTEN expression in intrahepatic cholangiocarcinomas , 2012, Modern Pathology.

[17]  Hyunju Lee,et al.  DEOD: uncovering dominant effects of cancer-driver genes based on a partial covariance selection method , 2015, Bioinform..

[18]  Matthew B. Callaway,et al.  MuSiC: Identifying mutational significance in cancer genomes , 2012, Genome research.

[19]  M. Frame,et al.  Src in cancer: deregulation and consequences for cell behaviour. , 2002, Biochimica et biophysica acta.

[20]  A. Sivachenko,et al.  Sequence analysis of mutations and translocations across breast cancer subtypes , 2012, Nature.

[21]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[22]  M. Stratton,et al.  The cancer genome , 2009, Nature.

[23]  Thomas Sauerwald,et al.  HIT'nDRIVE: patient-specific multidriver gene prioritization for precision oncology , 2017, Genome research.

[24]  A. Sparks,et al.  The Genomic Landscapes of Human Breast and Colorectal Cancers , 2007, Science.

[25]  R. Salgia,et al.  Fyn , 2010, Cancer.

[26]  Hannah Carter,et al.  CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer , 2011, Bioinform..

[27]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[28]  Guojun Li,et al.  MaxMIF: A New Method for Identifying Cancer Driver Genes through Effective Data Integration , 2018, Advanced science.

[29]  S. Gabriel,et al.  Discovery and saturation analysis of cancer genes across 21 tumor types , 2014, Nature.

[30]  M. Stratton Journeys into the genome of cancer cells , 2013, EMBO molecular medicine.

[31]  Shi-Hua Zhang,et al.  Identification of mutated core cancer modules by integrating somatic mutation, copy number variation, and gene expression data , 2013, BMC Systems Biology.

[32]  Emidio Capriotti,et al.  Bioinformatics for personal genome interpretation , 2012, Briefings Bioinform..

[33]  Tom R. Gaunt,et al.  Predicting the Functional, Molecular, and Phenotypic Consequences of Amino Acid Substitutions using Hidden Markov Models , 2012, Human mutation.

[34]  Huanbin Wang,et al.  Somatic gene copy number alterations in colorectal cancer: new quest for cancer drivers and biomarkers , 2016, Oncogene.

[35]  C. Onodera,et al.  Genomic Analysis of Pigmented Epithelioid Melanocytomas Reveals Recurrent Alterations in PRKAR1A, and PRKCA Genes , 2017, The American journal of surgical pathology.

[36]  Jie Xu,et al.  Gastric cancer and gene copy number variation: emerging cancer drivers for targeted therapy , 2016, Oncogene.

[37]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[38]  R. Salgia,et al.  Fyn: a novel molecular target in cancer. , 2010, Cancer.

[39]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[40]  Luonan Chen,et al.  Biomolecular Networks: Methods and Applications in Systems Biology , 2009 .

[41]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[42]  P. Bork,et al.  GEAR: A database of Genomic Elements Associated with drug Resistance , 2017, Scientific Reports.

[43]  Peilin Jia,et al.  VarWalker: Personalized Mutation Network Analysis of Putative Cancer Genes from Next-Generation Sequencing Data , 2014, PLoS Comput. Biol..

[44]  X. Hua,et al.  DriverML: a machine learning algorithm for identifying driver genes in cancer sequencing studies , 2019, Nucleic acids research.

[45]  Xing-Ming Zhao,et al.  Integrative analysis of mutational and transcriptional profiles reveals driver mutations of metastatic breast cancers , 2016, Cell Discovery.

[46]  S. Brunak,et al.  Network biology concepts in complex disease comorbidities , 2016, Nature Reviews Genetics.

[47]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[48]  Xing-Ming Zhao,et al.  Identifying Disease Associated miRNAs Based on Protein Domains , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[49]  Yicheng Zhang,et al.  Identifying influential nodes in complex networks , 2012 .

[50]  G. Parmigiani,et al.  Core Signaling Pathways in Human Pancreatic Cancers Revealed by Global Genomic Analyses , 2008, Science.

[51]  Brian H. Dunford-Shore,et al.  Somatic mutations affect key pathways in lung adenocarcinoma , 2008, Nature.

[52]  Lei Zhang,et al.  Discovering personalized driver mutation profiles of single samples in cancer by network control strategy , 2018, Bioinform..

[53]  David Tamborero,et al.  OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes , 2013, Bioinform..

[54]  Leyla Isik,et al.  Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations. , 2009, Cancer research.

[55]  G. Mills,et al.  CanDrA: Cancer-Specific Driver Missense Mutation Annotation with Optimized Features , 2013, PloS one.

[56]  Chuang Liu,et al.  A Gene Gravity Model for the Evolution of Cancer Genomes: A Study of 3,000 Cancer Genomes across 9 Cancer Types , 2015, PLoS Comput. Biol..

[57]  Xing-Ming Zhao,et al.  CSTEA: a webserver for the Cell State Transition Expression Atlas , 2017, Nucleic Acids Res..

[58]  Adam Kiezun,et al.  Whole-exome sequencing and clinical interpretation of FFPE tumor samples to guide precision cancer medicine , 2013, Nature Medicine.

[59]  Junfeng Xia,et al.  LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network , 2016, BMC Bioinformatics.

[60]  R. Bernards,et al.  Unresponsiveness of colon cancer to BRAF(V600E) inhibition through feedback activation of EGFR , 2012, Nature.

[61]  Hannah Carter,et al.  CRAVAT: cancer-related analysis of variants toolkit , 2013, Bioinform..

[62]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[63]  Joshua F. McMichael,et al.  DGIdb - Mining the druggable genome , 2013, Nature Methods.

[64]  Tao Zeng,et al.  A novel network control model for identifying personalized driver genes in cancer , 2019, PLoS Comput. Biol..

[65]  F. Supek,et al.  MUFFINN: cancer gene discovery via network analysis of somatic mutation data , 2016, Genome Biology.

[66]  klaguia International Network of Cancer Genome Projects , 2010 .