Detection of Driver Modules with Rarely Mutated Genes in Cancers

Identifying driver modules or pathways is a key challenge to interpret the molecular mechanisms and pathogenesis underlying cancer. An increasing number of studies suggest that rarely mutated genes are important for the development of cancer. However, the driver modules consisting of mutated genes with low-frequency driver mutations are not well characterized. To identify driver modules with rarely mutated genes, we propose a functional similarity index to quantify the functional relationship between rarely mutated genes and other ones in the same module. Then, we develop a method to detect Driver Modules with Rarely mutated Genes (DMRG) by incorporating the functional similarity, coverage and mutual exclusivity. By applying DMRG on TCGA cancer dataset on three networks: HINT+HI2012, iRefIndex and MultiNet, we detect driver modules intersecting with the well-known signalling pathways and protein complexes, such as the cell cycle pathway and the mediator complex. DMRG can also detect driver modules effectively with 20, 40, 60 and 80 percent of samples by random selection. When compared with HotNet2, DMRG detects more rarely mutated cancer genes and has higher pathway enrichment. Overall, DMRG provides an effective method for the identification of driver modules with rarely mutated genes.

[1]  Hing Y Leung,et al.  Upregulation and Nuclear Recruitment of HDAC1 in Hormone Refractory Prostate Cancer , 2004, The Prostate.

[2]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[3]  Ian M. Donaldson,et al.  iRefIndex: A consolidated protein interaction database with provenance , 2008, BMC Bioinformatics.

[4]  David E Hill,et al.  next-generation sequencing to generate interactome datasets , 2011 .

[5]  Matthew B. Callaway,et al.  MuSiC: Identifying mutational significance in cancer genomes , 2012, Genome research.

[6]  C. Sander,et al.  Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations , 2014, Genome Biology.

[7]  Ao Li,et al.  DGPathinter: a novel model for identifying driver genes via knowledge-driven matrix factorization with prior knowledge from interactome and pathways , 2017, PeerJ Comput. Sci..

[8]  M. Garabedian,et al.  MED14 and MED1 differentially regulate target-specific gene activation by the glucocorticoid receptor. , 2006, Molecular endocrinology.

[9]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[10]  Christine Clayton,et al.  The Trypanosoma brucei CCCH zinc finger proteins ZC3H12 and ZC3H13. , 2012, Molecular and biochemical parasitology.

[11]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[12]  Christian Ruiz,et al.  MED15, encoding a subunit of the mediator complex, is overexpressed at high frequency in castration‐resistant prostate cancer , 2014, International journal of cancer.

[13]  Xing-Ming Zhao,et al.  Integrative analysis of mutational and transcriptional profiles reveals driver mutations of metastatic breast cancers , 2016, Cell Discovery.

[14]  G. Abecasis,et al.  Rare-variant association analysis: study designs and statistical tests. , 2014, American journal of human genetics.

[15]  R. Schiff,et al.  ESR1 mutations—a mechanism for acquired endocrine resistance in breast cancer , 2015, Nature Reviews Clinical Oncology.

[16]  Dylan J. Taatjes,et al.  The Mediator complex: a central integrator of transcription , 2015, Nature Reviews Molecular Cell Biology.

[17]  Shi-Hua Zhang,et al.  Efficient methods for identifying mutated driver pathways in cancer , 2012, Bioinform..

[18]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[19]  R. Conaway,et al.  Function and regulation of the Mediator complex. , 2011, Current opinion in genetics & development.

[20]  G. Getz,et al.  GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers , 2011, Genome Biology.

[21]  Roded Sharan,et al.  Simultaneous Identification of Multiple Driver Pathways in Cancer , 2013, PLoS Comput. Biol..

[22]  Ao Li,et al.  A novel network regularized matrix decomposition method to detect mutated cancer genes in tumour samples with inter-patient heterogeneity , 2017, Scientific Reports.

[23]  W. McGuire,et al.  Human breast cancer: correlation of relapse and survival with amplification of the HER-2/neu oncogene. , 1987, Science.

[24]  J. Mesirov,et al.  The Molecular Signatures Database Hallmark Gene Set Collection , 2015 .

[25]  D. Goldstein,et al.  Uncovering the roles of rare variants in common disease through whole-genome sequencing , 2010, Nature Reviews Genetics.

[26]  D. Eisenberg,et al.  Use of Logic Relationships to Decipher Protein Network Organization , 2004, Science.

[27]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[28]  Juancarlos Chan,et al.  Gene Ontology Consortium: going forward , 2014, Nucleic Acids Res..

[29]  Mingming Jia,et al.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer , 2014, Nucleic Acids Res..

[30]  N. Kim,et al.  Mediator and human disease. , 2011, Seminars in cell & developmental biology.

[31]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[32]  Li Ding,et al.  Patterns and functional implications of rare germline variants across 12 cancer types , 2015, Nature Communications.

[33]  Mark Gerstein,et al.  Interpretation of Genomic Variants Using a Unified Biological Network Approach , 2013, PLoS Comput. Biol..

[34]  Ao Li,et al.  Discovering potential driver genes through an integrated model of somatic mutation profiles and gene functional information. , 2017, Molecular bioSystems.

[35]  K. Frazer,et al.  Human genetic variation and its contribution to complex traits , 2009, Nature Reviews Genetics.

[36]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[37]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[38]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[39]  Dylan J. Taatjes,et al.  The Mediator complex and transcription regulation , 2013, Critical reviews in biochemistry and molecular biology.

[40]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[41]  Hua Yu,et al.  STATs in cancer inflammation and immunity: a leading role for STAT3 , 2009, Nature Reviews Cancer.

[42]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[44]  C I Amos,et al.  Evolutionary evidence of the effect of rare variants on disease etiology , 2011, Clinical genetics.

[45]  Haiyuan Yu,et al.  HINT: High-quality protein interactomes and their applications in understanding human disease , 2012, BMC Systems Biology.

[46]  Shihua Zhang,et al.  Discovery of cancer common and specific driver gene sets , 2016, Nucleic acids research.

[47]  Lin Gao,et al.  Detection of driver pathways using mutated gene network in cancer. , 2016, Molecular bioSystems.

[48]  D. Hanahan,et al.  The Hallmarks of Cancer , 2000, Cell.

[49]  Benjamin J. Raphael,et al.  Network propagation: a universal amplifier of genetic associations , 2017, Nature Reviews Genetics.

[50]  D. Busam,et al.  SMAD2, SMAD3 and SMAD4 mutations in colorectal cancer. , 2013, Cancer research.

[51]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer-associated genes , 2013 .

[52]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[53]  M. Daly,et al.  Searching for missing heritability: Designing rare variant association studies , 2014, Proceedings of the National Academy of Sciences.

[54]  Junhua Zhang,et al.  The Discovery of Mutated Driver Pathways in Cancer: Models and Algorithms , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[55]  G van der Pluijm,et al.  Smad2 and Smad3 have opposing roles in breast cancer bone metastasis by differentially affecting tumor angiogenesis , 2010, Oncogene.

[56]  Javed Siddiqui,et al.  Activating ESR1 mutations in hormone-resistant metastatic breast cancer , 2013, Nature Genetics.

[57]  Niko Beerenwinkel,et al.  Computational approaches for the identification of cancer genes and pathways , 2016, Wiley interdisciplinary reviews. Systems biology and medicine.

[58]  Niko Beerenwinkel,et al.  Modeling Mutual Exclusivity of Cancer Mutations , 2014, RECOMB.

[59]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[60]  Christopher Oldmeadow,et al.  Combined analysis of exon splicing and genome wide polymorphism data predict schizophrenia risk loci. , 2014, Journal of psychiatric research.

[61]  A. Clark,et al.  Recent Explosive Human Population Growth Has Resulted in an Excess of Rare Genetic Variants , 2012, Science.

[62]  Shi-Hua Zhang,et al.  Identification of mutated core cancer modules by integrating somatic mutation, copy number variation, and gene expression data , 2013, BMC Systems Biology.

[63]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[64]  Hua Yu,et al.  Revisiting STAT3 signalling in cancer: new and unexpected biological functions , 2014, Nature Reviews Cancer.

[65]  Teresa M. Przytycka,et al.  MEMCover: integrated analysis of mutual exclusivity and functional network reveals dysregulated pathways across multiple cancer types , 2015, Bioinform..

[66]  Eli Upfal,et al.  De Novo Discovery of Mutated Driver Pathways in Cancer , 2011, RECOMB.