ModulOmics: Integrating Multi-Omics Data to Identify Cancer Driver Modules

The identification of molecular pathways driving cancer progression is a fundamental unsolved problem in tumorigenesis, which can substantially further our understanding of cancer mechanisms and inform the development of targeted therapies. Most current approaches to address this problem use primarily somatic mutations, not fully exploiting additional layers of biological information. Here, we describe ModulOmics, a method to de novo identify cancer driver pathways, or modules, by integrating multiple data types (protein-protein interactions, mutual exclusivity of mutations or copy number alterations, transcriptional co-regulation, and RNA co-expression) into a single probabilistic model. To efficiently search the exponential space of candidate modules, ModulOmics employs a two-step optimization procedure that combines integer linear programming with stochastic search. Across several cancer types, ModulOmics identifies highly functionally connected modules enriched with cancer driver genes, outperforming state-of-the-art methods. For breast cancer subtypes, the inferred modules recapitulate known molecular mechanisms and suggest novel subtype-specific functionalities. These findings are supported by an independent patient cohort, as well as independent proteomic and phosphoproteomic datasets.

[1]  Jack Kuipers,et al.  pathTiMEx: Joint Inference of Mutually Exclusive Cancer Pathways and Their Progression Dynamics , 2017, J. Comput. Biol..

[2]  G. Konecny,et al.  PARP inhibitors for BRCA1/2-mutated and sporadic ovarian cancer: current practice and future directions , 2016, British Journal of Cancer.

[3]  Sourav Bandyopadhyay,et al.  Challenges in identifying cancer genes by analysis of exome sequencing data , 2016, Nature Communications.

[4]  Jack Kuipers,et al.  pathTiMEx: Joint Inference of Mutually Exclusive Cancer Pathways and Their Dependencies in Tumor Progression , 2016, RECOMB.

[5]  Eytan Ruppin,et al.  System-wide Clinical Proteomics of Breast Cancer Reveals Global Remodeling of Tissue Homeostasis. , 2016, Cell systems.

[6]  M. Mann,et al.  Proteomic maps of breast cancer subtypes , 2016, Nature Communications.

[7]  Francesca D. Ciccarelli,et al.  NCG 5.0: updates of a manually curated repository of cancer genes and associated properties from cancer mutational screenings , 2015, Nucleic Acids Res..

[8]  Teresa M. Przytycka,et al.  MEMCover: integrated analysis of mutual exclusivity and functional network reveals dysregulated pathways across multiple cancer types , 2015, Bioinform..

[9]  Jung Eun Shim,et al.  TRRUST: a reference database of human transcriptional regulatory interactions , 2015, Scientific Reports.

[10]  Niko Beerenwinkel,et al.  TiMEx: a waiting time model for mutually exclusive cancer alterations , 2015, Bioinform..

[11]  Janos X. Binder,et al.  DISEASES: Text mining and data integration of disease–gene associations , 2014, bioRxiv.

[12]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[13]  P. Campbell,et al.  Combining gene mutation with gene expression data improves outcome prediction in myelodysplastic syndromes , 2015, Nature Communications.

[14]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[15]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[16]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[17]  C. Sander,et al.  Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations , 2014, Genome Biology.

[18]  Janos X. Binder,et al.  DISEASES: Text mining and data integration of disease–gene associations , 2014, bioRxiv.

[19]  Jack P. Hou,et al.  DawnRank: discovering personalized driver genes in cancer , 2014, Genome Medicine.

[20]  Yan Cheng,et al.  XPO1 (CRM1) Inhibition Represses STAT3 Activation to Drive a Survivin-Dependent Oncogenic Switch in Triple-Negative Breast Cancer , 2014, Molecular Cancer Therapeutics.

[21]  S. Elledge,et al.  Cumulative Haploinsufficiency and Triplosensitivity Drive Aneuploidy Patterns and Shape the Cancer Genome , 2013, Cell.

[22]  David Haussler,et al.  Discovering causal pathways linking genomic events to transcriptional states using Tied Diffusion Through Interacting Events (TieDIE) , 2013, Bioinform..

[23]  Roded Sharan,et al.  Simultaneous Identification of Multiple Driver Pathways in Cancer , 2013, PLoS Comput. Biol..

[24]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[25]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[26]  Takashi Kohno,et al.  CBP and p300 Histone Acetyltransferases Contribute to Homologous Recombination by Transcriptionally Activating the BRCA1 and RAD51 Genes , 2012, PloS one.

[27]  Alfonso Valencia,et al.  EnrichNet: network-based gene set enrichment analysis , 2012, Bioinform..

[28]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[29]  P. Fink,et al.  Dietary Essential Amino Acids Affect the Reproduction of the Keystone Herbivore Daphnia pulex , 2011, PloS one.

[30]  Sven Diederichs,et al.  The hallmarks of cancer , 2012, RNA biology.

[31]  Lynda Chin,et al.  Preclinical evaluation of selective inhibitors of nuclear export (SINE) in basal-like breast cancer (BLBC). , 2012 .

[32]  P. Pandolfi,et al.  The functions and regulation of the PTEN tumour suppressor , 2012, Nature Reviews Molecular Cell Biology.

[33]  Benjamin E. Gross,et al.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. , 2012, Cancer discovery.

[34]  F. Markowetz,et al.  The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups , 2012, Nature.

[35]  C. Feschotte,et al.  Endogenous viruses: insights into viral evolution and impact on host biology , 2012, Nature Reviews Genetics.

[36]  Martin H. Schaefer,et al.  HIPPIE: Integrating Protein Interaction Networks with Experiment Based Quality Scores , 2012, PloS one.

[37]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[38]  C. Cole,et al.  COSMIC: the catalogue of somatic mutations in cancer , 2011, Genome Biology.

[39]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[40]  Hans Clevers,et al.  Distinct gene mutation profiles among luminal-type and basal-type breast cancer cell lines , 2010, Breast Cancer Research and Treatment.

[41]  Mingming Jia,et al.  COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer , 2009, Nucleic Acids Res..

[42]  A. Nobel,et al.  Supervised risk predictor of breast cancer based on intrinsic subtypes. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[43]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[44]  F. Schmitt,et al.  EGFR/HER2 in breast cancer: a biological approach for molecular diagnosis and therapy , 2008, Expert review of molecular diagnostics.

[45]  H. Mewes,et al.  CORUM: the comprehensive resource of mammalian protein complexes—2009 , 2007, Nucleic Acids Res..

[46]  A. Ashworth,et al.  BRCA1 dysfunction in sporadic basal-like breast cancer , 2007, Oncogene.

[47]  J. Reis-Filho,et al.  Basal-like breast cancer and the BRCA1 phenotype , 2006, Oncogene.

[48]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[49]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Hanina Hibshoosh,et al.  PIK3CA mutations correlate with hormone receptors, node metastasis, and ERBB2, and are mutually exclusive with PTEN loss in human breast carcinoma. , 2005, Cancer research.

[51]  P. Dessen,et al.  An Atlas on genes and chromosomes in oncology and haematology. , 2004, Cellular and molecular biology.

[52]  B. Henderson,et al.  Regulation of tumor suppressors by nuclear-cytoplasmic shuttling. , 2003, Experimental cell research.

[53]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[54]  G. Berx,et al.  The E-cadherin/catenin complex: an important gatekeeper in breast cancer tumorigenesis and malignant progression , 2001, Breast Cancer Research.

[55]  Christian A. Rees,et al.  Molecular portraits of human breast tumours , 2000, Nature.

[56]  H. Ruffner,et al.  CBP/p300 interact with and function as transcriptional coactivators of BRCA1. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[57]  A. Levine,et al.  Nuclear Export Is Required for Degradation of Endogenous p53 by MDM2 and Human Papillomavirus E6 , 1998, Molecular and Cellular Biology.

[58]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[59]  Amy E. Hawkins,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2009 .

[60]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[61]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[62]  K. Lund,et al.  Production of compartmented cultures of rat sympathetic neurons , 2009 .