An information theoretic method to identify combinations of genomic alterations that promote glioblastoma.

Tumors result from accumulated genomic alterations that cooperate synergistically to produce uncontrolled cell growth. Identifying recurrent alterations across large collections of tumors pinpoints genes that confer a selective advantage in oncogenesis and progression, but it does not address the genetic interactions behind this selection process. A non-random pattern of co-mutated genes is evidence of selective forces acting on tumor cells that harbor combinations of these genetic alterations. Although existing methods have successfully identified mutually exclusive gene sets, no current method can systematically discover more general genetic relationships. We develop Genomic Alteration Modules using Total Correlation (GAMToC), an information-theoretic framework that integrates copy number and mutation data to identify gene modules with any non-random pattern of joint alteration. Additionally, we present the Seed-GAMToC procedure, which uncovers the mutational context of any putative cancer gene. The software is publicly available. Applied to glioblastoma multiforme samples, GAMToC identifies distinct subsets of co-occurring mutations, suggesting distinct mutational routes to cancer and providing new insight into mutations associated with the proneural, proneural/G-CIMP, and classical types of the disease. The results recapitulate known relationships such as mutually exclusive mutations, place these alterations in the context of other mutations, and reveal more complex relationships such as conditional mutual exclusivity.
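To make the central quantity concrete, the following minimal sketch computes a plug-in estimate of total correlation (the sum of the marginal entropies minus the joint entropy) for a small set of genes from binary alteration calls. It illustrates the general measure only and is not the GAMToC implementation: the function names, the input layout (one tuple of 0/1 states per tumor), and the absence of any significance testing or finite-sample correction are assumptions made here for illustration.

```python
from collections import Counter
from math import log2


def entropy(counts, n):
    """Plug-in Shannon entropy (in bits) from a collection of counts summing to n."""
    return -sum((c / n) * log2(c / n) for c in counts if c > 0)


def total_correlation(samples):
    """Total correlation of a gene set, in bits.

    samples: list of equal-length tuples, one per tumor; each entry is the
    0/1 alteration state of one gene (hypothetical input layout).
    """
    n = len(samples)
    if n == 0:
        raise ValueError("at least one sample is required")
    k = len(samples[0])
    # Joint entropy over the full alteration pattern of the gene set.
    joint = entropy(Counter(samples).values(), n)
    # Sum of the marginal entropies of the individual genes.
    marginals = sum(
        entropy(Counter(s[i] for s in samples).values(), n) for i in range(k)
    )
    return marginals - joint


# Toy data (invented for illustration, not taken from the study).
mutually_exclusive = [(1, 0), (0, 1), (1, 0), (0, 1)]
co_occurring = [(1, 1), (0, 0), (1, 1), (0, 0)]
independent = [(0, 0), (0, 1), (1, 0), (1, 1)]
```

In this toy data, the mutually exclusive and the co-occurring gene pairs both score high while the independent pair scores zero, which is the sense in which a total-correlation score responds to any non-random pattern of joint alteration rather than to mutual exclusivity alone.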
